v0.20.1: Revert "enable flash attention for gemma4 (#15296)" (#15311)
This reverts commit c8e0878.
What's Changed
- bench: add prompt calibration, context size flag, and NumCtx reporting by @dhiltgen in #15158
- model/parsers: fix gemma4 arg parsing when quoted strings contain `"` by @drifkin in #15254
- ggml: skip cublasGemmBatchedEx during graph reservation by @jessegross in #15301
- gemma4: enable flash attention by @dhiltgen in #15296
- ggml: fix ROCm build for cublasGemmBatchedEx reserve wrapper by @jessegross in #15305
- model/parsers: rework gemma4 tool call handling by @drifkin in #15306
Full Changelog: v0.20.0...v0.20.1-rc2
