b8609

llama.cpp Releasesby github-actions[bot]April 1, 20261 min read0 views

CUDA: Add Flash Attention Support for Head Dimension 512 ( #20998 ) flash attention support for head dimension 512 added FA D=512 - match 576 configs, limit ncols2, revert vec cap fix HIP tile kernel build for D=512 fix HIP tile kernel occupancy for D=512 on AMD Apply suggestions from code review Co-authored-by: Johannes Gäßler [email protected] fix tile FA compilation Co-authored-by: Johannes Gäßler [email protected] macOS/iOS: macOS Apple Silicon (arm64) macOS Intel (x64) iOS XCFramework Linux: Ubuntu x64 (CPU) Ubuntu arm64 (CPU) Ubuntu s390x (CPU) Ubuntu x64 (Vulkan) Ubuntu arm64 (Vulkan) Ubuntu x64 (ROCm 7.2) Ubuntu x64 (OpenVINO) Windows: Windows x64 (CPU) Windows arm64 (CPU) Windows x64 (CUDA 12) - CUDA 12.4 DLLs Windows x64 (CUDA 13) - CUDA 13.1 DLLs Windows x64 (Vulkan) Windows x64 (

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Appearance settings

Original source

llama.cpp Releases

https://github.com/ggml-org/llama.cpp/releases/tag/b8609

Was this article helpful?

Ask AI about this article

Ready

Conversation starters

Ask anything about this article…

Daily AI Digest

Get the top 5 AI stories delivered to your inbox every morning.

More about

review

Market News

Global alliance develops $1 billion AI data centre network in Vietnam - Vietnam Investment Review - VIR

Global alliance develops $1 billion AI data centre network in Vietnam Vietnam Investment Review - VIR

Google News - AI Vietnam

1m3 months ago

Releases

From 5G to 6G: how AI is shaping Vietnam’s path to digital leadership - Vietnam Investment Review - VIR

From 5G to 6G: how AI is shaping Vietnam’s path to digital leadership Vietnam Investment Review - VIR

Google News - AI Vietnam

1mabout 2 months ago

Products

There are more AI health tools than ever—but how well do they work? - MIT Technology Review

There are more AI health tools than ever—but how well do they work? MIT Technology Review

GNews AI healthcare

1m3 days ago

Knowledge Map

TopicsEntitiesSource

Connected Articles — Knowledge Graph

This article is connected to other articles through shared AI topics and tags.

Knowledge Graph100 articles · 135 connections

Scroll to zoom · drag to pan · click to open

Discussion

No comments yet — be the first to share your thoughts!

More in Products

Products

Top 10 AI Tools Every Legal Professional in Uganda Should Know in 2025 - nucamp.co

Top 10 AI Tools Every Legal Professional in Uganda Should Know in 2025 nucamp.co

Google News - AI Uganda

1m7 months ago

ProductsLive

Cameras have quietly appeared in thousands of U.S. cities – now, their integration with AI is sounding alarms - Japan Today

Cameras have quietly appeared in thousands of U.S. cities – now, their integration with AI is sounding alarms Japan Today

GNews AI USA

1mabout 2 hours ago

ProductsLive

From MOUs to Markets: Transatlantic Deals Face Reality Test

Why transatlantic execution, not transatlantic symbolism, now matters to the electronics and semiconductor supply chain. The post From MOUs to Markets: Transatlantic Deals Face Reality Test appeared first on EE Times . ]]>

eetimes.com

1mabout 1 hour ago

ProductsFresh

RLDatix's Connected Healthcare Summit Draws 400+ Health System Leaders as Company Advances AI-Powered Patient Safety and Provider Performance Solutions - PR Newswire

RLDatix's Connected Healthcare Summit Draws 400+ Health System Leaders as Company Advances AI-Powered Patient Safety and Provider Performance Solutions PR Newswire

GNews AI healthcare

1mabout 5 hours ago