Live
Black Hat USADark ReadingBlack Hat AsiaAI BusinessIs cutting ‘please’ and ‘thank you’ when talking to ChatGPT better for the planet? An expert explains - The IndependentGoogle News: ChatGPTOpenAI CEO and CFO Diverge on IPO Timing - The InformationGoogle News: OpenAII built a faster alternative to cp and rsync — here's how it worksDEV CommunityThe Service Layer: Where Separate Components Become a SystemDEV Community🚀Playwright vs Selenium in 2026: The Ultimate Guide for Modern Test AutomationDEV CommunityBuilding a Decentralized Mesh Network in Rust — Lessons from the Global SouthDEV CommunitySocratic AI: how I learned formal grammars (and built a compiler) without losing control of what I was buildingDEV CommunityOpenAI Is Making Microsoft and Ashton Kutcher Incredibly Rich - inc.comGoogle News: OpenAIQodo vs Tabnine: AI Coding Assistants Compared (2026)DEV CommunityShielding Your LLMs: A Deep Dive into Prompt Injection & Jailbreak DefenseDEV CommunityI Connected 12 MCP Servers to Amazon Q. Here's What BrokeDEV CommunityHow to Publish a Power BI Report and Embed It on a WebsiteDEV CommunityBlack Hat USADark ReadingBlack Hat AsiaAI BusinessIs cutting ‘please’ and ‘thank you’ when talking to ChatGPT better for the planet? An expert explains - The IndependentGoogle News: ChatGPTOpenAI CEO and CFO Diverge on IPO Timing - The InformationGoogle News: OpenAII built a faster alternative to cp and rsync — here's how it worksDEV CommunityThe Service Layer: Where Separate Components Become a SystemDEV Community🚀Playwright vs Selenium in 2026: The Ultimate Guide for Modern Test AutomationDEV CommunityBuilding a Decentralized Mesh Network in Rust — Lessons from the Global SouthDEV CommunitySocratic AI: how I learned formal grammars (and built a compiler) without losing control of what I was buildingDEV CommunityOpenAI Is Making Microsoft and Ashton Kutcher Incredibly Rich - inc.comGoogle News: OpenAIQodo vs Tabnine: AI Coding Assistants Compared (2026)DEV CommunityShielding Your LLMs: A Deep Dive into Prompt Injection & Jailbreak DefenseDEV CommunityI Connected 12 MCP Servers to Amazon Q. Here's What BrokeDEV CommunityHow to Publish a Power BI Report and Embed It on a WebsiteDEV Community
AI NEWS HUBbyEIGENVECTOREigenvector

b8660

llama.cpp Releasesby ggml-orgApril 3, 20261 min read1 views
Source Quiz

ggml-webgpu: move from parameter buffer pool to single buffer with offsets ( #21278 ) Work towards removing bitcast Move rest of existing types over Add timeout back to wait and remove synchronous set_tensor/memset_tensor move to unpackf16 for wider compatibility cleanup Remove deadlock condition in free_bufs Start work on removing parameter buffer pools Simplify and optimize further simplify profile futures Fix stride Try using a single command buffer per batch formatting macOS/iOS: macOS Apple Silicon (arm64) macOS Intel (x64) iOS XCFramework Linux: Ubuntu x64 (CPU) Ubuntu arm64 (CPU) Ubuntu s390x (CPU) Ubuntu x64 (Vulkan) Ubuntu arm64 (Vulkan) Ubuntu x64 (ROCm 7.2) Ubuntu x64 (OpenVINO) Windows: Windows x64 (CPU) Windows arm64 (CPU) Windows x64 (CUDA 12) - CUDA 12.4 DLLs Windows x64 (CU

ggml-webgpu: move from parameter buffer pool to single buffer with offsets (#21278)

  • Work towards removing bitcast

  • Move rest of existing types over

  • Add timeout back to wait and remove synchronous set_tensor/memset_tensor

  • move to unpackf16 for wider compatibility

  • cleanup

  • Remove deadlock condition in free_bufs

  • Start work on removing parameter buffer pools

  • Simplify and optimize further

  • simplify profile futures

  • Fix stride

  • Try using a single command buffer per batch

  • formatting

macOS/iOS:

  • macOS Apple Silicon (arm64)

  • macOS Intel (x64)

  • iOS XCFramework

Linux:

  • Ubuntu x64 (CPU)

  • Ubuntu arm64 (CPU)

  • Ubuntu s390x (CPU)

  • Ubuntu x64 (Vulkan)

  • Ubuntu arm64 (Vulkan)

  • Ubuntu x64 (ROCm 7.2)

  • Ubuntu x64 (OpenVINO)

Windows:

  • Windows x64 (CPU)

  • Windows arm64 (CPU)

  • Windows x64 (CUDA 12) - CUDA 12.4 DLLs

  • Windows x64 (CUDA 13) - CUDA 13.1 DLLs

  • Windows x64 (Vulkan)

  • Windows x64 (SYCL)

  • Windows x64 (HIP)

openEuler:

  • openEuler x86 (310p)

  • openEuler x86 (910b, ACL Graph)

  • openEuler aarch64 (310p)

  • openEuler aarch64 (910b, ACL Graph)

Was this article helpful?

Sign in to highlight and annotate this article

AI
Ask AI about this article
Powered by Eigenvector · full article context loaded
Ready

Conversation starters

Ask anything about this article…

Daily AI Digest

Get the top 5 AI stories delivered to your inbox every morning.

Knowledge Map

Knowledge Map
TopicsEntitiesSource
b8660llama.cpp R…

Connected Articles — Knowledge Graph

This article is connected to other articles through shared AI topics and tags.

Building knowledge graph…

Discussion

Sign in to join the discussion

No comments yet — be the first to share your thoughts!