Live
Black Hat USAAI BusinessBlack Hat AsiaAI Business🔥 ggml-org/llama.cppGitHub Trending🔥 ollama/ollamaGitHub Trending🔥 sponsors/kepanoGitHub Trending🔥 KeygraphHQ/shannonGitHub Trending🔥 sponsors/abhigyanpatwariGitHub TrendingOpenAI Releases Policy Recommendations for AI AgeBloomberg TechnologyBeware the Magical 2-Person, $1 Billion AI-Driven StartupForrester AI Blog[D] ICML 26 - What to do with the zero follow-up questionsReddit r/MachineLearningStop Writing Mega-Prompts: Use These 5 Anthropic Design Patterns InsteadMedium AIBuilding a Semantic Research Assistant: A Production RAG Pipeline Over 120 arXiv PapersMedium AIBuilding a Multi-Agent Investment PlatformMedium AIClaude Code in the Philippines: ₱112/month vs ₱1,120 for ChatGPTDev.to AIBlack Hat USAAI BusinessBlack Hat AsiaAI Business🔥 ggml-org/llama.cppGitHub Trending🔥 ollama/ollamaGitHub Trending🔥 sponsors/kepanoGitHub Trending🔥 KeygraphHQ/shannonGitHub Trending🔥 sponsors/abhigyanpatwariGitHub TrendingOpenAI Releases Policy Recommendations for AI AgeBloomberg TechnologyBeware the Magical 2-Person, $1 Billion AI-Driven StartupForrester AI Blog[D] ICML 26 - What to do with the zero follow-up questionsReddit r/MachineLearningStop Writing Mega-Prompts: Use These 5 Anthropic Design Patterns InsteadMedium AIBuilding a Semantic Research Assistant: A Production RAG Pipeline Over 120 arXiv PapersMedium AIBuilding a Multi-Agent Investment PlatformMedium AIClaude Code in the Philippines: ₱112/month vs ₱1,120 for ChatGPTDev.to AI
AI NEWS HUBbyEIGENVECTOREigenvector

b8658

llama.cpp Releasesby ggml-orgApril 3, 20262 min read1 views
Source Quiz

server: save and clear idle slots on new task ( --clear-idle ) ( #20993 ) server: clear idle slots KV from VRAM (LLAMA_KV_KEEP_ONLY_ACTIVE) server: move idle slot KV clearing to slot release The save "cost" is now paid by the finishing request. server: add --kv-clear-idle flag, enable by default server: skip clearing last idle slot, clear on launch server: test --no-kv-clear-idle flag server: simplify on-release clearing loop server: remove on-release KV clearing, keep launch-only cont : clean-up tests: update log strings after --clear-idle rename tests: use debug tags instead of log message matching test: fix Windows CI by dropping temp log file unlink Co-authored-by: Georgi Gerganov [email protected] macOS/iOS: macOS Apple Silicon (arm64) macOS Intel (x64) iOS XCFramework Linux: Ubuntu

server: save and clear idle slots on new task (--clear-idle) (#20993)

  • server: clear idle slots KV from VRAM (LLAMA_KV_KEEP_ONLY_ACTIVE)

  • server: move idle slot KV clearing to slot release

The save "cost" is now paid by the finishing request.

  • server: add --kv-clear-idle flag, enable by default

  • server: skip clearing last idle slot, clear on launch

  • server: test --no-kv-clear-idle flag

  • server: simplify on-release clearing loop

  • server: remove on-release KV clearing, keep launch-only

  • cont : clean-up

  • tests: update log strings after --clear-idle rename

  • tests: use debug tags instead of log message matching

  • test: fix Windows CI by dropping temp log file unlink

Co-authored-by: Georgi Gerganov [email protected]

macOS/iOS:

  • macOS Apple Silicon (arm64)

  • macOS Intel (x64)

  • iOS XCFramework

Linux:

  • Ubuntu x64 (CPU)

  • Ubuntu arm64 (CPU)

  • Ubuntu s390x (CPU)

  • Ubuntu x64 (Vulkan)

  • Ubuntu arm64 (Vulkan)

  • Ubuntu x64 (ROCm 7.2)

  • Ubuntu x64 (OpenVINO)

Windows:

  • Windows x64 (CPU)

  • Windows arm64 (CPU)

  • Windows x64 (CUDA 12) - CUDA 12.4 DLLs

  • Windows x64 (CUDA 13) - CUDA 13.1 DLLs

  • Windows x64 (Vulkan)

  • Windows x64 (SYCL)

  • Windows x64 (HIP)

openEuler:

  • openEuler x86 (310p)

  • openEuler x86 (910b, ACL Graph)

  • openEuler aarch64 (310p)

  • openEuler aarch64 (910b, ACL Graph)

Was this article helpful?

Sign in to highlight and annotate this article

AI
Ask AI about this article
Powered by Eigenvector · full article context loaded
Ready

Conversation starters

Ask anything about this article…

Daily AI Digest

Get the top 5 AI stories delivered to your inbox every morning.

More about

llamareleaselaunch

Knowledge Map

Knowledge Map
TopicsEntitiesSource
b8658llamareleaselaunchupdatellama.cpp R…

Connected Articles — Knowledge Graph

This article is connected to other articles through shared AI topics and tags.

Knowledge Graph100 articles · 208 connections
Scroll to zoom · drag to pan · click to open

Discussion

Sign in to join the discussion

No comments yet — be the first to share your thoughts!

More in Releases