b8646
rpc : reuse compute graph buffers (#21299)

Reuse the buffer for the ggml context which is used for creating the compute graph on the server side. This partially addresses a memory leak created by the CUDA backend due to using buffer addresses as cache keys.

ref: #21265
ref: #20315

macOS/iOS:
  macOS Apple Silicon (arm64)
  macOS Intel (x64)
  iOS XCFramework

Linux:
  Ubuntu x64 (CPU)
  Ubuntu arm64 (CPU)
  Ubuntu s390x (CPU)
  Ubuntu x64 (Vulkan)
  Ubuntu arm64 (Vulkan)
  Ubuntu x64 (ROCm 7.2)
  Ubuntu x64 (OpenVINO)

Windows:
  Windows x64 (CPU)
  Windows arm64 (CPU)
  Windows x64 (CUDA 12) - CUDA 12.4 DLLs
  Windows x64 (CUDA 13) - CUDA 13.1 DLLs
  Windows x64 (Vulkan)
  Windows x64 (SYCL)
  Windows x64 (HIP)

openEuler:
  openEuler x86 (310p)
  openEuler x86 (910b, ACL Graph)
  openEuler aarch64 (310p)
  openEuler aarch64 (910b
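
The commit description attributes the leak to buffer addresses being used as cache keys. The following is a minimal C++ sketch of that general mechanism, not the actual ggml, RPC, or CUDA backend code. `GraphCache`, `run_with_fresh_buffers`, and `run_with_reused_buffer` are hypothetical names made up for illustration. The sketch assumes a cache that never evicts entries, and it keeps the per-request buffers alive so that their addresses are guaranteed to be distinct.

```cpp
#include <cassert>
#include <cstddef>
#include <cstdint>
#include <memory>
#include <unordered_map>
#include <vector>

// Hypothetical stand-in for a backend-side cache keyed by buffer address.
// It never evicts, so every new address adds a permanent entry.
struct GraphCache {
    std::unordered_map<const void *, int> entries;  // address -> cached state id

    void touch(const void * buf) {
        // emplace is a no-op when the address is already present
        entries.emplace(buf, static_cast<int>(entries.size()));
    }
};

// One freshly allocated buffer per request. The buffers are kept alive in
// `live` only so that their addresses are guaranteed to differ; a real
// allocator may or may not hand back a previously freed address.
std::size_t run_with_fresh_buffers(int n_requests, std::size_t buf_size) {
    assert(buf_size > 0);
    GraphCache cache;
    std::vector<std::unique_ptr<std::uint8_t[]>> live;
    for (int i = 0; i < n_requests; ++i) {
        live.push_back(std::make_unique<std::uint8_t[]>(buf_size));
        cache.touch(live.back().get());
    }
    return cache.entries.size();
}

// One buffer allocated up front and reused for every request, so the cache
// only ever sees a single address.
std::size_t run_with_reused_buffer(int n_requests, std::size_t buf_size) {
    assert(buf_size > 0);
    GraphCache cache;
    std::vector<std::uint8_t> buf(buf_size);
    for (int i = 0; i < n_requests; ++i) {
        cache.touch(buf.data());
    }
    return cache.entries.size();
}
```

Under these assumptions, `run_with_fresh_buffers(100, 64)` leaves 100 entries in the cache, while `run_with_reused_buffer(100, 64)` leaves 1. Reusing the buffer keeps the key stable, which is consistent with the note that the change only partially addresses the leak: the cache itself is still keyed by address.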