Live
Black Hat USAAI BusinessBlack Hat AsiaAI BusinessI Was Told to Write My Thesis in LaTeX. Here's How I Actually Got Started.DEV CommunityBuilding a Multi-Tenant SaaS with Stripe Connect in 2026DEV CommunityPart 3 of 3 — Engineering Intent Series -- Inside the Machine: The ISL Build PipelineDEV CommunityChoosing and Integrating Mobile Video SDKs: FFmpeg, ExoPlayer, and Commercial OptionsDEV CommunityBuild an End-to-End RAG Pipeline for LLM ApplicationsDEV CommunityAgentX-Phase2: 49-Model Byzantine FBA Consensus — Building Cool Agents that Modernize COBOL to RustDEV CommunityWhy Most Agencies Deploy WordPress Multisite for the Wrong ReasonsDEV CommunityHow to Add Structured Logging to Node.js APIs with Pino 9 + OpenTelemetry (2026 Guide)DEV CommunityThe home stretchDEV CommunityAI giant Anthropic says 'exploring' Australia data centre investments - MSNGoogle News: ClaudeMacy’s unveils Google Gemini-based AI shopping assistant - Chain Store AgeGoogle News: GeminiToyota’s Woven Capital appoints new CIO and COO in push for finding the ‘future of mobility’TechCrunch AIBlack Hat USAAI BusinessBlack Hat AsiaAI BusinessI Was Told to Write My Thesis in LaTeX. Here's How I Actually Got Started.DEV CommunityBuilding a Multi-Tenant SaaS with Stripe Connect in 2026DEV CommunityPart 3 of 3 — Engineering Intent Series -- Inside the Machine: The ISL Build PipelineDEV CommunityChoosing and Integrating Mobile Video SDKs: FFmpeg, ExoPlayer, and Commercial OptionsDEV CommunityBuild an End-to-End RAG Pipeline for LLM ApplicationsDEV CommunityAgentX-Phase2: 49-Model Byzantine FBA Consensus — Building Cool Agents that Modernize COBOL to RustDEV CommunityWhy Most Agencies Deploy WordPress Multisite for the Wrong ReasonsDEV CommunityHow to Add Structured Logging to Node.js APIs with Pino 9 + OpenTelemetry (2026 Guide)DEV CommunityThe home stretchDEV CommunityAI giant Anthropic says 'exploring' Australia data centre investments - MSNGoogle News: ClaudeMacy’s unveils Google Gemini-based AI shopping assistant - Chain Store AgeGoogle News: GeminiToyota’s Woven Capital appoints new CIO and COO in push for finding the ‘future of mobility’TechCrunch AI

Running local models on Macs gets faster with Ollama's MLX support

Ars Technica AIMarch 31, 20261 min read0 views
Source Quiz

Running local models on Macs gets faster with Ollama's MLX support

Ollama, a runtime system for operating large language models on a local computer, has introduced support for Apple’s open source MLX framework for machine learning. Additionally, Ollama says it has improved caching performance and now supports Nvidia’s NVFP4 format for model compression, making for much more efficient memory usage in certain models.

Combined, these developments promise significantly improved performance on Macs with Apple Silicon chips (M1 or later)—and the timing couldn’t be better, as local models are starting to gain steam in ways they haven’t before outside researcher and hobbyist communities.

The recent runaway success of OpenClaw—which raced its way to over 300,000 stars on GitHub, made headlines with experiments like Moltbook and became an obsession in China in particular—has many people experimenting with running models on their machines.

As developers get frustrated with rate limits and the high cost of top-tier subscriptions to tools like Claude Code or ChatGPT Codex, experimentation with local coding models has heated up. (Ollama also expanded Visual Studio Code integration recently.)

The new support is available in preview (in Ollama 0.19) and currently supports only one model—the 35 billion-parameter variant of Alibaba’s Qwen3.5. Hardware requirements are intense by normal users’ standards. Users need an Apple Silicon-equipped Mac, sure, but they also need at least 32GB of RAM, according to Ollama’s announcement.

Was this article helpful?

Sign in to highlight and annotate this article

AI
Ask AI about this article
Powered by AI News Hub · full article context loaded
Ready

Conversation starters

Ask anything about this article…

Daily AI Digest

Get the top 5 AI stories delivered to your inbox every morning.

More about

llamamodelollama

Knowledge Map

Knowledge Map
TopicsEntitiesSource
Running loc…llamamodelollamaArs Technic…

Connected Articles — Knowledge Graph

This article is connected to other articles through shared AI topics and tags.

Knowledge Graph100 articles · 163 connections
Scroll to zoom · drag to pan · click to open

Discussion

Sign in to join the discussion

No comments yet — be the first to share your thoughts!

More in Models