Live
Black Hat USADark ReadingBlack Hat AsiaAI BusinessOpenAI’s AGI boss is taking a leave of absenceThe VergeGoogle's Gemma 4 AI can run on smartphones, no Internet requiredTechSpotb8656llama.cpp ReleasesThe future of RealSense 3D vision with Chris Matthieu - The Robot ReportGoogle News - AI roboticsThe future of RealSense 3D vision with Chris MatthieuThe Robot ReportLinkerbot’s Linker Hand L30 Can Tighten Screws in Seconds - TechEBlog -Google News - AI roboticsPasta-like robot muscles powered by air can lift 100x their weight - Interesting EngineeringGoogle News - AI roboticsAssessing Marvell Technology (MRVL) After Nvidia’s US$2b AI Partnership And Connectivity Push - simplywall.stGNews AI NVIDIADutchess to host artificial intelligence summit at Marist in Poughkeepsie - Daily FreemanGoogle News: AIAnthropic’s Catastrophic Leak May Have Just Handed China the Blueprints to Claude Al - TipRanksGoogle News: ClaudeOpenAI's Fidji Simo Is Taking Medical Leave Amid an Executive Shake-UpWired AIMeta's AI push is reshaping how work gets done inside the companyBusiness InsiderBlack Hat USADark ReadingBlack Hat AsiaAI BusinessOpenAI’s AGI boss is taking a leave of absenceThe VergeGoogle's Gemma 4 AI can run on smartphones, no Internet requiredTechSpotb8656llama.cpp ReleasesThe future of RealSense 3D vision with Chris Matthieu - The Robot ReportGoogle News - AI roboticsThe future of RealSense 3D vision with Chris MatthieuThe Robot ReportLinkerbot’s Linker Hand L30 Can Tighten Screws in Seconds - TechEBlog -Google News - AI roboticsPasta-like robot muscles powered by air can lift 100x their weight - Interesting EngineeringGoogle News - AI roboticsAssessing Marvell Technology (MRVL) After Nvidia’s US$2b AI Partnership And Connectivity Push - simplywall.stGNews AI NVIDIADutchess to host artificial intelligence summit at Marist in Poughkeepsie - Daily FreemanGoogle News: AIAnthropic’s Catastrophic Leak May Have Just Handed China the Blueprints to Claude Al - TipRanksGoogle News: ClaudeOpenAI's Fidji Simo Is Taking Medical Leave Amid an Executive Shake-UpWired AIMeta's AI push is reshaping how work gets done inside the companyBusiness Insider
AI NEWS HUBbyEIGENVECTOREigenvector

Running 1bit Bonsai 8B on 2GB VRAM (MX150 mobile GPU)

Reddit r/LocalLLaMAby /u/OsmanthusBloom https://www.reddit.com/user/OsmanthusBloomApril 3, 20264 min read0 views
Source Quiz

I have an older laptop from ~2018, an Asus Zenbook UX430U. It was quite powerful in its time, with an i7-8550U CPU @ 1.80GHz (4 physical cores and an Intel iGPU), 16GB RAM and an additional NVIDIA MX150 GPU with 2GB VRAM. I think the GPU was intended for CAD applications, Photoshop filters or such - it is definitely not a gaming laptop. I'm using Linux Mint with the Cinnamon desktop using the iGPU only, leaving the MX150 free for other uses. I never thought I would run LLMs on this machine, though I've occasionally used the MX150 GPU to train small PyTorch or TensorFlow models; it is maybe 3 times faster than using just the CPU. However, when the 1-bit Bonsai 8B model was released, I couldn't resist trying out if I could run it on this GPU. So I took the llama.cpp fork from PrismML, compil

Could not retrieve the full article text.

Read on Reddit r/LocalLLaMA →
Was this article helpful?

Sign in to highlight and annotate this article

AI
Ask AI about this article
Powered by Eigenvector · full article context loaded
Ready

Conversation starters

Ask anything about this article…

Daily AI Digest

Get the top 5 AI stories delivered to your inbox every morning.

More about

llamamodelbenchmark

Knowledge Map

Knowledge Map
TopicsEntitiesSource
Running 1bi…llamamodelbenchmarkreleaseapplicationreportReddit r/Lo…

Connected Articles — Knowledge Graph

This article is connected to other articles through shared AI topics and tags.

Knowledge Graph100 articles · 101 connections
Scroll to zoom · drag to pan · click to open

Discussion

Sign in to join the discussion

No comments yet — be the first to share your thoughts!

More in Models