Models claude model release new model feature github

llm-all-models-async 0.1

Simon Willison BlogMarch 31, 20261 min read0 views

Release: <a href="https://github.com/simonw/llm-all-models-async/releases/tag/0.1">llm-all-models-async 0.1</a> LLM plugins can define new models in both <a href="https://llm.datasette.io/en/stable/plugins/tutorial-model-plugin.html">sync</a> and <a href="https://llm.datasette.io/en/stable/plugins/advanced-model-plugins.html#async-models">async</a> varieties. The async variants are most common for API-backed models - sync variants tend to be things that run the model directly within the plugin. My <a href="https://simonwillison.net/2026/Mar/30/mr-chatterbox/#running-it-locally-with-llm">llm-mrchatterbox</a> plugin is sync only. I wanted to try it out with various Datasette LLM features (specifically <a href="https://github.com/datasette/datasette-enrichmen

Release

llm-all-models-async 0.1 — Register async versions of models from LLM plugins that only provide a sync version

LLM plugins can define new models in both sync and async varieties. The async variants are most common for API-backed models - sync variants tend to be things that run the model directly within the plugin.

My llm-mrchatterbox plugin is sync only. I wanted to try it out with various Datasette LLM features (specifically datasette-enrichments-llm) but Datasette can only use async models.

So... I had Claude spin up this plugin that turns sync models into async models using a thread pool. This ended up needing an extra plugin hook mechanism in LLM itself, which I shipped just now in LLM 0.30.

Original source

Simon Willison Blog

https://simonwillison.net/2026/Mar/31/llm-all-models-async/#atom-everything

Was this article helpful?

Ask AI about this article

Ready

Conversation starters

Ask anything about this article…

Daily AI Digest

Get the top 5 AI stories delivered to your inbox every morning.

More about

claudemodelrelease

Releases

datasette-llm 0.1a2

Release: <a href="https://github.com/datasette/datasette-llm/releases/tag/0.1a2">datasette-llm 0.1a2</a> <blockquote> <ul> <li><code>actor</code> is now available to the <code>llm_prompt_context</code> plugin hook. <a href="https://github.com/datasette/datasette-llm/pull/2">#2</a></li> </ul> </blockquote> Tags: <a href="https://simonwillison.net/tags/llm">llm</a>, <a href="https://simonwillison.net/tags/datasette">datasette</a>

Simon Willison Blog

1m5 days ago

ReleasesLive

Supply Chain Attack on Axios Pulls Malicious Dependency from npm

<a href="https://socket.dev/blog/axios-npm-package-compromised">Supply Chain Attack on Axios Pulls Malicious Dependency from npm</a> Useful writeup of today's supply chain attack against Axios, the HTTP client NPM package with <a href="https://www.npmjs.com/package/axios">101 million weekly downloads</a>. Versions <code>1.14.1</code> and <code>0.30.4</code> both included a new dependency called <code>plain-crypto-js</code> which was freshly published malware, stealing credentials and installing a remote access trojan (RAT). It looks like the attack came from a leaked long-lived npm token. Axios have <a href="https://github.com/axios/axios/issues/7055">an open issue to adopt trusted publishing</a>, which would ensure that only their GitHub Actions workflows ar

Simon Willison Blog

1mabout 2 hours ago

ProductsLive

Blind `npm install` Execution Risks Security Vulnerabilities: Review Lockfiles to Mitigate Threats

<a href="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Fk3labo0gsfmuphb69nbt.png" class="article-body-image-wrapper"><img src="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Fk3labo0gsfmuphb69nbt.png" alt="cover" width="800" height="420"></a> <h2> Introduction: The Silent Threat in npm Install </h2> The recent attack on the npm ecosystem didn’t target security engineers meticulously reviewing lockfiles. It targeted the rest of us—developers who type <code>npm install</code> and move on, trusting the process implicitly. This blind executi

DEV Community

13m44 minutes ago

Knowledge Map

TopicsEntitiesSource

Connected Articles — Knowledge Graph

This article is connected to other articles through shared AI topics and tags.

Knowledge Graph100 articles · 138 connections

Scroll to zoom · drag to pan · click to open

Discussion

No comments yet — be the first to share your thoughts!

More in Models

ModelsLive

I Created a SQL Injection Challenge… And AI Failed to Catch the Biggest Security Flaw 💥

I recently designed a simple SQL challenge. Nothing fancy. Just a login system: Username Password Basic query validation Seemed straightforward, right? So I decided to test it with AI. I gave the same problem to multiple models. Each one confidently generated a solution. Each one looked clean. Each one worked. But there was one problem. 🚨 Every single solution was vulnerable to SQL Injection. Here’s what happened: Most models generated queries like: SELECT * FROM users WHERE username = 'input' AND password = 'input'; Looks fine at first glance. But no parameterization. No input sanitization. No prepared statements. Which means… A simple input like: <

DEV Community

2m31 minutes ago

ModelsLive

From one model to seven — what it took to make TurboQuant model-portable

A KV cache compression plugin that only works on one model is a demo, not a tool. turboquant-vllm v1.0.0 shipped four days ago with one validated architecture: Molmo2. v1.3.0 validates seven — Llama 3.1, Mistral 7B, Qwen2.5, Phi-3-mini, Phi-4, Gemma-2, and Gemma-3. The path between those two points was more interesting than the destination. <h2> What Changed </h2> Fused paged kernels (v1.2.0). The original architecture decompressed KV cache from TQ4 to FP16 in HBM, then ran standard attention on the result. The new fused kernel reads compressed blocks directly from vLLM's page table, decompresses in SRAM, and computes attention in a single pass. HBM traffic: 1,160 → 136 bytes per token. <div class="highlight js-code-highlight"> <pre class="highlight pyth

DEV Community

3m31 minutes ago

ModelsLive

8 Gemini AI Prompts That Turn Ordinary Photos Into Professional Portraits

These eight Google Gemini AI prompts transform ordinary photos into polished portraits for LinkedIn, personal branding, family photos, and more. The post 8 Gemini AI Prompts That Turn Ordinary Photos Into Professional Portraits appeared first on TechRepublic .

TechRepublic AI

1mabout 1 hour ago

ModelsLive

Anthropic teams with Australian government to review AI model safety - NewsBytes

<a href="https://news.google.com/rss/articles/CBMitgFBVV95cUxNeEJMNFVqS09nX290b1pLb0t6bHRydkFxZkYzRHpabEtPTGpJdm50RGdBVUtnSnpoYW1VdjlBMERTQnMyZURUeEZTcmhKSmdNWjBOd0NYU1ZBY1ZjU1piSElxNEt0MDdYbzhGNWVNckdEek1jeW1oNTAyT1Zhd0pEbE1HU3BMZWFicnpLekFJVFFOUXQwQThsRWFsaDRpYmdmR1dRN3lqOG9ld0pIZzJuamlUZTNNUQ?oc=5" target="_blank">Anthropic teams with Australian government to review AI model safety</a> NewsBytes

Google News: AI Safety

1mabout 1 hour ago