Weaviate 1.31 Release

Weaviate Blogby Joon-Pil (JP) HwangJune 3, 20258 min read1 views

1.31 adds MUVERA for multi-vector embeddings, new BM25 operators, the ability to add new object vectors, and more!

Weaviate v1.31 is now available with some really exciting new features as always. It introduces MUVERA encoding for multi-vector embeddings, new BM25 operators for more customizable keyword searching, the ability to add new vectors to existing collections, just to name a few.

This release also adds support for a bunch of new models (model2vec, VoyageAI's v3.5 models, Cohere v3.5 reranker & V4 embed models), and a HUGE list of performance improvements. (Honestly, take a look at the full release notes for the list of changes, and give our engineering team a high five for all the hard work they put in!)

Here are the release ⭐️highlights⭐️!

MUVERA for multi-vector embeddings
New BM25 keyword search operators
Add new vectors to existing collections
HNSW snapshotting
More model integrations
A HUGE list of performance improvements
Community contributions

MUVERA for multi-vector embeddings

ColBERT or ColPali-like multi-vector embeddings went generally available in Weaviate v1.30. As a reminder, here is an illustration showing the difference between single-vector and multi-vector embeddings.

Multi-vector embeddings enable more precise searching through "late interaction", but they have a drawback meaning that they can be much larger than single-vector embeddings.

This is where MUVERA comes in. It flattens variable-length multi-vector embeddings into fixed-length single-vector embeddings. In a majority of cases, this will reduce the size of the embeddings - more so for larger embeddings such as those from long text sources (many tokens) or images (many patches).

There are additional benefits to using MUVERA, such as the general speed-up at import time given that only one vector needs to be inserted into the database (and the vector index). The trade-off is that the simplified vector may reduce the quality of the search.

You can mitigate some of this by changing your vector index settings, such as to set higher ef values.

We'll get into more details on this in a future blog post, coming within the next week or so.

Related resources

New BM25 keyword search operators

Keyword search is a key (sorry, couldn't resist) part of Weaviate's search capabilities. A keyword search will match documents that contain the exact terms (i.e. tokens) in the query, ranked by a BM25 algorithm score that captures the relevance of the match.

The default BM25 algorithm includes documents that contains at least one of the terms in the query. This is a good default for most use cases, but it may not be the best for all use cases.

For example, if you are searching for a specific product (e.g. "noise cancelling microphone"), you may want to only include documents that contain all the terms in the query. Or, if you are searching for technical literature (e.g. "vector embedding quantization technique"), you may want to include documents that contain at least some (e.g. 3) of the terms contained in the query.

Previously, you could sort of do this by combining a BM25 query with a filter. But now, you can use the new And or Or operators to create more complex queries.

Examples

For example, the following query will ONLY match documents that contain all the terms ("australian", "mammal", "cute") in the query, ranked by BM25 score:

from weaviate.classes.query import BM25Operatorjeopardy = client.collections.get("JeopardyQuestion")response = jeopardy.query.bm25( query="Australian mammal cute", operator=BM25Operator.and_(), limit=3,)for o in response.objects: print(o.properties)_

While this one will match documents that contain at least two of the terms ("australian", "mammal", "cute") in the query, ranked by BM25 score:

from weaviate.classes.query import BM25Operatorjeopardy = client.collections.get("JeopardyQuestion")response = jeopardy.query.bm25( query="Australian mammal cute", operator=BM25Operator.or_(minimum_match=2), limit=3,)for o in response.objects: print(o.properties)_

This is a really powerful feature that allows you to customize your keyword search to your use case. Along with our recent indexing performance improvements (see BlockMax WAND blog), Weaviate's keyword search capabilities keep getting better and better.

In case you are wondering - yes, this is available with Weaviate's hybrid search as well. 😉 So you can granularly control the keyword search part of your hybrid search even more.

Related resources

Add new vectors to existing collections

Vectors are, obviously, the core of Weaviate's search capabilities. They are the "fingerprint" of the data that you are storing in Weaviate.

Each object can have multiple vector representations in Weaviate. So a single object, say Movie, can have a vector for its title and one for its description.

While you can plan for these vectors at collection creation time, sometimes your needs change. And that's where this new feature comes in.

From v1.31, you can add new vectors to an existing collection. This is particularly useful if you find that you want to do things like:

Use a different embedding model for a collection (e.g. multi-vector embeddings with MUVERA 😉)
Generate vectors from different parts of the data
Add a new modality to an existing collection (e.g. add a vector for the poster image)

Related resources

HNSW snapshotting

HNSW is Weaviate's default vector index type, as it is efficient and scalable as the size of your collection grows.

You already know that Weaviate includes a ton of great optimizations for HNSW, such as quantizations for in-memory index size, and async indexing for import efficiency.

Now, v1.31 introduces HNSW snapshotting. The tl;dr is that Weaviate can now create snapshots of your HNSW index, instead of having to re-build the index from its write-ahead log (WAL).

Just how much of a difference can this make? Well, it depends on the size of your index of course - but: we've seen around a 10-15x speedup in start-up time for large HNSW indexes.

For a 10 million object index, we've seen it go from 70+ seconds to 5 seconds!

This is enabled by default, so you don't need to do anything to get the benefits. If you want to customize the snapshotting behavior, you can do so as well.

Try it out and watch your instance go 🚀🏎️ .

Related resources

More model integrations

Our list of model integrations keep growing. In v1.31, we add support for:

Cohere - Added support for Cohere's v3.5 reranker model and v4 Embed model. The v4 Embed model is multimodal, too, meaning they can be used for both text and image embeddings.
VoyageAI - Added new set of models from VoyageAI, including voyage-3.5, voyage-3.5-lite and voyage-3-large models.
model2vec - A state of the art static embedding model, meaning it does not use attention mechanisms, making it much faster to run. Use this model where you may have used something like our text2vec-contextionary model.

Related resources

A HUGE list of performance improvements

In these blogs, we tend to talk about the visible new features and improvements - for good reason. But there is a lot of work that you don't necessarily see, all of which adds up to huge improvements in performance over time.

Just as an example, here is a selection of some of the performance improvements that were made in v1.31.

Just a selection of items from the Weaviate 1.31 release notes. (Yes, we realise the font is very small - but we just wanted to convey how many improvements there are!)

Ultimately, these all add up to a much faster and more efficient Weaviate instance over time.

Which - by the way - is why we always recommend you run the latest version of Weaviate, if possible 😉.

So here's a shout-out to our amazing engineering team for all the hard work they put in to make this release (and every other release) possible. 🚀🚀🚀

Related resources

Weaviate is an open-source project. And while of course, much of the work is done by our amazing engineering team, we are always excited to see contributions from the community.

For this release, we are super excited to shout-out the following contributors for their contributions to Weaviate. 🎉🎉🎉

@alingse contributed #7682
@crewone contributed #7641
@cryo-zd contributed #7615
@mohamedawnallah contributed #7779

If you are interested in contributing to Weaviate, please check out our contribution guide, and the list of open issues on GitHub. Filtering for the good-first-issue label is a great way to get started.

Related resources

Summary

Ready to Get Started?

Enjoy the new features and improvements in Weaviate 1.31. The release is available open-source as always on GitHub, and will be available for new Sandboxes on Weaviate Cloud very shortly.

For those of you upgrading a self-hosted version, please check the migration guide for detailed instructions.

It will be available for Serverless clusters on Weaviate Cloud soon as well.

Thanks for reading, see you next time 👋!

Ready to start building?

Check out the Quickstart tutorial, or build amazing apps with a free trial of Weaviate Cloud (WCD).

Original source

Weaviate Blog

https://weaviate.io/blog/weaviate-1-31-release

Was this article helpful?

Ask AI about this article

Ready

Conversation starters

Ask anything about this article…

Daily AI Digest

Get the top 5 AI stories delivered to your inbox every morning.

More about

release

Open Source AILive

Hugging Face Releases TRL v1.0: A Unified Post-Training Stack for SFT, Reward Modeling, DPO, and GRPO Workflows

Hugging Face has officially released TRL (Transformer Reinforcement Learning) v1.0, marking a pivotal transition for the library from a research-oriented repository to a stable, production-ready framework. For AI professionals and developers, this release codifies the Post-Training pipeline—the essential sequence of Supervised Fine-Tuning (SFT), Reward Modeling, and Alignment—into a unified, standardized API. In the early stages […] The post Hugging Face Releases TRL v1.0: A Unified Post-Training Stack for SFT, Reward Modeling, DPO, and GRPO Workflows appeared first on MarkTechPost .

MarkTechPost

1m22 minutes ago

ProductsLive

Antropic's Claude Code leaked and Axios NPM Inflitration

<p><a href="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Fb1ka41vwv76ehjjesu4d.png" class="article-body-image-wrapper"><img src="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Fb1ka41vwv76ehjjesu4d.png" alt=" " width="784" height="478"></a></p> <h2> THE CODE LEAK THAT SHOCKED THE TECH WORLD </h2> <p>This week, Anthropic accidentally opened the floodgates to a wealth of secret information by leaking the full source code of Claude Code via an npm source map. With internal architecture, unreleased features, and multi-agent workflows thrust into the

DEV Community

4m19 minutes ago

ReleasesLive

5 Rust patterns that replaced my Python scripts

<p>I used to reach for Python every time I needed a quick script.<br> File renaming, log parsing, API polling, directory cleanup --<br> Python was the default because it was fast to write and good enough to run.</p> <p>That changed gradually.<br> Not because I decided to rewrite everything in Rust,<br> but because I kept running into the same friction points:<br> shipping the script to another machine, handling errors properly,<br> or running it somewhere Python wasn't available.</p> <p>Here are five patterns where Rust has genuinely replaced Python for me.</p> <h2> 1. Error handling that forces you to think </h2> <p>In Python, the path of least resistance is letting exceptions propagate and hoping for the best.<br> </p> <div class="highlight js-code-highlight"> <pre class="highlight pytho

DEV Community

8m22 minutes ago

Knowledge Map

TopicsEntitiesSource

Connected Articles — Knowledge Graph

This article is connected to other articles through shared AI topics and tags.

Knowledge Graph100 articles · 263 connections

Scroll to zoom · drag to pan · click to open

Discussion

No comments yet — be the first to share your thoughts!

More in Releases

ReleasesLive

Q2, Day 1: When Concepts Have to Become Code

<p>Q1 is over. Yesterday I closed it with a retrospective — 20+ build-log entries, four bots running in production, one AI agent writing half of them. The numbers were real, the gaps were real, the promises for Q2 were real.</p> <p>Today is April 1st. Q2, Day 1.</p> <p>The temptation is to write an April Fools post. "I shipped Aether Dynamo overnight." "The bots tripled." "MiCA compliance is a solved problem."</p> <p>None of that is true. The build-log exists to make those gaps visible. So here they are, visible.</p> <h2> The gap between concept and code </h2> <p>Three things were declared for Q2 at the end of yesterday's retrospective:</p> <ol> <li> <strong>AI Compliance Stack</strong> — a MiCA regulatory feed monitor. Not a platform. A working Python script that polls ESMA/EBA feeds and

DEV Community

3m25 minutes ago

ReleasesLive

Handling Extreme Class Imbalance in Fraud Detection

<p><em>Originally published at <a href="https://riskernel.com/blog/extreme-class-imbalance-fraud-detection.html" rel="noopener noreferrer">Riskernel</a>.</em></p> <p>Fraud is one of the easiest machine learning problems to misunderstand because the target is so rare.</p> <p>In many portfolios, fraud is well below one percent of total events. That means a model can look excellent in offline evaluation while still creating a terrible operational outcome once it meets production traffic.</p> <p>If you are evaluating a fraud vendor or building your own stack, the first thing to understand is that this is not a standard classification problem. It is a rare-event decisioning problem with operational consequences.</p> <h2> Why the base rate changes everything </h2> <p>When fraud is extremely rare

DEV Community

4m17 minutes ago

ReleasesLive

5 Rust patterns that replaced my Python scripts

DEV Community

8m22 minutes ago

ReleasesFresh

Available Careers with a Master’s Degree in Business in AI - Boston University

<a href="https://news.google.com/rss/articles/CBMiigFBVV95cUxQUGtldVVjYnFlMlZ6cGFMaXhEVHhHTGFiQWI4RjNVS2ZDWlBjM3ZTejdnS3VyOFlWbFFkY1dlSmxjQ21MczVGcG40YTJoT29NTU1MaXJxa2s5MThQN2xrcDh5TVY1MDViSzdxM1d4VDhQUnJlWThTRFhhY1p4ZmNIUFBNeHlCbDNWRnc?oc=5" target="_blank">Available Careers with a Master’s Degree in Business in AI</a> <font color="#6f6f6f">Boston University</font>

Google News: AI

1mabout 8 hours ago

Weaviate 1.31 Release

MUVERA for multi-vector embeddings​

New BM25 keyword search operators​

Examples​

Add new vectors to existing collections​

HNSW snapshotting​

More model integrations​

A HUGE list of performance improvements​

Summary​

Ready to start building?​

Daily AI Digest

More about

Hugging Face Releases TRL v1.0: A Unified Post-Training Stack for SFT, Reward Modeling, DPO, and GRPO Workflows

Antropic's Claude Code leaked and Axios NPM Inflitration

5 Rust patterns that replaced my Python scripts

Knowledge Map

Connected Articles — Knowledge Graph

Discussion

More in Releases

Q2, Day 1: When Concepts Have to Become Code

Handling Extreme Class Imbalance in Fraud Detection

5 Rust patterns that replaced my Python scripts

Available Careers with a Master’s Degree in Business in AI - Boston University

MUVERA for multi-vector embeddings

New BM25 keyword search operators

Examples

Add new vectors to existing collections

HNSW snapshotting

More model integrations

A HUGE list of performance improvements

Summary

Ready to start building?