MongoDB Introduces Embedding and Reranking API on Atlas

MongoDB Embedding and Reranking API

MongoDB has announced the public preview of its Embedding and Reranking API on MongoDB Atlas, giving developers direct access to Voyage AI’s search models within the managed cloud database. The new API enables the creation of features such as semantic search and AI-powered assistants, with MongoDB citing the elimination of operational complexity as a key benefit.

Why This Matters

The introduction of the Embedding and Reranking API addresses the technical reality of building AI retrieval systems, which currently requires stitching together databases, vector search, and retrieval model providers, introducing significant operational complexity. This complexity can lead to increased costs, slower development times, and reduced system reliability, with MongoDB aiming to simplify this process and reduce the associated costs and failures.

Key Insights

The Embedding and Reranking API is database-agnostic and can be integrated into any tech stack or database, according to MongoDB.
Voyage 4 provides text embedding models that work in the same embedding space, allowing teams to store data using one model and run queries with another.
Automated embedding in vector search is available in preview in the community edition, and Lexical Prefilters for MongoDB Vector Search is in public preview.

Working Example

# Import necessary libraries
import pandas as pd
from pymongo import MongoClient

# Connect to MongoDB Atlas
client = MongoClient("mongodb+srv://username:[email protected]/")

# Create a database and collection
db = client["database-name"]
collection = db["collection-name"]

# Embed and rerank documents using the Embedding and Reranking API
# Note: This example is a placeholder and may not reflect the actual API usage

Practical Applications

Use Case: Companies like Airbnb and Uber can utilize the Embedding and Reranking API to build AI-powered search and recommendation systems, improving user experience and engagement.
Pitfall: Failing to consider the operational complexity and costs associated with building AI retrieval systems can lead to significant technical debt and reduced system reliability.

References:

On This Page

MongoDB Embedding and Reranking API

Why This Matters

Key Insights

Working Example

Practical Applications

Continue reading

Related Content

Building a RAG Application with Spring Boot, Spring AI, MongoDB Atlas Vector Search, and OpenAI

Memori Introduces Full-Scale Memory Layer for AI Agents Using SQL and MongoDB

Build a Web Chatbot with Telnyx AI Assistant: A Step-by-Step Guide