Build Smart Applications With Atlas Vector Search and Google Vertex AI

Venkatesh Shanbhag, Ashwin Gangadhar • 4 min read • Published Jan 17, 2024 • Updated Jan 17, 2024



The application development landscape is evolving rapidly. Today, users expect intuitive, context-aware experiences that understand their intent and deliver relevant results even when queries aren't perfectly phrased, which keyword-based search alone cannot provide. This is where MongoDB Atlas and Google Cloud Vertex AI can help you build and deploy scalable and resilient applications.

MongoDB Atlas Vector Search indexes and stores high-dimensional vectors that represent the meaning of your data. It lets you run fast similarity searches that retrieve results based on meaning and context rather than exact keywords. Google Vertex AI is a comprehensive AI platform that hosts a wide range of pre-trained models and tools, including the Vertex AI PaLM models. These language models extract semantic representations from text data, generating the vectors that MongoDB Atlas Vector Search indexes.

Vector Search can be useful in a variety of contexts, such as natural language processing and recommendation systems. It is a powerful technique that can be used to find similar data based on its meaning.
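To make "similar based on meaning" concrete, here is a minimal sketch of similarity ranking with cosine similarity, one of the common measures used for vector search. The three-dimensional vectors and document names are made up for illustration; real embedding models produce vectors with hundreds of dimensions.

```python
import math


def cosine_similarity(a, b):
    """Cosine of the angle between two equal-length vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return dot / (norm_a * norm_b)


def rank_by_similarity(query, documents):
    """Return document names ordered from most to least similar to the query."""
    scored = [(cosine_similarity(query, vec), name) for name, vec in documents.items()]
    return [name for _, name in sorted(scored, reverse=True)]


# Toy "embeddings" (hypothetical values, not produced by any real model).
documents = {
    "refund policy": [0.9, 0.1, 0.0],
    "shipping times": [0.1, 0.9, 0.1],
    "office address": [0.0, 0.2, 0.9],
}
# Hypothetical embedding of a question such as "how do I get my money back?"
query = [0.8, 0.2, 0.1]

ranking = rank_by_similarity(query, documents)
print(ranking)
```

The query shares no keywords with "refund policy", yet that document ranks first because its vector points in nearly the same direction as the query vector.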

In this tutorial, we will see how to get started with MongoDB Atlas and Vertex AI. If you are new to MongoDB Atlas, refer to the documentation to get set up from Google Cloud Marketplace or use the Atlas registration page.

Before we begin

Make sure that you have the following prerequisites set up before you start testing your application.

MongoDB Atlas access, either through the registration page or from Google Cloud Marketplace
Access to a Google Cloud project in which to create a Compute Engine instance and deploy the application

How to get set up

Let us consider a use case where we are loading sample PDF documents to MongoDB Atlas as vectors and deploying an application on Google Cloud to perform a vector search on the PDF documents.

We will start by creating a MongoDB Atlas Vector Search index on the collection that holds the vectors generated by the Google Vertex AI PaLM model. Atlas needs this index to store and retrieve the vectors.

Create an Atlas Search index

Navigate to the Database Deployments page for your project.
Click Create Database. Name your database vertexaiApp and your collection chat-vec.
Click Atlas Search from the Services menu in the navigation bar.
Click Create Search Index and select JSON Editor under Atlas Vector Search. Then, click Next.
In the Database and Collection section, find the database vertexaiApp, and select the chat-vec collection.
Replace the default definition with the following index definition and then click Next. Click Create Search Index on the review page.

Code Snippet
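The index definition from the original page is not reproduced in this text. As a hedged illustration only, the sketch below builds a definition in the general shape that the Atlas Vector Search JSON Editor accepts and prints it as JSON. The field path "vec" is a hypothetical name, and the 768 dimensions assume the embedding model returns 768-dimensional vectors; check both against the application code before using them.

```python
import json

# Similarity functions accepted by Atlas Vector Search index definitions.
ALLOWED_SIMILARITIES = {"cosine", "euclidean", "dotProduct"}


def build_vector_index_definition(path, num_dimensions, similarity="cosine"):
    """Build an Atlas Vector Search index definition for a single vector field."""
    if similarity not in ALLOWED_SIMILARITIES:
        raise ValueError(f"unsupported similarity: {similarity}")
    if num_dimensions <= 0:
        raise ValueError("num_dimensions must be positive")
    return {
        "fields": [
            {
                "type": "vector",
                "path": path,                      # document field holding the embedding
                "numDimensions": num_dimensions,   # must match the embedding model output
                "similarity": similarity,
            }
        ]
    }


# "vec" and 768 are assumptions for illustration, not values taken from the tutorial.
definition = build_vector_index_definition("vec", 768)
print(json.dumps(definition, indent=2))
```

The printed JSON can be pasted into the JSON Editor once the path and dimension count match what the application actually writes.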

Create a Google Cloud Compute instance

We will create a Google Cloud virtual machine instance to run and deploy the application. The Google Cloud VM can have all the default configurations. To begin, log into your Google Cloud Console and perform the following steps:

In the Google Cloud console, click on Navigation menu > Compute Engine.
Create a new VM instance with the below configurations:
- Name: vertexai-chatapp
- Region: region near your physical location
- Machine configurations:
  - Machine type: High Memory, n1-standard-1
- Boot disk: Click on CHANGE
  - Increase the size to 100 GB.
  - Leave the other options to default (Debian).
- Access: Select Allow full access to all Cloud APIs.
- Firewall: Select all.
- Advanced options:
  - Networking: Expand the default network interface.
  - For External IP range: Expand the section and click on RESERVE STATIC EXTERNAL IP ADDRESS. This will help users to access the deployed application from the internet.
  - Name your IP and click on Done.
Click on CREATE and the VM will be created in about two to three minutes.

Deploy the application

Once the VM instance is created, SSH into the VM instance and clone the GitHub repository.

Code Snippet

The repository contains a script to create and deploy a Streamlit application to transform and store PDFs in MongoDB Atlas, then search them lightning-fast with Atlas Vector Search. The app.py script in the repository uses Python and LangChain to leverage MongoDB Atlas as our data source and Google Vertex AI for generating embeddings.

We start by setting up connections and then use LangChain’s ChatVertexAI and Google's Vertex AI embeddings to transform the uploaded PDF into searchable vectors. Finally, we construct the Streamlit app structure, enabling users to input queries and view the top retrieved documents based on vector similarity.

Install the required dependencies on your virtual machine using the below commands:

Code Snippet

Once the requirements are installed, you can run the application using the below command. Open the application using the public IP of your VM and the port mentioned in the command output:

Code Snippet

The application uses the PdfReader class to read the PDF file uploaded by the user and splits its text into chunks. Each chunk is converted into a vector using LangChain’s VertexAIEmbeddings class. The uploaded PDF is written into MongoDB Atlas chunk by chunk, along with the vectors. You can view the resulting documents in the vertexaiApp.chat-vec namespace.
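The chunk-and-embed step can be sketched as follows. This is not the repository's code: the chunk size, the overlap, the field names "text" and "vec", and the fake_embed function are all hypothetical, and fake_embed only stands in for the Vertex AI embedding call so that the example runs offline.

```python
def chunk_text(text, chunk_size=1000, overlap=100):
    """Split text into fixed-size chunks that overlap by `overlap` characters."""
    if chunk_size <= 0 or not 0 <= overlap < chunk_size:
        raise ValueError("need chunk_size > 0 and 0 <= overlap < chunk_size")
    step = chunk_size - overlap
    chunks = []
    start = 0
    while start < len(text):
        chunks.append(text[start:start + chunk_size])
        if start + chunk_size >= len(text):
            break  # the last chunk already reaches the end of the text
        start += step
    return chunks


def build_documents(chunks, embed):
    """Pair each chunk with its vector, ready to insert into a collection."""
    return [{"text": chunk, "vec": embed(chunk)} for chunk in chunks]


def fake_embed(chunk):
    """Hypothetical stand-in for a real embedding model; returns a 2-d vector."""
    return [float(len(chunk)), float(sum(map(ord, chunk)) % 97)]


text = "abcdefghijklmnopqrstuvwxy"  # 25 characters standing in for PDF text
chunks = chunk_text(text, chunk_size=10, overlap=2)
docs = build_documents(chunks, fake_embed)
print(chunks)
```

Overlapping chunks are a common choice because a sentence that straddles a chunk boundary still appears whole in at least one chunk. In a real deployment, the list returned by build_documents could be passed to PyMongo's insert_many on the chat-vec collection.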

To perform a search on the data, navigate to the Q&A tab on your application. Type in your query and press enter. The application will perform a vector search on MongoDB documents using the index created earlier. The application uses the chat-bison model to converse with the user.
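The application runs the search through LangChain, so the snippet below is not its code. It is a minimal sketch of the underlying $vectorSearch aggregation stage that Atlas executes for this kind of query. The index name "vector_index" and the field names "vec" and "text" are hypothetical, and the short query vector is a placeholder for a real embedding of the user's question.

```python
def build_vector_search_pipeline(query_vector, index_name, path, limit=5, num_candidates=100):
    """Build an aggregation pipeline that returns the `limit` nearest chunks."""
    if limit <= 0:
        raise ValueError("limit must be positive")
    if num_candidates < limit:
        raise ValueError("num_candidates must be at least as large as limit")
    return [
        {
            "$vectorSearch": {
                "index": index_name,              # name of the Atlas Vector Search index
                "path": path,                     # field that stores the embeddings
                "queryVector": list(query_vector),
                "numCandidates": num_candidates,  # neighbours considered before ranking
                "limit": limit,                   # documents returned
            }
        },
        {
            # Keep the chunk text and expose the similarity score of each match.
            "$project": {
                "_id": 0,
                "text": 1,
                "score": {"$meta": "vectorSearchScore"},
            }
        },
    ]


# Placeholder vector; a real one would come from the same embedding model
# that was used when the PDF chunks were stored.
pipeline = build_vector_search_pipeline([0.12, -0.03, 0.88], "vector_index", "vec", limit=3)
print(pipeline[0]["$vectorSearch"]["limit"])
```

With PyMongo, a pipeline like this would be passed to the collection's aggregate method. The retrieved chunks can then be given to the chat model as context for answering the question.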

Conclusion

In this tutorial, you learned how to build a simple application with MongoDB Atlas and Google Cloud Vertex AI, using LangChain to access MongoDB Atlas. You also saw how to deploy the application on a Google Cloud Compute Engine instance and access it through the reserved static public IP address.

Try MongoDB Atlas for free in Google Cloud Marketplace using our pay-as-you-go model and take advantage of our simplified billing.
