Generating MQL Shell Commands Using OpenAI and New mongosh Shell

Pavel Duchovny • 7 min read • Published Jul 23, 2021 • Updated Jul 11, 2023

AI MongoDB Shell


OpenAI is a fascinating and growing AI platform sponsored by Microsoft. It lets you process text and produce AI content with stunning results, considering how small a “learning data set” you actually provide.

MongoDB’s Query Language (MQL) is an intuitive language for developers to interact with MongoDB documents. For this reason, I wanted to test how quickly OpenAI could learn the MongoDB language and use its overall knowledge to build queries from simple sentences. The results were more than satisfying to me. GitHub is already working on a project called GitHub Copilot, which is also powered by an OpenAI engine to generate code.

In this article, I will show you my experiment, including the game-changing capabilities of the new MongoDB Shell (mongosh), which can extend scripting with npm module integrations.

What is OpenAI and How Do I Get Access to It?

OpenAI is a unique project aiming to provide an API for many AI tasks built mostly on Natural Language Processing today. You can read more about their projects in this blog.

There are a variety of examples for its text processing capabilities.

If you want to use OpenAI, you will first need to get a trial API key by joining the waitlist on their main page. Once you are approved, you will be granted about $18 of credit for three months of testing. Each OpenAI call is billed, which is something to consider when using it in production. For our purposes, $18 is more than enough to test the most expensive engine, named “davinci.”

Once you get the API key, you can use various clients to run their AI API from your script/application.

Since we will be using the new mongosh shell, I have used the JS API.

Preparing the mongosh to Use OpenAI

First, we need to install the new shell, if you haven’t done so already. On my Mac laptop, I just issued:

Code Snippet

Windows users should download the MSI installer from our download page and follow the Windows instructions.

Once my mongosh is ready, I can start using it, but before I do so, let’s install OpenAI JS, which we will import in the shell later on:

Code Snippet

I’ve decided to use the Questions and Answers pattern, in the form of Q: <Question> and A: <Answer>, and to pass it to the text-to-command completion API as the learning material about MongoDB queries for the AI engine. To feed it better, I placed the training questions and answers in a file called AI-input.txt with the following content:


Q: What is the query syntax?
A: db.collection.find(<filter>, <projection> , <options>)
Q:  Query users collection for username with value "boy"
A: db.users.find({"username" : "boy"})
Q:  Query users collection for username with value "girl"
A:  db.users.find({"username" : "girl"})
Q: Query users collection for username with age bigger than 16
A:  db.users.find({"age" : {$gt : 16}});
Q: Query author collection for username with value "boy"
A: db.authors.find({"username" : "boy"})
Q:Query author collection for age lower than 7
A: db.authors.find({"age" : {$lt : 7}});

Q:insert a json document into collection authors with username equal to "girl"
A: db.authors.insert({"username" : "girl", "age" : 10, "gender" : "female"})
Q: insert many documents into users collections
A: db.users.insertMany([{username : "john doe"}, {username : "joe doe"}]);
Q: The following aggregation uses $geoNear to find documents with a location at most 2 meters from the center [ -73.99279 , 40.719296 ] and a category equal to Parks.
A: db.places.aggregate([{ $geoNear: {near: { type: "Point", coordinates: [ -73.99279 , 40.719296 ] },distanceField: "dist.calculated",maxDistance: 2, query: { category: "Parks" },includeLocs: "dist.location", spherical: true}}])
Q: How to build atlas $search text query?
A: db.collection.aggregate([{$search: {"index": <index name>, "text": {"query": "<search-string>","path": "<field-to-search>",  "fuzzy": <options>,"score": <options>  }  }}])

Q: What is the aggregate syntax?
A: db.collection.aggregate([<stage1>,<stage2>,<stage3>], <options>);
Q: aggregate users collection to calculate salary sum per user
A: db.users.aggregate([{$group : { _id : "$username" , salary_sum : { $sum : "$salary" }}}]);
Q: aggregate person collection to calculate salary sum per person
A: db.persons.aggregate([{$group : { _id : "$person" , salary_sum : { $sum : "$salary" }}}]);
Q: Lookup users and orders collection
A: db.users.aggregate([{$lookup : {from: 'orders', localField : "_id", foreignField : "userId", as : "result" }} ]);

Q: What is the update syntax?
A:db.collection.update(query, update, options)
Q:  How to edit collection sports where sportname is 'football' and match is 'england vs portugal' to score of '3-3' and date to current date?
A: db.sports.update({ sportname: "football", match: "england vs portugal"} , {$set : {score: "3-3" , date : new Date()}})
Q: Query and atomically update collection zoo where animal is "bear" with a counter increment on eat field, if the data does not exist user upsert
A: db.zoo.findOneAndUpdate({animal : "bear"}, {$inc: { eat : 1 }} , {upsert : true})

We will use this file later in our code, so that each completion follows the same question-and-answer pattern.
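Because the completion relies on this pattern, a quick check of the learning file can catch a malformed line before it reaches the engine. The following is a minimal sketch, not part of the original experiment; the helper name findPatternBreaks is hypothetical, and it assumes every question and every answer sits on its own line.

```javascript
// Hypothetical helper: reports lines in the learning text that break the
// expected alternation of "Q:" lines and "A:" lines. Blank lines are ignored.
function findPatternBreaks(learningText) {
  const problems = [];
  let expected = 'Q:';
  learningText.split('\n').forEach((line, index) => {
    const trimmed = line.trim();
    if (trimmed === '') return;
    if (!trimmed.startsWith(expected)) {
      problems.push({ line: index + 1, expected: expected, found: trimmed.slice(0, 20) });
    }
    // After a question we expect an answer; after anything else, a question.
    expected = trimmed.startsWith('Q:') ? 'A:' : 'Q:';
  });
  return problems;
}

// A question followed directly by another question is reported on line 2.
console.log(findPatternBreaks('Q: first\nQ: second\nA: answer'));
```

An empty array means every question is followed by an answer.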

Prepare Your Atlas Cluster

MongoDB Atlas, the database-as-a-service platform, is a great way to have a running cluster in seconds with a sample dataset already in place for our test. To prepare it, please use the following steps:

Create an Atlas account (if you don’t have one already) and use/start a cluster. For detailed steps, follow this documentation.
Load the sample data set.
Get your connection string.

Pass the copied connection string to the mongosh binary to connect to the pre-populated Atlas cluster with sample data. Then, switch to the sample_restaurants database.

Code Snippet

Using OpenAI Inside the mongosh Shell

Now, we can build our textToMql function by pasting it into mongosh. The function will receive a text sentence, use our generated OpenAI API key, and try to return the best MQL command for it:

Code Snippet

In the above function, we first load the OpenAI npm module and instantiate a client with the relevant API key from OpenAI.

Code Snippet

The new shell allows us to import built-in and external modules, which gives our scripts a great deal of flexibility.
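As a minimal sketch of using a built-in module, the learning file could be read with Node's fs module. This is not the original snippet: the helper name loadLearningData is hypothetical, and it assumes AI-input.txt sits in the directory the shell was started from.

```javascript
// Hypothetical helper: reads the learning file with Node's built-in fs module.
const fs = require('fs');

async function loadLearningData(filePath) {
  // 'utf8' makes readFile resolve to a string instead of a Buffer.
  return fs.promises.readFile(filePath, 'utf8');
}
```

Inside an async function such as textToMql, this could be called as `const learningPath = await loadLearningData('AI-input.txt');`.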

Then, we read the learning data from our AI-input.txt file. Finally, we add our Q: <query> input to the end, followed by an empty A: value, which tells the engine we expect an answer based on the provided learningPath and our query.
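The prompt assembly described here can be sketched as a small pure function. The name buildPrompt and its parameters are hypothetical and stand in for whatever the original function does inline.

```javascript
// Hypothetical helper: appends the new question to the learning text so the
// engine's most likely continuation is the matching answer.
function buildPrompt(learningText, query) {
  // Make sure the learning text ends with a newline so "Q:" starts its own line.
  const base = learningText.endsWith('\n') ? learningText : learningText + '\n';
  return base + 'Q: ' + query + '\nA:';
}

const sample = 'Q: What is the update syntax?\nA: db.collection.update(query, update, options)';
console.log(buildPrompt(sample, 'Query users collection for username with value "boy"'));
```

Ending the prompt with a bare `A:` is what makes the Q/A pattern work: the engine completes the line that the learning examples have taught it to expect.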

This data will go over to an OpenAI API call:

Code Snippet

The call invokes the completion API, passing the entire initial text as a prompt along with some additional parameters, which I will elaborate on:

engine: OpenAI supports a few AI engines, which trade quality and purpose against pricing. The “davinci” engine is the most sophisticated one, according to OpenAI, and is therefore the most expensive in terms of billing.
temperature: How creative the AI will be compared to the input we gave it. It can be between 0 and 1. 0.3 felt like a down-to-earth value, but you can play with it.
max_tokens: The upper limit on the amount of data (in tokens) that will be returned.
stop: A list of character sequences that stop the engine from producing further content. Since we need to produce MQL statements, the output is one line long, so “\n” is the stop sequence.
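To make the list above concrete, the parameters can be collected into a single object. This is a sketch under assumptions: buildCompletionParams is a hypothetical helper, the max_tokens value is illustrative rather than taken from the experiment, and some client libraries expect camelCase keys (for example maxTokens), so check the documentation of the client you use.

```javascript
// Hypothetical helper: gathers the completion parameters described above.
// Key names follow the list in this article; your client may spell them differently.
function buildCompletionParams(prompt) {
  return {
    engine: 'davinci',  // most sophisticated and most expensive engine
    prompt: prompt,     // learning text plus the new "Q: ...\nA:" line
    temperature: 0.3,   // low value keeps the output close to the examples
    max_tokens: 400,    // illustrative cap (assumption), not a value from the article
    stop: ['\n']        // MQL answers are one line, so stop at the first newline
  };
}

console.log(buildCompletionParams('Q: What is the query syntax?\nA:'));
```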

Once the content is returned, we parse the returned JSON and print it with console.log.
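A minimal sketch of that parsing step follows. It assumes the response body has the completion shape `{ choices: [ { text } ] }`; the helper name extractCommand and the mock body are hypothetical, and depending on the client the body may be nested under a property such as `data`.

```javascript
// Hypothetical helper: pulls the generated text out of a completion response
// body, assuming the shape { choices: [ { text: "..." } ] }.
function extractCommand(body) {
  if (!body || !Array.isArray(body.choices) || body.choices.length === 0) {
    throw new Error('Completion response contains no choices');
  }
  // The text usually begins with the space that follows "A:", so trim it.
  return body.choices[0].text.trim();
}

// Mock body for illustration only; this is not real API output.
const mockBody = { choices: [{ text: ' db.users.find({"username" : "boy"})' }] };
console.log(extractCommand(mockBody));
```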

Let's Put OpenAI to the Test with MQL

Once we have our function in place, we can try to produce a simple query to test it:


Atlas atlas-ugld61-shard-0 [primary] sample_restaurants> textToMql("query all restaurants where cuisine is American and name starts with 'Ri'")
 db.restaurants.find({cuisine : "American", name : /^Ri/})

Atlas atlas-ugld61-shard-0 [primary] sample_restaurants> db.restaurants.find({cuisine : "American", name : /^Ri/})
[
  {
    _id: ObjectId("5eb3d668b31de5d588f4292a"),
    address: {
      building: '2780',
      coord: [ -73.98241999999999, 40.579505 ],
      street: 'Stillwell Avenue',
      zipcode: '11224'
    },
    borough: 'Brooklyn',
    cuisine: 'American',
    grades: [
      {
        date: ISODate("2014-06-10T00:00:00.000Z"),
        grade: 'A',
        score: 5
      },
      {
        date: ISODate("2013-06-05T00:00:00.000Z"),
        grade: 'A',
        score: 7
      },
      {
        date: ISODate("2012-04-13T00:00:00.000Z"),
        grade: 'A',
        score: 12
      },
      {
        date: ISODate("2011-10-12T00:00:00.000Z"),
        grade: 'A',
        score: 12
      }
    ],
    name: 'Riviera Caterer',
    restaurant_id: '40356018'
  }
...

Nice! We never taught the engine about the restaurants collection or how to filter with regex operators, but it still made the correct decisions.
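For readers less familiar with the generated filter, /^Ri/ is a regular expression anchored to the start of the string. The sketch below applies the same expression to plain JavaScript strings; apart from 'Riviera Caterer', the names are made up for illustration.

```javascript
// Illustration only: the same regular expression applied to plain strings.
// Names other than 'Riviera Caterer' are hypothetical examples.
const startsWithRi = /^Ri/;
const names = ['Riviera Caterer', 'Brunos On The Boulevard', 'El Rincon'];

// '^' anchors the match to the beginning, so 'El Rincon' does not qualify
// even though it contains "Ri".
const matches = names.filter((name) => startsWithRi.test(name));
console.log(matches);
```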

Let's do something more creative.

Code Snippet

Okay, now let's put it to the ultimate test: aggregations!

Code Snippet

Now that is the AI power of MongoDB pipelines!

DEMO

Wrap-Up

MongoDB's new shell allows us to script with enormous power like never before by utilizing external npm packages. Together with the power of OpenAI's sophisticated AI patterns, we were able to teach the shell how to turn text prompts into accurate, complex MongoDB commands, and with further learning and tuning, we can probably get much better results.

Try this today using the new MongoDB shell.
