EventGet 50% off your ticket to MongoDB.local London on October 2. Use code WEB50Learn more >>
MongoDB Developer
C#
plus
Sign in to follow topics
MongoDB Developer Centerchevron-right
Developer Topicschevron-right
Languageschevron-right
C#chevron-right

Integrating MongoDB with TensorFlow and C#

Folasayo Samuel Olayemi8 min read • Published Sep 05, 2024 • Updated Sep 05, 2024
.NETC#
Facebook Icontwitter iconlinkedin icon
Rate this tutorial
star-empty
star-empty
star-empty
star-empty
star-empty
Are you a C# newbie or guru and curious to know how Tensorflow works with MongoDB as the database? This tutorial is for you.
This process involves fetching data from MongoDB, performing data preprocessing, and building a machine-learning model using ML.NET and TensorFlow. This guide is ideal for developers interested in leveraging the power of machine learning within a .NET environment.

What do you know about MongoDB?

MongoDB is a NoSQL database, and it helps you work with large sets of shared data. MongoDB is a document-oriented database that stores data in JSON-like documents. It allows unmatched scalability and flexibility, plus all the querying and indexing that you need.

What do you know about TensorFlow?

TensorFlow is an open-source, end-to-end platform for machine learning, developed by the Google Brain team. It offers a comprehensive system for managing all components of a machine learning setup. This tutorial, however, concentrates on using a specific TensorFlow API to develop and train machine learning models. TensorFlow operates by constructing computational graphs — networks of nodes representing mathematical operations, with edges between nodes representing the multidimensional data arrays (tensors) that flow through these operations.

Use cases of TensorFlow

Here are some useful application examples of TensorFlow.
  1. Image Recognition: TensorFlow is utilized in image recognition applications to detect objects, faces, and scenes in images and videos. This functionality is essential for a variety of applications, including security systems, where it enhances surveillance by recognizing human activities and faces, and healthcare, where it assists in diagnosing diseases by analyzing medical imagery. Learn more about TensorFlow for image recognition.
  2. Natural language processing (NLP): ensorFlow's capacity to manage large datasets and intricate algorithms makes it ideal for NLP tasks. It supports applications like language translation, sentiment analysis, and chatbots, enabling machines to understand, interpret, and generate human language in a contextually meaningful way. Explore TensorFlow applications in NLP.
  3. Recommendation systems: Numerous e-commerce and streaming companies utilize TensorFlow to build recommendation systems that analyze users' past behavior to suggest products or media they might find interesting. This personalization improves the user experience and can significantly boost conversion rates for businesses. Learn about building recommendation systems with TensorFlow.
  4. Autonomous vehicles: TensorFlow is utilized in the automotive industry to develop and improve systems for autonomous vehicles. By processing data from various sensors and cameras, TensorFlow-based models support making decisions about vehicle steering and collision avoidance. Explore how TensorFlow is applied in autonomous driving.
  5. Healthcare: TensorFlow is utilized in various tasks, like disease diagnosis and drug discovery. It examines patterns from large datasets of medical records to predict disease progression and results, facilitating early diagnosis and personalized treatment plans. Discover TensorFlow applications in healthcare.
These instances illustrate the versatility of TensorFlow covering different domains, showcasing its role in driving innovation by transforming how data is interpreted and utilized in building intelligent applications. Each instance link provided offers a deeper swoop into how TensorFlow is used in real-world applications, providing evidence of its broad utility and impact.

Prerequisites

Before we dive into the details, make sure you have the following installed:

NuGet packages to install

For this project, you need to install the following NuGet packages:
  • MongoDB.Driver: This package includes everything you need to interact with MongoDB, including BSON and CRUD operations. Install with dotnet add package MongoDB.Driver.
  • Microsoft.ML: This package is essential for building and training machine learning models in .NET. Install with dotnet add package Microsoft.ML.
  • Microsoft.ML.TensorFlow: This package allows integration with TensorFlow models within ML.NET. Install with dotnet add package Microsoft.ML.TensorFlow.
Make sure MongoDB is running on your local machine. You can download and install MongoDB from the website.
Finally, set up your development environment by initializing a new C# (Console App) project. Follow Visual Studio Code’s guide if you are coding your C# console app project for the first time.

Step-by-step breakdown of the code

Define schemas and models

Before connecting to MongoDB, define the structure of your data by creating classes that represent your models. This approach ensures that when you interact with MongoDB, you're working with strongly-typed objects instead of generic BsonDocument objects. This improves code clarity, maintainability, and type safety.

Connect to MongoDB

With the model classes defined in the previous step, you can now establish a connection to MongoDB using these models. Instead of using a generic BsonDocument, specify the type of documents in the collection (SampleData), making your code more intuitive and type-safe.
Place the following code in your Program.cs file:
Ensure you include the necessary using statements at the top of Program.cs:
This setup allows you to interact with your MongoDB collection using strongly-typed models, improving code readability and maintainability.

Create and insert sample data

  • Sample data: First, define the sample data to be inserted into MongoDB.
  • Insert data: Then, insert the defined document into the MongoDB collection.

Fetch data

The following steps describe how to retrieve and prepare data from your MongoDB database to use as a dataset for training your TensorFlow model.
Retrieve data: Begin by retrieving the data from MongoDB. This code fetches the first document from the collection and assumes that your MongoDB setup contains only one document that holds the entire dataset you need.
Convert BSON array: Once you've retrieved the document, you'll need to extract the data from the BSON arrays and convert them into float arrays. The Select method is used here to convert each element of the X and Y lists from double to float. This conversion is crucial for compatibility with TensorFlow, which often requires data in the form of float arrays.
These steps ensure that you're not just retrieving generic data, but specifically targeting the structured data stored in your MongoDB collection. This setup is designed to be intuitive for both beginners and experienced developers, providing a clear pathway from data retrieval to TensorFlow model training in a C# environment.

Preparing data for ML.NET

  • ML context: Create a new ML.NET context.
  • Data structures: Load the data into ML.NET's data structures

Building and training the model

  • Pipeline definition: Define a machine learning pipeline using ML.NET.
  • Model training: Train the model on the data.

Evaluating the model

  • Transform data: Transform the data using the trained model.
  • Evaluate model: Evaluate the model’s performance using R² and RMSE metrics.

Making predictions

  • Prediction engine: Create a prediction engine.
  • Make predictions: Use the engine to make predictions and display the results.

Running the code

  1. Ensure MongoDB is running: Start MongoDB on your local machine.
  2. Run the code: Execute the program using dotnet run.

Expected output

The expected output of the given C# code, which seeds data into MongoDB and then uses TensorFlow to perform linear regression, includes two main parts: a confirmation message that data has been seeded successfully and the evaluation metrics of the linear regression model, followed by the predicted values for each data point.
Here’s a detailed breakdown of what you should expect:
Data seeding confirmation The first output message confirms that the data has been seeded into MongoDB successfully.
Model evaluation metrics The output includes evaluation metrics for the linear regression model. These metrics help to understand the performance of the model.
R-squared (R^2) This value measures the proportion of variance in the dependent variable that is predictable from the independent variable(s). In this case, a negative R^2 value of -85.09826582520343 indicates that the model is not fitting the data well.
Root mean squared error (RMSE) This metric measures the average magnitude of the prediction errors. A lower RMSE indicates a better fit of the model. Here, the RMSE value is 6.567497795652124.
Predictions For each data point, the output shows the actual X and Y values, along with the predicted Y value from the linear regression model.
These values show the actual X and Y values from the dataset along with the corresponding predicted Y values. The predictions illustrate how the linear regression model approximates the relationship between X and Y. Given the poor performance indicated by the R^2 value, the predicted values may not be close to the actual Y values.

Summary

  • Data seeding confirmation: Confirms that the data was successfully inserted into MongoDB.
  • Model evaluation metrics: Provides insight into the model's performance, indicating poor fit with a negative R^2 and a relatively high RMSE.
  • Predictions: Shows the actual and predicted values, highlighting the model's approximation of the data.

Full expected output example

Console output from dotnet run showing expected output

Conclusion

By following this guide, you’ve successfully integrated MongoDB with TensorFlow and C# using ML.NET. This integration enables you to leverage MongoDB's data storage capabilities with the powerful machine learning framework TensorFlow, all within a .NET environment.
This tutorial demonstrates the ease with which different technologies can be combined to create robust data processing and machine learning solutions.

Appendix

For further learning, check out the TensorFlow and MongoDB documentation, and explore more complex machine learning models and database operations. If you have questions or want to share your work, join us in the MongoDB Developer Community.
Thanks for reading... Happy coding!
Top Comments in Forums
There are no comments on this article yet.
Start the Conversation

Facebook Icontwitter iconlinkedin icon
Rate this tutorial
star-empty
star-empty
star-empty
star-empty
star-empty
Related
Article

MongoDB Provider for EF Core: The Latest Updates


Aug 29, 2024 | 6 min read
Tutorial

Saving Data in Unity3D Using Realm


Aug 13, 2024 | 10 min read
Article

How to Set Up MongoDB Class Maps for C# for Optimal Query Performance and Storage Size


Aug 05, 2024 | 8 min read
Tutorial

MongoDB Atlas Search with .NET Blazor for Full-Text Search


Feb 01, 2024 | 6 min read
Table of Contents