Enhancing the .NET Development Experience with Roslyn Static Analysis

Boris Dogadov
November 17, 2022

The MongoDB .NET/C# driver introduces idiomatic APIs for constructing queries and aggregations: LINQ and Builders. These APIs eliminate the need to write native MongoDB Query Language (MQL), but they also introduce some overhead when it comes to troubleshooting and optimizing the underlying MQL. Because the generated MQL cannot be inspected at compile time, troubleshooting queries involves outputting MQL at runtime and/or inspecting runtime exceptions.

Given that MQL generation from a C# expression is basically transpiling, we knew that theoretically inferring the general form of MQL in compile time was solvable by static analysis. This realization, and the fact that the .NET ecosystem has an amazing framework for writing static analyzers (Roslyn), made me excited to try out this idea during MongoDB Skunkworks week.

In this article, I will share my experience of forming a plan for this project, crafting a quick proof-of-concept during Skunkworks week, and eventually releasing the first public version.

Skunkworks at MongoDB

One of my favorite perks of working at MongoDB is that we get a whole week, twice a year, to focus on our own projects. This week is a great opportunity to meet and collaborate with other folks in the company, try out any ideas we want, or learn something new.

I started my Skunkworks week by refreshing my Roslyn skills. While a week sounds like a fair amount of time for rapid prototyping, naturally I still had to settle on just a small subset of all the cool features that came to mind. I was lucky and, by the end of the Skunkworks, I had a MongoDB Analyzer for .NET prototype sufficient to demonstrate the feasibility of this idea.

Roslyn analyzers

A significant part of the .NET ecosystem is the open source .NET Compiler Platform SDK (Roslyn API). This SDK is well integrated into the .NET build pipeline and IDE (e.g., VS, Rider), which allows for the creation of tools for code analysis and generation.

The Roslyn SDK exposes the standard compiler's building blocks. The main ones that will be used in the Analyzer project are:

Abstract syntax tree (AST): Data structure representing the text of the analyzed code.
Symbol table: Data structure that holds information about variables, methods, classes, interfaces, types, and other language elements. Each node in AST can have a corresponding symbol.
Emit API: API that allows you to generate a new IL code dynamically and compile it to a memory assembly, which can be loaded and executed in the same application.

Roslyn SDK provides a convenient API to develop and package a code analyzer, which can be easily integrated into a .NET project and executed as part of the build pipeline. Or, it can expose an interactive UI in an IDE, thereby enriching developers' experience and enforcing project-specific rules.

Design approach

The .NET.C# driver provides an API to render any LINQ or Builder expression to MQL. The next logical step is to identify the needed expressions and use the driver to extract the matching MQLs. Extracting the Builders or LINQ expression syntax nodes from the syntax tree provided by Roslyn was fairly straightforward.

The next step, therefore, is to create a new syntax tree and add these expression syntax nodes combined with MQL generating syntax. Then, this new syntax tree is compiled into executable code, which is dynamically invoked to generate the MQL.

To optimize this process, the Analyzer maintains a template syntax tree containing a sample MQL generation code from an expression:

public class MQLGenerator
{ 
    public static string RenderMQL()
    {
        	var buildersDefinition = Builders<MqlGeneratorTemplateType>.Filter.Gt(p => p.Field, 10);
        	return Renderer.Render(buildersDefinition);
    }
}

From this template, a new single syntax tree is produced for each Analyzer run, by dynamically adding the RenderMQL_N method for each analyzed expression N, and replacing the expression placeholder with the analyzed expression:

public static string RenderMQL_1()
{
      	var buildersDefinition = AnalyzedBuildersExpression;
    	return Renderer.Render(buildersDefinition);
}

Next, the compilation unit is created from the syntax tree containing all the analyzed expressions and emitted to in-memory assembly (Figure 1). This assembly is loaded into Analyzer AppDomain, from which the MQLGenerator object is instantiated, which provides the actual MQL by invoking RenderMQL_N methods.

Visualization of LINQ and builder expressions extraction and MQL generation. Process starts with — **Figure 1:** LINQ and Builder expressions extraction and MQL generation.

This approach imposed four fundamental challenges, discussed below:

Data types resolution: Expressions are strongly typed, while the types are usually custom types that are defined in the user code.
Variables resolution: Expressions usually involve variables, constants, and external methods. The Analyzer cannot resolve those dependencies at compile time.
Driver versions: Different driver versions might render different MQL. The exact driver version referenced by the analyzed code has to be used.
Testing: The Roslyn out-of-the-box testing template lets you test analyzers on C# code provided as a simple string, which imposes significant maintainability challenges for a large number of tests.

Data types resolution

Given a simple LINQ expression that retrieves all the movies produced by Christopher Nolan from the movies collection:

var moviesCollection = db.GetCollection<Movie>("movies").AsQueryable();
var movies = moviesCollection.Where(movie => movie.Producer == “Christopher Nolan”);

The underlying Movie type, and all types Movie is dependent upon, must be ported into the Analyzer compilation space. All imported types must exactly reproduce the original namespaces hierarchy. Expressions like db.GetCollection<Movie> must be rewritten with fully qualified names to avoid naming collisions and namespace resolutions. For example, user code could contain Namspace1.Movie and Namespace2.Movie.

An additional problem with importing the types directly is the unbounded complexity of methods and properties implementations, which in most cases could not be compiled in the Analyzer compilation space. This excess code plays no role in MQL generation and must not be imported into the compilation unit.

We decided that an easier and cleaner solution was to create a unique type name for each referenced type under a single namespace. The Analyzer uses the semantic model to inspect the Movie type defined in the user’s code and creates a new MovieNew syntax node mirroring all Movie properties and fields. This process is repeated for each type referenced by Movie, including enums, arrays, collections (Figure 2).

After creating a MovieNew type as a syntax declaration, the original LINQ expression must be rewritten to reference the new type. Therefore, the original expression is transformed to a new expression: db.GetCollection<MovieNew>("movies").

Visual representation of LINQ and Builder expressions extraction, data types resolution and MQL generation. Process begins at user code AST. Through rewrite types, you move to data types AST, and through extract LINQ, you move to expressions AST. Then, through. compile, you move to MQL generating assembly. Finally, through execute, you move to MQL. — **Figure 2:** LINQ and Builder expressions extraction, data types resolution and MQL generation.

Variables resolution

In practice, LINQ and Builders expressions mostly reference variables as opposed to simple constants. For example:

var movies = moviesCollection.Where(movie => movie.Title == movieName)

At runtime, the movieName value is resolved, and MQL is generated with a constant value. For example, the above expression can result in the following MQL:

aggregate([{ "$match" : { "Title" : "Dunkirk" } }])

This constant value is not available to Analyzer at compile time; therefore, we have to think of a workaround. Instead of presenting the constant, the Analyzer outputs the variable name:

aggregate([{ "$match" : { "Title" : movieName } }])

As you can see, this technique does not produce a valid MQL. But, most importantly, it preserves the MQL shape and contains the referenced variable information. This is done by replacing each external variable and method reference in the original expression by a unique constant, and substituting it back in the resulting MQL (Figure 3).

Driver versions

The naive approach would be to embed a fixed driver dependency into the Analyzer. However, this approach imposes some significant limitations, including:

MQL accuracy degradation: Different versions of the driver can produce slightly different MQL due to bug fixes and/or new features.
Backward compatibility: Expressions written with older driver versions might not be supported or result in different MQL.
Forward compatibility: The Analyzer would not be able to process new expressions supported by newer driver versions. This issue can be resolved by releasing a new Analyzer version for each driver version, but ideally we wanted to avoid such development overhead.

Luckily, instead of embedding a driver package with a fixed version into the Analyzer package, and limiting the Analyzer only to that specific driver version, Analyzer uses the actual driver package that is used by the user’s project and found on the user's machine. In this way, Analyzer is “driver-version agnostic” in some sense.

One of the challenges was to dynamically resolve the correct driver version for each compilation, as C# dynamic compilation tries to resolve the dependencies from the current AppDomain. To solve this, Analyzer overrides the global AppDomain assembly resolution and loads the correct driver assemblies for each resolution request.

An additional nuance was to load the correct .NET framework version. Usually, the Analyzer runs on a different .NET platform than the project's .NET target (e.g., Analyzer can run in VS on .NET Framework 4.7.2, while the analyzed project references the .NET Standard 2.1 driver).

Luckily, all recent driver distributions contain the .NET Standard 2.0 version, which is supported by both .NET Core and .NET Framework platforms. The next step is to identify the physical location of .NET Standard 2.0 driver assemblies with the correct version (Figure 4).

This approach allows the Analyzer to be driver-version agnostic, including supporting future driver versions regardless of the OS platform (e.g., Rider on Linux/Mac, VS on Mac/Windows, .NET build Linux/Mac/Windows).

Testing

Writing tests for such a project requires an unorthodox testing methodology as well. However, the Roslyn SDK provides a testing framework for writing integration tests.

An integration test would receive a C# code snippet to be analyzed supplied as string and then execute the Analyzer on it. The default testing methodology introduces some inconveniences. For example, writing and maintaining hundreds of tests cases, with each test case testing multi-line C# code, involving complex data types as a usual string, without a compiler involves quite the overhead. Therefore, we extended the testing framework by creating a custom test runner in the following way.

All the C# code for the integration tests is written as a standalone C# project, which is compiled in a standard way. Common underlying data types and other code elements are easily reused. An intended test method is marked by a custom attribute denoting the expected result.

An additional test project references the former project and uses the reflection to identify the test cases denoted by special attributes. Then, it executes the Analyzer on the test cases’ C# files and the appropriate driver version and validates the results.

For example, for LINQ expression .Where(u => u.Name.Trim() == "123"), we expect the Analyzer to produce a warning for LINQ2 and valid MQL for LINQ3. The test case is written in the following way:

[NotSupportedLinq2("Supported in LINQ3 only: db.coll.Aggregate([{ \"$match\" : { \"Name\" : /^\\s*(?!\\s)123(?<!\\s)\\s*$/s } }])")]
[MQLLinq3("db.coll.Aggregate([{ \"$match\" : { \"Name\" : /^\\s*(?!\\s)123(?<!\\s)\\s*$/s } }])")]
public void String_methods_Trim()
{
	_ = GetMongoQueryable()
	.Where(u => u.Name.Trim() == "123");
}

The Analyzer testing framework parses the C# test cases project and creates a test case for each (DriverVersion, LinqProviderVersion, TestCase) combination (as shown in Figure 5):

Screenshot of the test cases dynamically generated from C# code for each tested driver version discovered in Visual studio test explorer. Test cases displayed in a tiered list. From top to bottom: MongoDB Analyzer tests (net472) 3, MongoDB analyzer tests linq 3, Linq3Tests 2, NotSupportedLinq2 2, vs 14 1_String_methods_Trim, v2 14 1_V3_String_Methods_Trim, LinqNotSupportedExpressionsTests 1, v2 14 1_Unsopported_string-method_Trim — **Figure 5:** Test cases dynamically generated from C# code for each tested driver version discovered in Visual studio test explorer.

This approach allows smooth integration with VS test runner and a seamless development experience.

Besides significantly increasing the maintainability and readability, this approach also introduces a bonus feature. The test code project can be opened as a standalone solution (without the test framework), and the Analyzer output can be visually inspected for each test case as a user would see it.

From initial idea to first release

Because the Skunkworks project proved to be successful, the decision was made to develop a public first release. Generally, developing and releasing a greenfield product in most companies is a lengthy process, which involves resource allocation and planning, productizing, marketing, quality assurance, developing appropriate documentation, and support.

In MongoDB, however, this process was incredibly fast. We formed a remote ad hoc team, across two continents, involving product management, documentation experts, developer relations, marketing specialists, and developers. Despite the fact that we were working together as a team for the first time, the collaboration level was amazing, and the high level of professionalism and motivation allowed everybody to do their part extremely efficiently with almost zero overhead.

As a result, we developed and released a full working product, documentation, marketing materials, and support environment in less than three months.

Learn more about our internal Skunkworks hackathon and some of the projects MongoDB engineers built this year.

← Previous

Introducing the Next Generation of MongoDB Education

MongoDB University has always offered developers free, self-paced, on-demand ways to learn MongoDB and advance their careers. Now MongoDB has launched an enhanced University experience, with a rollout of new courses and features, and a seamless path to MongoDB certification to help take your skills and career to the next level. “MongoDB has always been a developer-first company. But it’s one thing to say that and support the current generation of developers and MongoDB users, it’s another to play a larger role in molding the developers of the future,” says Mark Porter, Chief Technology Officer, MongoDB. “Developers have gone from being a curiosity when I began my career to becoming a boardroom priority. If software and applications are the currency of the modern day economy, development teams are the market makers, and we want to support them on this journey.” Announced at our annual .local London developer conference, the new learning experience makes it easy to quickly pick up knowledge, develop a fundamental MongoDB skillset, and get certified. Beginning November 15, you can discover: New courses that make it easy to learn how to use MongoDB in the context of your preferred programming language, including Python, C#, Java, and Node.JS New discounts and incentives to support your growth. Now, any time you complete a new developer learning path , you will receive a 50% discount on the Associate Developer certification exam fee. And as always, all course content is free. Easier accessibility to course videos - registration is no longer required! Those who do register gain access to hands-on labs, quizzes, and certifications, as well as the ability to track their progress. Language Subtitles for all new courses . Chinese (Traditional and Simplified), Korean, Spanish, French, and Portuguese subtitles are now available. Short form content on newer features . Try our new “Learning Bytes” to build knowledge on MongoDB in 20 minutes or less. An enhanced certification experience with 24/7 exam access and robust study materials, including videos, study guides, and practice questions. Additionally, MongoDB University now offers course certificates and digital certification badges through Credly that strengthen your professional profiles on LinkedIn, and can be shared across Twitter and Facebook. Adding these badges will enhance your opportunities for roles that require MongoDB experience and enter you into the Credly talent pool, making you visible to recruiters and hiring managers looking for specific certifications. Updated labs that provide guided, hands-on activities that allow you to practice what you’ve learned and see real-time results of your work. Getting started with MongoDB University Badges Previous certifications continue to be valid and will now include digital badges, so be on the lookout for an email from our badging partner, Credly. You can then accept your MongoDB Certified badge and share it on LinkedIn, Twitter, and Facebook. When you accept, you will also be included in the Credly Talent Directory , which enables recruiters, hiring managers, and others to connect with you about opportunities based on the specific digital credentials you’ve earned. In-progress learning If you currently have in-progress courses with MongoDB University, those courses must be completed by December 1st in order to have your certificate transfer to the new University experience. If you don’t complete a course by the deadline, that course will still be available in the new University, but your progress will not be transferred. Ready to explore all the new possibilities? Get started with the next generation of MongoDB University today. Start learning today !

November 15, 2022

Next →

MongoDB Announces Leadership Transition

Dev Ittycheria, President and Chief Executive Officer, shared the following message with MongoDB employees this morning. This is the hardest email I have ever had to write to all of you. If you have not seen the announcement, I have decided to retire as CEO. Effective November 10, 2025, Chirantan “CJ” Desai will become the new CEO of MongoDB. This was not an easy decision for me. The process to get to this point has been deeply emotional, as I care profoundly about MongoDB and the people who have made the company what it is today. This news may come as a surprise, and for some, perhaps even a shock. That’s natural. Leadership transitions can evoke a range of reactions. I want to share why this is happening, and why it’s the right thing for MongoDB. Every personnel change, including the most senior leadership changes, involves two key decisions: first, recognizing that it is the right time for change, and second, selecting the best person to replace the person leaving. This email is intended to explain both decisions. Earlier this year, as part of our regular succession planning process, the Board and I discussed my long-term commitment. They asked if I would continue as CEO for another five years. After many conversations with my family and the Board, I realized I could not make that commitment. Some CEOs see their title as their identity. I do not. My core responsibility is to serve in the company's best interests. The company is primed for a new leader. One with a fresh perspective, grounded in experience and skills needed to guide MongoDB through its next evolution as a company, what we call MongoDB 3.0. Consequently, I informed the Board that I would commit to two more years to help find a successor. That began the search process for a suitable successor. To our surprise and delight, what we thought would easily take 12 to 24 months happened much faster than anyone expected. After engaging with multiple qualified candidates, we found the right successor in CJ. CJ is uniquely qualified for this role. CJ brings the rare growth-at-scale experience that will help continue to build MongoDB into an iconic technology company. At ServiceNow, he was the only executive to work directly with three of its highly regarded public company CEOs and played a pivotal role in organically scaling the company from just over $1 billion to more than $10 billion in revenue. Only a handful of independent software companies have ever reached that milestone. CJ helped transform ServiceNow from a product company to a platform company, scaled engineering, drove go-to-market excellence, and engaged deeply with investors. More recently, as President of Product and Engineering at Cloudflare, he helped fuel strong growth and stock performance. CJ also possesses the personal qualities needed to succeed as CEO. He is humble, eager to learn, and wants to draw on the perspectives of the people at MongoDB and other stakeholders to inform his thinking. This blend of experience, judgment, and character gives me full confidence that he is well-equipped to lead MongoDB through its next phase of growth. I often think of MongoDB’s journey as a long and extraordinary expedition. For the past eleven years, I have had the privilege of serving as its guide, helping chart the course, rally the team, and climb together through both calm and challenging terrain. Along the way, we have reached remarkable summits and proven what is possible through relentless innovation, persistence, and teamwork. Now it is time for a new guide to lead the next stage of the ascent and take MongoDB to even greater heights. CJ is the right leader to take MongoDB to the next summit. MongoDB is on a strong footing, with a clear strategy, an exceptional leadership team, a product platform that is more relevant than ever, and a business that is executing well. The rise of AI and the explosion of data-intensive applications play directly to MongoDB’s strengths. Our technology sits at the center of how modern applications are built and how organizations will harness data to power intelligent, adaptive systems. I am confident MongoDB is perfectly positioned to capture this next wave of innovation. As for me, I am not running away from MongoDB or leaving to join another company as CEO. I will remain on the Board and work closely with CJ to ensure a seamless transition. Over the years, this role has demanded an enormous amount of focus and energy; as a result, there are many things I’ve missed doing along the way. I’m looking forward to being more present for those moments — from simple time with my family to experiences and travel we’ve long put off. I plan to hold on to my MongoDB stock, as I firmly believe in the people and the opportunity, knowing that MongoDB’s best days are ahead of it. Yes, change can be unsettling. I’m sure you will have many questions about this change, such as why now, why CJ is the best person to lead the company, and what this means for you. We will hold an all-hands meeting tomorrow at 10:30AM ET to discuss this transition, introduce CJ and take your questions. That being said, I want to emphasize that the right change at the right time is how great companies get stronger. Just as a championship team refreshes its roster to stay competitive, MongoDB is bringing in new leadership, including other recent C-suite leaders who came before CJ, to drive our next phase of growth. This is not an ending; it’s the founding of a new moment. I am incredibly proud of what we have built together and genuinely excited about what lies ahead with CJ leading us forward. I also want to thank each of you for making this journey so meaningful. Words cannot fully capture my gratitude for your passion, creativity, and belief in building something truly special. I have often said that I want MongoDB to be an inflection point in people’s careers, a place where they can grow, take risks, and do the best work of their lives. I can say without hesitation that it has been exactly that for me. The skills I have developed, the experiences I have gained, and the relationships I have formed here have shaped me more than any other chapter in my professional life. I will carry them with me always, and will continue to cheer for and support MongoDB every step of the way. --Dev

November 3, 2025