Modernize data between siloed data warehouses with Infosys Data Mesh and MongoDB
January 14, 2021
The Data Challenge in Digital Transformation
Enterprises that embark on a Digital transformation often face significant challenges with accessing data in a timely manner—an issue that can quickly impede customer satisfaction. To deliver the best digital experience for customers, companies must create the right engagement strategy. This requires all relevant data available in the enterprise be readily accessible.
For example, when a customer contacts an insurance company, it is important that the company has a comprehensive understanding of the customer’s background as well as any prior interactions, so they can orchestrate the best possible experience. Data is available in both BI (Business Intelligence) systems, like Enterprise Data Warehouses, and OI (Operational Intelligence) systems, like policy and claim systems. There is a need to bring these BI and OI systems together to avoid any disruption to the digital functions that may delay synchronization. Data removed from an operational system loses context. Re-establishing this domain context and providing persona-based access to the data requires domain-oriented, decentralized data ownership, as well as architecture.
Ultimately, organizations seek to use data as a key to fueling the products and services they provide their customers. This data should minimize the cost of customer research—but the data needs to be trusted and high quality. Companies need access to these siloed sources of data in a seamless self-service approach across various product life cycles.
The Challenge of Centralized Data
Historically, businesses have handled large amounts of data from various sources by ingesting it all into a centralized database (data warehouse, data lake, or data lake on cloud). They would then feed insight drivers, like reporting tools and dashboards as well as online transaction processing applications, from that central repository. The challenge with this approach is the broken link between analytical systems and transactional systems that impedes the digital experience. Centralized systems, like data warehouses, introduce latency and fail to meet the real time response and performance levels needed to build next-generation digital experiences.
What is Infosys Data Mesh?
Data Mesh helps organizations bridge the chasm between analytics and application development teams within large enterprises. Data Mesh is an architecture pattern that takes a new approach to domain-driven distributed architecture and the decentralization of data. Its basic philosophy is to encapsulate the data, its relationships, context, and access functionality into a data product with guaranteed quality, trust, and ease of use for business consumption.
Data Mesh is best suited for low-latency access to data assets used in digital transformations that are intended to improve experience through rich insights. With its richer domain flavor — distributed ownership, manageability, and low latency access — Data Mesh is best positioned as a bridge between transactional (consuming applications) and analytical systems.
This diagram depicts the high-level solution view of Data Mesh:
Data Mesh Key Design Principles
- Domain-first approach.
- Data as a product. Data Mesh products share the following attributes which maximize usability and minimize friction:
- Self-described: metadata is precise and accurate
- Discoverable and addressable: products are uniquely identifiable and easy to find
- Secure and well-governed: only those who are granted access have it
- Trustworthy: proper data quality conrtols are applie, SLA/SLOs are maintainted
- Open standard and interoperable: data formats — XBRL, JSON
- Build new products easily. Any cross-functional team can build a new, enterprise-level product in an existing domain and/or fro existing products
- Simplified access for multiple technology stacks. Polygot data and ports, cloud and non-cloud.
- Common infrastructure and services for all data pipelines and catalogs.
Platform Requirements for a Data Mesh
To build a data mesh, companies need a database platform that can create domain-driven data products that meet various enterprise needs. This includes:
- Flexible data structures — to accomodate new behaviors
- An API-driven construct — to access current data products and build new domain-data ones
- Support for high-performance query on large-scale data structures
- A shared, scalable infrastructure
Why MongoDB is the Optimal Platform for Infosys Data Mesh
MongoDB is the best platform for realizing the Infosys Data Mesh architecture and powering analytics-driven enterprises because it provides:
- A flexible document model and a poly-cloud infrastructure availability so teams can easily modify and enrich flat or hierarchical data models
- MongoDB Realm Webhooks to create service APIs which connect data across products and enable consumption needs based on business context
- A scalable, shared infrastructure and support for high-performance querying of large scale data
- Service APIs for constructing Infosys Data Mesh
Two use cases:
|Case 1:||Case 2:|
A wealth management firm offers a variety of products to its customers — things like checking and savings accounts, trading, credit and debit cards, insurance, and investment vehicles.
Using the Infosys Data Mesh model, the firm’s data owners create domain-data products categorized by customer and product, and then curate and publish them through a technology-agnostic, API-driven service layer.
Consumers can then use this service layer to build the data products they need to carry out their business functions.
The Risk and Finance unit of a large global bank has multiple regional data lakes catering to each region’s management information system and analytical needs. This poses multiple challenges for creating global data products:
To address these challenges, the bank creates an architecture of regional data hubs for region-specific products and, as with Case 1, makes those products available to authorized consumers through a technology-agnostic, API-driven service layer.
Next, it implements an enterprise data catalog with an easy-to-use search interface on top of the API layer. The catalog’s query engine executes cross-hub queries, creating a self-service model for users to seamlessly discover and consume data products and to align newer ones with their evolving business needs.
Enterprise security platform integration ensures that all regulatory and compliance requirements are fully met.
How Businesses Overall Can Benefit
- Data and Insights become pervasive and consumable across applications and personas
- Speed-to-insights (including real time) enable newer digital experiences and better engagement leading to superior business results
- Self-service through trusted data products is enabled
Infosysy DNA Assets on MongoDB Accelerates the Creation of Industry-Specific Domain Data Products
|Infosys Genome||Infosys Data Prep||Infosys Marketplace|
Creates the foundation for Data Mesh by unifying semantics across industries
Guides consumers through the product creation process with a scalable data preparation framework
Enables discovery and consumption of domain-data products via an enterprise data catalog
Download our Modernization Guide for information about which applications are best suited for modernization and tools to get started.