Pearson

3 results

A Mobile-First, Cloud-First Stack at Pearson

Pearson, the global online education leader, has a simple yet grand mission: to educate the world; to have 1 billion students around the globe touching their content on a regular basis. They are growing quickly, especially to emerging markets where the primary way to consume content is via mobile phones. But to reach global users, they need to deploy in a multitude of private and public data centers around the globe. This demands a mobile-first, cloud-first platform, with the underlying goal to improve education efficacy. In 2018, Pearson will be announcing to the public markets what percentage of revenue is associated with the company’s efficacy. There’s no question; that’s a bold move. As a result, apps have to be built in a way to measure how users are interacting with them. Front and center in Pearson’s strategy is MongoDB. With MongoDB, as Pearson CTO Aref Matin told the audience at MongoDB World ( full video presentation here ), Pearson was able to replace silos of double-digit, independent platforms with a consolidated platform that would allow for measuring efficacy. “A platform should be open, usable by all who want to access functionality and services. But it’s not a platform until you’ve opened up APIs to the external world to introduce new apps on top of it,” declared Matin. A key part of Pearson’s redesigned technology stack, MongoDB proved to be a good fit for a multitude of reasons, including its agility and scalability, document model and ability to perform fast reads and ad hoc queries. Also important to Matin was the ability to capture the growing treasure trove of unstructured data, such as peer-to-peer and social interactions that are increasingly part of education. So far, Pearson has leveraged MongoDB for use cases such as: Identity and access management for 120 million user accounts, with nearly 50 million per day at peak; Adaptive learning and analytics to detect, in near real-time, what content is most effective and identify areas for improvement; and The Pearson Activity Framework (akin to a “Google DoubleClick” according to Matin), which collects data on how users interact with apps and feeds the analytics engine. All of this feeds into Matin’s personal vision of increasing the pace of learning. “Increasing the pace of learning will be a a disruptive force,” said Matin. “If you can reduce the length of time spent on educating yourself, you can learn a lot more and not spend as much on it. That will help us be able to really educate the world at a more rapid pace.” **Sign up to receive videos and content from MongoDB World.** MktoForms2.loadForm("//app-abk.marketo.com", "017-HGS-593", 1151);

July 31, 2014

Pearson / OpenClass Uses MongoDB for Social Learning Platform

We recently spoke with Brian Carpio of Pearson about OpenClass , a new project from Pearson with deep Google integration. What is OpenClass? OpenClass is a dynamic, scalable, fully cloud-based learning environment that goes beyond the LMS. OpenClass stimulates social learning and the exchange of content, coursework, and ideas â€â€ù all from one integrated platform. OpenClass has all the LMS functionality needed to manage courses, but that's just the beginning. Why did you decide to adopt MongoDB for OpenClass? OpenClass leverages MongoDB as one of its primary databases because it offers serious scalability and improved productivity for our developers. With MongoDB, our developers can start working on applications immediately, rather than slogging through the upfront planning and DBA time that relational database systems require. Also, given that a big part of the OpenClass story will be how we integrate with both public and private cloud technologies, MongoDB support for scale-out, commodity hardware is a better fit than traditional scale-up relational database systems that generally must run on big iron hardware. Can you tell us about how you’ve deployed MongoDB? Currently we deploy MongoDB in our world-class datacenters and in Amazon's EC2 cloud environment with future plans to go to a private cloud technologies such as OpenStack. We leverage both Puppet and Fabric for deployment automation and rolling upgrades. We also leverage Zabbix and the mikoomi plugin for monitoring our MongoDB production servers. Currently each OpenClass feature / application leverages its own MongoDB replica set, and we expect to need MongoDB’s sharding features given the expected growth trajectory for OpenClass. What recommendations would you give to other operations teams deploying MongoDB for the first time? Automate everything! Also, work closely with your development teams as they begin to design an application that leverages MongoDB, which is good advice for any new application that will be rolled into production. I would also say to look at Zabbix as it has some amazing features related to monitoring MongoDB in a single replica set or in a sharded configuration that can help you easily identify bottlenecks and identify when it’s time to scale out your MongoDB deployment. Finally, I would suggest subscribing to the #mongodb irc channel, as well as the MongoDB Google Group , and don't be afraid to ask questions. I personally ask a lot of questions in the MongoDB Google Group and receive great answers not only from 10gen CTO Eliot Horowitz , although he does seem to answer a lot of my questions, but from a many other 10gen folks. What is in store for the future with MongoDB at Pearson? Our MongoDB footprint is only going to continue to grow. More and more development teams are playing with MongoDB as the foundation of their new application or OpenClass feature. We are working on migrating functionality out of both Oracle and Microsoft SQL Server to MongoDB where it makes sense to relieve the current stress on those incumbent database technologies. Thanks to Brian for telling us about OpenClass! Brian also blogs at www.briancarpio.com — be sure to check out his posts on MongoDB here and here and here and here and here . Tagged with: case study, Pearson, OpenClass, scalability, flexibility, ease of use

February 28, 2013

Pearson National Transcript Center runs MongoDB

High school students only have to worry about one transcript: their own. But for Pearson , a multi-billion dollar learning company that operates in over 70 countries and employs some 36,000 people, its transcript management problem is much bigger. Pearson Education manages the transcripts for over 14 million students from more than 25,000 institutions, and makes and allows NTC member institutions to securely send records and transcripts to any of over 137,000 academic institutions, not to mention employers, licensure agencies, and scholarship organizations. To manage this big data problem, Pearson turned to MongoDB as the underlying database for its National Transcript Center . Pearson’s National Transcript Center isn’t merely a data store for student transcripts. Pearson stores student data and also transforms it from one standard format to another, including PESC High School Transcript XML, PESC College Transcript XML, SPEEDE EDI, SIF Student Record Exchange, and others. Pearson also generates PDF copies of a student’s records, and provides print copies when electronic delivery is not available. The impetus to use MongoDB was a request to archive student data at the end of each year, rather than deleting it. If the student had graduated, why keep her records around? As it turned out, there was plenty of reasons, including the potential need to transfer records between higher educational institutions or on to employers. But how best to store and manage this student data? Pearson had been using an open-source relational database (RDBMS) to store the student records. However, Pearson ran into performance problems with this RDBMS, problems that would compound each year. The idea of taking a year’s worth of student records and sticking it in a separate table, then sharding over and over as the years passed was going to make performance even worse. So Pearson turned to a key-value NoSQL database. Unfortunately, this too, posed problems. Pearson had no idea what a student record would look like in the future and so needed a dynamic schema. The company did not want to keep creating new tables as fields changed. Another problem with this key-value data store was that its filtering mechanism was hard to work with as Pearson employs very complicated queries, where the company searches different fields at the same time. It proved too difficult to get all that query data marshaled with a key-value database. At this point, Pearson decided to give MongoDB a try. Pearson’s development team immediately appreciated the ease of working with MongoDB’s flexible and dynamic data model. But it was perhaps MongoDB’s query mechanism that sold the team on using the NoSQL database. Mongo automatically converted Pearson’s queries from Hibernate into MongoDB. Pearson had Hibernate criteria calls, which allowed the team to avoid building SQL queries by hand. This work mapped directly to MongoDB, saving Pearson time and trouble. Other benefits became apparent over time. With Pearson’s original RDBMS approach, Pearson would have been forced to search gigantic tables when querying the student records. But with MongoDB, if Pearson starts putting too much data in a namespace, it can easily shard the namespace in MongoDB, for example, enabling search by district rather than of an entire state. Hence, instead of storing student data in a blob, as happened with the RDBMS, Pearson is able to use MongoDB’s GridFS, enabling Pearson to keep files and metadata automatically synced and deployed across a number of systems and facilities. For students looking to get into a good college or employer, their transcript is their passport. By using MongoDB, Pearson has been able to boost performance for its end-users, all while improving ease of use and productivity for its developers. Tagged with: Pearson, education, National Transcript Center, GridFS, RDBMS, case study, MongoDB, NoSQL

February 14, 2013