MongoDB for GIANT Stories

Using MongoDB with Hadoop & Spark: Part 3 - Spark Example & Key Takeaways

Matt Kalan
Feb 19, 2015

Welcome to the final part of our three-part series on MongoDB and Hadoop. In this post, we'll look at a Spark example.

  1. Introduction & Setup of Hadoop and MongoDB
  2. Hive Example
  3. Spark Example & Key Takeaways

For more detail on the use case, see the first paragraph of part 1.

Spark Example
To use Spark, I opted for Python from the interactive shell command “pyspark”. This gives you...

Read More

What’s New in MongoDB 3.0 Part 1: 95% Reduction in Operational Overhead and Security Enhancements

MongoDB
Feb 18, 2015

It’s been a couple of weeks now since our co-founder and CTO announced MongoDB 3.0. The reaction from the community has blown us away as architects, developers and ops teams look to bring MongoDB’s flexibility and ease-of-use to a whole new set of projects.

In the lead-up to its General Availability, we’re going to provide more detail on specific areas of MongoDB 3.0. Of course we’ll cover...

Read More

Using MongoDB with Hadoop & Spark: Part 2 - Hive Example

Matt Kalan
Feb 17, 2015

Welcome to part two of our three-part series on MongoDB and Hadoop. In part one, we introduced Hadoop and showed how to set it up. In this post, we'll look at a Hive example.

  1. Introduction & Setup of Hadoop and MongoDB
  2. Hive Example
  3. Spark Example & Key Takeaways

For more detail on the use case, see the first paragraph of part 1.

Summary

Use case: aggregating 1-minute intervals of stock prices into 5...

Read More

Using MongoDB with Hadoop & Spark: Part 1 - Introduction & Setup

Matt Kalan
Feb 17, 2015

Hadoop is a software technology designed for storing and processing large volumes of data distributed across a cluster of commodity servers and commodity storage. Hadoop was initially inspired by papers published by Google outlining its approach to handling large volumes of data as it indexed the Web. Many organizations are now harnessing the power of Hadoop and MongoDB together to create...

Read More

Weather of the Century: Part 1

MongoDB
Feb 13, 2015

Ever wonder what the weather was like in your birthplace the day you were born? Building an app to answer this question for anyone (from this planet at least) was the subject of talks on visualization and performance at MongoDB World 2014.

In those talks, MongoDB Engineers André Spiegel and Jesse Davis presented the Weather of the Century App, which takes as input any location on Earth and a...

Read More

MongoDB World Registration Open

MongoDB
Feb 13, 2015

100+ MongoDB tech talks, 2,000 fellow MongoDB users, hands-on developer spaces, food, sponsor swag, round-the-clock coffee and a party under a giant whale. What could be better? We can’t think of much.

To experience all of the above, join us for MongoDB World on June 1-2 in New York City. Space is limited. (Honestly it is. Don’t be stuck on the sidewalk looking in, like last year.)

Here’s a...

Read More

MongoDB Security Best Practices

Eliot Horowitz
Feb 10, 2015
Editor's note: This post was edited on June 23, 2015 to reflect the change from MongoDB Management Service to MongoDB Cloud Manager. Learn more here.

MongoDB takes security very seriously. Recently a team of German researchers discovered unsecured instances of MongoDB running openly on the internet. Readers who are concerned about access to their systems are reminded of the following...

Read More

Improved password-based authentication in MongoDB 3.0: SCRAM Explained (Part 2)

MongoDB
Feb 6, 2015

In MongoDB 3.0, SCRAM is now the default password-based authentication mechanism. In the first post in this series, we introduced the threat model SCRAM is designed to counter and explained the inner workings of the protocol. Now we can take a closer look at how SCRAM addresses the attacks that follow from this threat model. For a more detailed discussion, please see the Security...

Read More

Capacity Planning and Hardware Provisioning for MongoDB In Ten Minutes

Chad Tindel
Feb 3, 2015

Most MongoDB deployments run on a cluster of multiple servers. This may introduce capacity planning and provisioning complexities beyond those of traditional databases. Solution Architect Chad Tindel’s Hardware Provisioning presentation from MongoDB World describes some best practices for operations teams sizing their MongoDB deployments.

There are two important concepts related to MongoDB’s...

Read More

Improved Password-Based Authentication in MongoDB 3.0: SCRAM Explained - Pt. 1

MongoDB
Feb 2, 2015

MongoDB 3.0 features a new password authentication mechanism called Salted Challenge Response Authentication Mechanism, or SCRAM (see RFC 5802). MongoDB 3.0 uses SCRAM as the default authentication mechanism, replacing MONGODB-CR.
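
To make the idea concrete, here is a minimal sketch of the key derivation and proof check that RFC 5802 describes for SCRAM-SHA-1, written with Python's standard library only. It is an illustration of the generic RFC computation, not MongoDB's server or driver code: the function names, the sample password, the salt, the iteration count, and the auth message used below are all assumptions made up for this example, and any MongoDB-specific password preprocessing is not reproduced here.

```python
import hashlib
import hmac


def _xor(a: bytes, b: bytes) -> bytes:
    """Byte-wise XOR of two equal-length byte strings."""
    return bytes(x ^ y for x, y in zip(a, b))


def derive_keys(password: bytes, salt: bytes, iterations: int):
    """Return (stored_key, server_key), the values a server would keep.

    RFC 5802 defines Hi() as PBKDF2 with HMAC as the PRF, so
    hashlib.pbkdf2_hmac computes SaltedPassword directly.
    """
    salted = hashlib.pbkdf2_hmac("sha1", password, salt, iterations)
    client_key = hmac.new(salted, b"Client Key", hashlib.sha1).digest()
    stored_key = hashlib.sha1(client_key).digest()
    server_key = hmac.new(salted, b"Server Key", hashlib.sha1).digest()
    return stored_key, server_key


def client_proof(password: bytes, salt: bytes, iterations: int,
                 auth_message: bytes) -> bytes:
    """Compute ClientProof = ClientKey XOR HMAC(StoredKey, AuthMessage)."""
    salted = hashlib.pbkdf2_hmac("sha1", password, salt, iterations)
    client_key = hmac.new(salted, b"Client Key", hashlib.sha1).digest()
    stored_key = hashlib.sha1(client_key).digest()
    signature = hmac.new(stored_key, auth_message, hashlib.sha1).digest()
    return _xor(client_key, signature)


def verify_client_proof(stored_key: bytes, auth_message: bytes,
                        proof: bytes) -> bool:
    """Server-side check: recover ClientKey from the proof and hash it."""
    signature = hmac.new(stored_key, auth_message, hashlib.sha1).digest()
    recovered_client_key = _xor(proof, signature)
    candidate = hashlib.sha1(recovered_client_key).digest()
    return hmac.compare_digest(candidate, stored_key)


if __name__ == "__main__":
    # Hypothetical values, chosen only for illustration.
    salt = b"\x01" * 16
    auth_message = b"example-auth-message"
    stored_key, _server_key = derive_keys(b"pencil", salt, 4096)
    proof = client_proof(b"pencil", salt, 4096, auth_message)
    print(verify_client_proof(stored_key, auth_message, proof))
```

In this sketch the server never needs the password itself: it stores only the salt, the iteration count, and the two derived keys, and it checks the client's proof against the stored key.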

In this article we will give an overview of the motivations behind SCRAM's design, a basic analysis of its security properties, and a tutorial on how to use it in...

Read More