The following is a guest post by Doug Daniels, CTO of Mortar Data Inc.
Today, we’re excited to announce integration between MongoLab and Mortar, the Hadoop platform for high-scale data science. If you have one of the 100,000+ databases at MongoLab, you can now seamlessly use Hadoop to:
- Run advanced algorithms (like recommendation engines)
- Build reports that run quickly in parallel against large collections
- Join multiple collections (and outside data) together for analysis
- Store results to Google Drive, back to MongoLab, or many other destinations
In this article we’ll show you how to connect your MongoLab database to Hadoop, and then use Hadoop to do something simple but very useful: gather schema information from an entire collection, including histograms of common values, data types, and more. Mortar handles all deployment, monitoring and cluster management, so no prior knowledge of Hadoop is required.
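To make "schema information" concrete, here is a minimal, in-memory Python sketch of the kind of summary described above: for each field, how often it appears, which data types it holds, and its most common values. This is an illustration only, not Mortar's implementation. The function name `profile_collection` and the sample documents are hypothetical, and the sketch assumes documents arrive as plain Python dicts. On Hadoop the same counting would be spread in parallel across the whole collection.

```python
from collections import Counter, defaultdict


def profile_collection(documents, top_n=3):
    """Summarize each top-level field: occurrence count, types seen, most common values.

    Hypothetical helper for illustration; assumes each document is a plain dict.
    """
    type_counts = defaultdict(Counter)   # field -> Counter of type names
    value_counts = defaultdict(Counter)  # field -> Counter of scalar values

    for doc in documents:
        for field, value in doc.items():
            type_counts[field][type(value).__name__] += 1
            # Only simple scalars go into the histogram; lists and nested
            # documents are counted by type but not by value.
            if isinstance(value, (str, int, float, bool, type(None))):
                value_counts[field][value] += 1

    return {
        field: {
            "count": sum(types.values()),
            "types": dict(types),
            "top_values": value_counts[field].most_common(top_n),
        }
        for field, types in type_counts.items()
    }


# Made-up sample documents standing in for a small collection.
sample_docs = [
    {"name": "ada", "age": 36, "tags": ["admin"]},
    {"name": "bob", "age": "unknown"},
    {"name": "ada", "age": 36},
]

profile = profile_collection(sample_docs)
for field, summary in profile.items():
    print(field, summary)
```

Even on this tiny sample, the summary surfaces the sort of thing a schema report is for: the `age` field holds both integers and strings, and `tags` appears in only one document.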