You have MongoDB, a tremendously scalable database. You’re collecting a lot of data, but you know you need to do more with it. Specifically, you want to do things like:
Crucially, you want to do this without disturbing the applications that rely on your production MongoDB database.
This step-by-step tutorial will guide you through the process of integrating your MongoDB data with Hadoop and Pig via Mortar. With your MongoDB data in Hadoop, you will be able to run advanced algorithms and reports on it at scale. The aim of the tutorial is to get you up and running quickly, and to teach you a bit along the way about powerful technologies for working with your MongoDB data at scale.