Step-by-step tutorials, deep dives into high-scale technologies, and guides to connecting to your data.

Recommendation Engine

Help your users quickly discover what they want.
  • Open, 100% customizable
  • Highly scalable
  • Fanatical documentation

MongoDB → Hadoop

Use Hadoop to analyze data from MongoDB.
  • Offload computation
  • Repeatable, schedulable
  • Expressive modeling

Custom Graphs & Dashboards

Link with DataHero for effortless data visualization.
  • Monitor Key Metrics
  • Drag-and-Drop Interface
  • Automatic Updating

Amazon Redshift Data Warehouse

Build an ETL Pipeline to Amazon Redshift.
  • Highly scalable
  • Flexible data formats
  • Open, 100% customizable

Build a Custom Data App

Mortar is all-purpose—use it to roll your own.
  • Effortless operations
  • Work local, run remote
  • Free for public code


Open framework for local development, plus high-scale distributed cloud execution.
View All Articles


The omnivorous language found wherever there are huge piles of data to be processed.
View All Articles


Hadoop is practically synonymous with big data. EMR = AWS’s Hadoop service.
View All Articles


Industrial-strength pipelines for multi-stage data processing at scale.
View All Articles

Amazon S3

The world’s most widely-used data storage service.
View All Articles


An easy, flexible, scalable database.
View All Articles

SQL Databases

PostgreSQL, MySQL, Oracle... anything with JDBC.
View All Articles


AWS's high-scale, key-value data store as a service.
View All Articles


AWS's fully managed, data warehouse service.
View All Articles

Google Drive

Store data to Google Drive. Visualize with DataHero.
View All Articles


One of our favorite logging services.
View All Articles