Mortar has joined Datadog, the leading SaaS-based monitoring service for cloud applications. Read more about what this means here.

Spark Help and Resources

Apache Spark is a general-purpose distributed computing engine for processing and analyzing large amounts of data.

Spark Tutorial

Our Spark tutorial is a quick way to get familiar with basic Spark concepts and to do some machine learning using Spark’s scalable machine learning library MLlib.

Learning Spark Book

The early release of the Learning Spark book is a handy resource.

References

The most complete references are available on the official Spark documentation page. Of particular note are the docs for the Pyspark API and for Spark’s RDD operations.