Mortar has joined Datadog, the leading SaaS-based monitoring service for cloud applications. Read more about what this means here.

PiggyBank and Pig Libraries


PiggyBank

PiggyBank is a collection of useful LOAD, STORE, and UDF functions. Mortar compiles and registers it automatically, so you can use anything you find there.

For example, to use the CommonLogLoader from PiggyBank, you can do:

data = LOAD 's3n://path/to/input'
       USING org.apache.pig.piggybank.storage.apachelog.CommonLogLoader()
       AS (addr: chararray, logname: chararray, user: chararray, time: chararray,
           method: chararray, uri: chararray, proto: chararray,
           status: int, bytes: int);

Additional Pig Libraries

  • DataFu: a collection of Pig algorithms released by LinkedIn