Apache Avro

A data serialization framework developed within the Apache Hadoop project. Avro schemas, which define data types and protocols, are written in JSON; the data itself is serialized in a compact binary format.
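
To make the split between a JSON schema and binary data concrete, here is a minimal sketch in plain Python. It does not use the Avro library; it hand-rolls only two primitive encodings from Avro's binary format (long and string) and a flat record. The User schema, its field names, and the helper functions are hypothetical and exist only for illustration.

```python
import json

# Hypothetical schema for illustration only; written in JSON, as Avro schemas are.
SCHEMA = json.loads("""
{
  "type": "record",
  "name": "User",
  "fields": [
    {"name": "name", "type": "string"},
    {"name": "age",  "type": "long"}
  ]
}
""")


def encode_long(n: int) -> bytes:
    """Encode a 64-bit integer as a zig-zag, variable-length value."""
    # Zig-zag maps small negative and positive numbers to small unsigned ones.
    z = ((n << 1) ^ (n >> 63)) & 0xFFFFFFFFFFFFFFFF
    out = bytearray()
    while z > 0x7F:
        out.append((z & 0x7F) | 0x80)  # low 7 bits, continuation bit set
        z >>= 7
    out.append(z)
    return bytes(out)


def encode_string(s: str) -> bytes:
    """Encode a string as its UTF-8 byte length (a long) followed by the bytes."""
    data = s.encode("utf-8")
    return encode_long(len(data)) + data


# Only the two primitive types used by the sketch schema are supported here.
ENCODERS = {"long": encode_long, "string": encode_string}


def encode_record(schema: dict, record: dict) -> bytes:
    """Concatenate field encodings in schema order; no field names are written."""
    return b"".join(
        ENCODERS[field["type"]](record[field["name"]])
        for field in schema["fields"]
    )


encoded = encode_record(SCHEMA, {"name": "Al", "age": 27})
print(encoded.hex())  # 04416c36
```

The record occupies four bytes because field names and types live in the schema rather than in the data. Real Avro files also carry a header with the schema and sync markers, which this sketch omits.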

Scuba includes an ingest transformer, avro_load, for ingesting files in the Apache Avro format.

