📢 Special thanks to our speaker: Karim Lamouri
1.1 Avro - What is it ?
- Avro is used for serialization and deserialization of payloads
- More compact than
JSON
- Is a container file - It has both
the schema
andthe payload
- Allows RPC calls
- Fast serialization and smaller than JSON (do not repeat all the JSON keys as the schema is at the top of the file)
- Allows schema documentation
- Format easy to leverage on Spark or ETL pipelines
You can always deserialize an AVRO file as the schema is embedded in the file itself !