apache spark - Avro in-memory with parquet as storage -


i new parquet , trying understand various intricacies come using parquet. 1 of points pop parquet storage format should used avro in-memory representation. unfortunately, dont understand how avro can useful parquet. can please explain combination me?

thanks!!

avro binary format represent object structure. has advantage of being compressed.

i don't know enough parquet's inner details why avro, may optimized store specific format. nevertheless using avro object format storage big space saver.


Comments