`orc` vs `parquet` vs `avro`
https://qkqhxla1.tistory.com/1136
parquet vs orc vs avro (big data file format )
1. www.datanami.com/2018/05/16/big-data-file-formats-demystified/ 공통점. 3개 타입은 전부 하둡에 저장하는데에 최적화되어있다. orc, parquet, avro 3개 전부 기계가 읽을수 있는 바이너리 포맷이다. orc, p..
qkqhxla1.tistory.com
Why is parquet best for Spark and not ORC, although both are columnar-based file formats?
Answer: In order to understand why Parquet file format is best suited for your requirement when using Apache Spark (as the execution engine), we have to understand and appreciate the features of it to arrive at the answer. To begin with Apache Spark is opt
www.quora.com
https://medium.com/@dhareshwarganesh/benchmarking-parquet-vs-orc-d52c39849aef
Benchmarking PARQUET vs ORC
In this article, we conduct few experiments on Parquet and ORC file system and conclude the advantages and disadvantages over each other.
medium.com