티스토리 뷰

공부

[aws] s3, s3n, s3a

승가비 2022. 11. 26. 05:42
728x90
Generation Usage Description
First – s3 s3:\\ s3 which is also called classic (s3: filesystem for reading from or storing objects in Amazon S3 This has been deprecated and recommends using either the second or third generation library.
Second – s3n s3n:\\ s3n uses native s3 object and makes easy to use it with Hadoop and other files systems. This is also not the recommended option.
Third – s3a s3a:\\ s3a – This is a replacement of s3n which supports larger files and improves in performance.

https://sparkbyexamples.com/spark/spark-read-text-file-from-s3/

 

Spark Read Text File from AWS S3 bucket

In this Spark sparkContext.textFile() and sparkContext.wholeTextFiles() methods to use to read test file from Amazon AWS S3 into RDD and spark.read.text()

sparkbyexamples.com

 

728x90
댓글