티스토리 뷰

728x90

https://hudi.apache.org/cn/blog/2022/01/14/change-data-capture-with-debezium-and-apache-hudi/

 

Change Data Capture with Debezium and Apache Hudi | Apache Hudi

As of Hudi v0.10.0, we are excited to announce the availability of Debezium sources for Deltastreamer that provide the ingestion of change capture data (CDC) from Postgres and Mysql databases to your data lake. For more details, please refer to the origina

hudi.apache.org

https://www.alibabacloud.com/blog/how-to-analyze-cdc-data-in-iceberg-data-lake-using-flink_597838

 

How to Analyze CDC Data in Iceberg Data Lake Using Flink

This article discusses the challenges and limitations of various solutions in CDC data analysis and describes how to use Flink and Iceberg to overcome them.

www.alibabacloud.com

https://mongsil-jeong.tistory.com/38

 

Debezium MySql CDC 카프카 커넥트

Debezium MySQL CDC Connector 디비지움 커넥터는 카프카 개발자들이 만든 커넥터이다. MySql의 모든 Row-Level 변경사항을 모니터링하고 기록할 수 있다. MySql 서버에 접속한뒤, 일정하게 Database의 스탭샷을

mongsil-jeong.tistory.com

https://towardsdatascience.com/data-lake-change-data-capture-cdc-using-apache-hudi-on-amazon-emr-part-2-process-65e4662d7b4b

 

Data Lake Change Data Capture (CDC) using Apache Hudi on Amazon EMR — Part 2—Process

Easily process data changes over time from your database to Data Lake using Apache Hudi on Amazon EMR

towardsdatascience.com

https://iceberg.apache.org/

 

Apache Iceberg

SELECT count(*) FROM nyc.taxis 2,853,020 SELECT count(*) FROM nyc.taxis FOR VERSION AS OF 2188465307835585443 2,798,371 SELECT count(*) FROM nyc.taxis FOR TIMESTAMP AS OF TIMESTAMP '2022-01-01 00:00:00.000000 Z' 2,798,371

iceberg.apache.org

 

728x90
댓글