Apache Hudi

Apache Hudi (Hadoop Upserts Deletes and Incrementals) is an open-source data lakehouse platform optimized for upserts, deletes, and incremental data processing. It supports Copy-on-Write (COW) and Merge-on-Read (MOR) table types and is used for CDC-based data pipelines and streaming analytics.
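The difference between the two table types named above can be illustrated with a toy in-memory model. This is a conceptual sketch only, not Hudi's API: the class names `CopyOnWriteTable` and `MergeOnReadTable`, the dict-based "files", and the `compact` method are hypothetical stand-ins chosen for illustration. The assumption is the usual trade-off: COW pays the merge cost at write time by rewriting the base file, while MOR appends changes to a log and merges them when the table is read or compacted.

```python
from dataclasses import dataclass, field


@dataclass
class CopyOnWriteTable:
    """Toy COW table (hypothetical): every upsert rewrites the base file."""
    base: dict = field(default_factory=dict)
    rewrites: int = 0

    def upsert(self, records: dict) -> None:
        # Build a new version of the base file with the changes merged in.
        self.base = {**self.base, **records}
        self.rewrites += 1

    def read(self) -> dict:
        # No merging at read time: the base file is already current.
        return dict(self.base)


@dataclass
class MergeOnReadTable:
    """Toy MOR table (hypothetical): upserts go to a log, merged on read."""
    base: dict = field(default_factory=dict)
    log: list = field(default_factory=list)

    def upsert(self, records: dict) -> None:
        # Cheap write: append the batch to the log, leave the base untouched.
        self.log.append(dict(records))

    def read(self) -> dict:
        # Merge the base with every logged batch, oldest first.
        merged = dict(self.base)
        for delta in self.log:
            merged.update(delta)
        return merged

    def compact(self) -> None:
        # Fold the log into a new base so later reads skip the merge.
        self.base = self.read()
        self.log.clear()


cow, mor = CopyOnWriteTable(), MergeOnReadTable()
for batch in [{"k1": "a", "k2": "b"}, {"k2": "c"}]:
    cow.upsert(batch)
    mor.upsert(batch)

# Both tables expose the same logical contents after the same upserts.
print(cow.read() == mor.read())
```

In this model both tables return the same rows after the same upserts; they differ only in where the merge work happens, which is why MOR tends to suit write-heavy streaming ingestion and COW suits read-heavy workloads.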

API entry from apis.yml

aid: table-format:apache-hudi
name: Apache Hudi
description: Apache Hudi (Hadoop Upserts Deletes and Incrementals) is an open-source data lakehouse platform
  optimized for upserts, deletes, and incremental data processing. It supports Copy-on-Write (COW) and
  Merge-on-Read (MOR) table types and is used for CDC-based data pipelines and streaming analytics.
humanURL: https://hudi.apache.org/
tags:
- Apache Hudi
- Data Lake
- CDC
- Incremental Processing
properties:
- type: Documentation
  url: https://hudi.apache.org/docs/overview/
- type: GitHub
  url: https://github.com/apache/hudi