Cloudflare Data Platform
12 hours ago
- #Data Platform
- #Cloudflare
- #Apache Iceberg
- Cloudflare announced the public beta of R2 Data Catalog, a managed Apache Iceberg catalog on R2 object storage.
- Three new products were introduced: Cloudflare Pipelines, R2 Data Catalog, and R2 SQL, forming the Cloudflare Data Platform.
- Cloudflare Pipelines processes events via Workers or HTTP, transforms them with SQL, and ingests them into Iceberg or R2.
- R2 Data Catalog manages Iceberg metadata and now includes compaction to improve query performance.
- R2 SQL is a distributed SQL engine for petabyte-scale queries on R2 data.
- The platform runs on Cloudflare's global infrastructure, supports open standards, and has no egress fees.
- R2 Data Catalog simplifies Iceberg setup and maintenance, with automatic compaction for better performance.
- R2 SQL allows serverless querying of R2 Data Catalog tables, with future plans for expanded SQL capabilities.
- Future updates include Logpush integration, user-defined functions, and enhanced R2 SQL features.
- The platform is designed to be easy to use with affordable, usage-based pricing.