Latest Posts
Elephant Below the Waterline: Hadoop Dependencies in Iceberg
Apache Iceberg is an open table format with widespread adoption, but it carries the weight of historical baggage from Apache Hadoop. With Apache Parquet as a popular storage format for Iceberg, tight coupling to Hadoop classes presents maintenance challenges for integration with the Iceberg ecosystem.
Bringing Kubernetes Clustering to Apache NiFi
Apache NiFi 2.0.0-M1 introduced extensible leader election, adding first-class support for clustering on Kubernetes using native services, without the need for Apache ZooKeeper. Based on the building blocks of Kubernetes ConfigMaps and Leases, new NiFi clustering capabilities reduce both configuration complexity and operational overhead.
Getting Started with Hatch for Python Projects
Hatch enables building, managing, and publishing Python projects using standardized configuration and straightforward commands. As a project of the Python Packaging Authority, Hatch implements modern management conventions and provides a plugin system for extensible integration with project lifecycle actions. With default project templates, Hatch supports static analysis, unit testing, code coverage, binary packaging, module publishing, and more, through an intuitive command-line interface.