Tags • Dependencies
Elephant Below the Waterline: Hadoop Dependencies in Iceberg
Apache Iceberg is a widely adopted open table format, but it carries historical baggage from Apache Hadoop. With Apache Parquet as a popular storage format for Iceberg, tight coupling to Hadoop classes presents maintenance challenges for integration with the Iceberg ecosystem.
Getting Started with Hatch for Python Projects
Hatch enables building, managing, and publishing Python projects using standardized configuration and straightforward commands. As a project of the Python Packaging Authority, Hatch implements modern project management conventions and provides a plugin system for extensible integration with project lifecycle actions. With default project templates, Hatch supports static analysis, unit testing, code coverage, binary packaging, module publishing, and more through an intuitive command-line interface.
Managing Logging Libraries in Apache NiFi
Vulnerabilities in Log4j 2 and other logging libraries have prompted increased scrutiny across many products. Apache NiFi integrates with a wide variety of services that require various dependencies, including multiple types of logging. NiFi 1.15.2 removed and excluded several unnecessary libraries, highlighting the importance of managing logging dependencies.