Open Source and the Data Lakehouse: Apache Iceberg and Project Nessie
The data lakehouse concept combines the strengths of data lakes and data warehouses.
Alex Merced is a developer advocate for Dremio, a developer, and an instructor. He has worked with companies including GenEd Systems, Crossfield Digital, CampusGuard, and General Assembly, and is a co-author of the O'Reilly book Apache Iceberg: The Definitive Guide. He has spoken at events including Data Day Texas, OSA Con, P99Conf, and Data Council. His tech content can be found in blogs, videos, and his podcasts, Datanation and Web Dev 101. He has also contributed to the JavaScript and Python communities by developing libraries such as SencilloDB, CoquitoJS, and dremio-simple-query.