• Data Pipelines with Apache Airflow

  • Nov 26 2024
  • Length: 44 mins
  • Podcast

Data Pipelines with Apache Airflow

  • Summary

  • This Book provides a comprehensive guide to Apache Airflow, a powerful open-source workflow management platform commonly used in data-intensive environments. It covers the fundamentals of Airflow, including defining data pipelines as directed acyclic graphs (DAGs), scheduling and executing these pipelines, monitoring their performance, and handling failures. The book also explores advanced topics such as templating tasks, building custom components, integrating with external systems, and designing tests for your pipelines. The authors then demonstrate how to deploy and operate Airflow in production environments, including securing the system, managing resources efficiently, and collecting metrics for monitoring. Finally, the book includes detailed guidance on deploying Airflow in various cloud environments, including AWS, Azure, and GCP.


    You can listen and download our episodes for free on more than 10 different platforms:
    https://linktr.ee/cyber_security_summary

    Get the Book now from Amazon:
    https://www.amazon.com/Data-Pipelines-Apache-Airflow-Harenslak/dp/1617296902?&linkCode=ll1&tag=cvthunderx-20&linkId=39a43518fff3b8fca733494faa3cb6df&language=en_US&ref_=as_li_ss_tl


    Show more Show less
activate_Holiday_promo_in_buybox_DT_T2

What listeners say about Data Pipelines with Apache Airflow

Average customer ratings

Reviews - Please select the tabs below to change the source of reviews.