Data Pipelines with Apache Airflow, Second Edition by Julian Ruiter

Data Pipelines with Apache Airflow, Second Edition by Julian Ruiter

Regular price
Checking stock...
Regular price
Checking stock...
Summary

Apache Airflow provides a batteries-included platform for designing, implementing, and monitoring data pipelines. Building pipelines on Airflow eliminates the need for patchwork stacks and homegrown processes, adding security and consistency to the process.

The feel-good place to buy books
  • Free US shipping over $15
  • Buying preloved emits 41% less CO2 than new
  • Millions of affordable books
  • Give your books a new home - sell them back to us!

Data Pipelines with Apache Airflow, Second Edition by Julian Ruiter

Scripts keep crashing and stakeholders demand data they can trust. Late-night alerts drain your energy and delay business decisions. Manual fixes multiply as pipelines sprawl across clouds and clusters. You need orchestration that scales without sacrificing reliability or sleep. Apache Airflow promises order, yet its power can feel overwhelming. This book makes Airflow mastery achievable, practical, and immediately rewarding.  Dynamic scheduling with Dataset API: Align complex, irregular jobs to real-world data availability.  Taskflow API patterns: Write cleaner Python code, reduce boilerplate, and speed team onboarding.  Container-native deployments: Run pipelines on Kubernetes for elastic scaling and cost control.  Comprehensive testing strategies: Catch issues before production, slash incident time, protect reputation.  Production-ready best practices: Logging, security, and monitoring that keep auditors and leaders happy.  Custom operator design: Extend Airflow to any system, unlocking limitless integration possibilities.  Data Pipelines with Apache Airflow, Second Edition gathers five seasoned consultants into one definitive field guide. Their combined experience turns cutting-edge features into steps you can reproduce today. It is the trusted companion for every data engineer.  The book starts with Airflow architecture, then walks through DAG design, testing, deployment, and operations. Updated chapters reveal Taskflow, Dataset scheduling, and Kubernetes setups, explained through real projects, not toy examples. Clear language, diagrams, and downloadable code remove guesswork.  Finish the last page knowing your pipelines deploy reliably, recover gracefully, and scale effortlessly. Sleep through the night while Airflow delivers fresh, accurate data to every downstream consumer.  Ideal for data engineers, DevOps, machine-learning engineers, and Python-savvy analysts ready to level-up orchestration skills.
  • Julian de Ruiter is a Data + AI engineering lead known for transforming messy pipelines into scalable platforms. With global consulting experience, Julian brings analytical rigor and clear storytelling to every page. He distills advanced engineering tactics into practical guidance that accelerates reader confidence. 
  • Ismael Cabral is a machine-learning engineer and renowned Airflow trainer trusted by companies across four continents. Drawing on cross-industry projects, Ismael delivers concise, hands-on advice readers can apply the same day. He turns complex workflow concepts into simple patterns that unlock rapid innovation. 
  • Kris Geusebroek is a data-engineering consultant and maintainer of Whirl, the open-source Airflow testing toolkit. Kris blends deep technical skill with a coach’s mindset, helping teams adopt disciplined engineering habits. He channels that expertise into repeatable methods that keep pipelines reliable and testable. 
  • Daniel van der Ende is a veteran data engineer and early Airflow contributor who has worked on diverse on-prem and cloud setups. Daniel’s pragmatic voice demystifies community-driven features and hidden pitfalls. He packages frontline lessons into actionable tips that save readers weeks of trial and error. 
  • Bas Harenslak is a staff architect at Astronomer, guiding enterprises in large-scale Airflow deployments. With a software-engineering background, Bas writes with precision, warmth, and strategic insight. He translates architectural puzzles into elegant solutions that future-proof reader infrastructure. 
SKU Unavailable
ISBN 13 9781633436374
ISBN 10 1633436373
Title Data Pipelines with Apache Airflow, Second Edition
Author Julian De Ruiter
Condition Unavailable
Binding Type Hardback
Publisher Manning Publications
Year published 2025-12-30
Number of pages 450
Cover note Book picture is for illustrative purposes only, actual binding, cover or edition may vary.