finds.dev← search

// the find

tuanavu/airflow-tutorial

★ 976 · Jupyter Notebook · MIT · updated Nov 2022

Apache Airflow tutorial

A companion repo for a YouTube tutorial series on Apache Airflow, covering setup, basic DAG authoring, and a BigQuery pipeline example. Aimed squarely at beginners who want something to follow along with while watching the videos, not a reference for anyone already running Airflow in production.

Docker Compose setup gets a local Airflow environment running with a single command, which removes the biggest friction point for newcomers. The BigQuery pipeline example is a real-world pattern (GitHub trends) rather than a toy hello-world DAG. Each part maps to a tagged git commit so you can check out exactly the state shown in any given video.

Last touched in 2022 — Airflow is now on 2.x with a completely different task syntax (TaskFlow API, decorators), and this repo still uses the old operator-heavy style that Airflow's own docs have largely moved away from. No tests at all, not even the basic DAG import validation that Airflow itself supports. The GCloud example requires a service account key dropped into a directory, which is the credential-handling pattern you want to teach people to avoid. Nearly 1000 forks but the blog domain (applydatascience.com) appears dead, so half the supplementary material is gone.

View on GitHub →

// want more like this?

We dig through GitHub every week and send a few repos picked for what you actually care about — each with an honest take like this one.

Get finds in your inbox → Search again →