ingest this #1
Use SQL for data pipelines - Data-Centric AI - Airflow Sucks for MLOps - Draw plots on the terminal
Welcome to the first edition of ingest this! - a curated newsletter about Data Engineering, MLOps, and Machine Learning Engineering.
Read this 📚
SQL should be your default choice for data engineering pipelines
We all love a bit of SQL, but is it powerful enough to be the only language used in your data engineering pipelines? In this article, Robin Linacre argues that you should look no further than SQL, thanks to modern techniques, such as CTEs, and new tools, such as DuckDB, SQLGlot, and dbt.
Watch this 👀
Andrew Ng on Data-Centric AI
Andrew Ng discusses how and why we should move from big data to good data when building AI systems.
Hear this 🎧
Airflow Sucks for MLOps
Demetrios Brinkmann from MLOps.community interviews Stephen Bailey about working with data platforms, the problems related to the orchestration layer and the limits of the current tools.
Also available as a video here.
Hack this 🛠️
Draw plots on the terminal with YouPlot
YouPlot is a great tool written in Ruby that allows you to create plots directly on your terminal. Many different plot types are available.