Technical Blog

Data science, machine learning, and software engineering insights

All posts
2026 Databricks Berlin User Group: A Recap and a Surprise Notes from my first talk in five years, what surprised me about the room, and what I want to say next. 6 min 2021 Data Verification for Machine Learning - A Review of DataFrame Validation Libraries A comparison of data validation libraries for Pandas and Spark DataFrames 22 min 2017 Testing Spark tasks with PyTest, Mock and Luigi Testing PySpark tasks using Luigi, PyTest and Mock 4 min 2017 Using mypy for Improving your Codebase Using static type checking to improve Python codebases 13 min