Technical Blog
Data science, machine learning, and software engineering insights
Featured
2026
Databricks Berlin User Group: A Recap and a Surprise
Notes from my first talk in five years, what surprised me about the room, and what I want to say next.

2021
Data Verification for Machine Learning - A Review of DataFrame Validation Libraries
A comparison of data validation libraries for Pandas and Spark DataFrames

All posts
2026
Databricks Berlin User Group: A Recap and a Surprise
Notes from my first talk in five years, what surprised me about the room, and what I want to say next.

2021
Data Verification for Machine Learning - A Review of DataFrame Validation Libraries
A comparison of data validation libraries for Pandas and Spark DataFrames

2017
Testing Spark tasks with PyTest, Mock and Luigi
Testing PySpark tasks using Luigi, PyTest and Mock

2017
Using mypy for Improving your Codebase
Using static type checking to improve Python codebases