The Data Engineering Show podcast

Block Bad Data Before the Write with Nike’s Ashok Singamaneni

Nike’s Principal Data Engineer Ashok Singamaneni joins Benjamin and Eldad to discuss his open-source data quality framework, Spark Expectations. Ashok explains how the tool, inspired by Databricks DLT Expectations, moves data quality checks to before the data is written to a final table. This proactive approach uses row-level, aggregation-level, and query-based data quality checks to fail jobs, drop bad records, or alert teams, which saves significant recompute cost and engineering effort in mission-critical data pipelines.
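
To make the "check before the write" idea concrete, here is a minimal sketch in plain Python. It is not the Spark Expectations API: the names `RowRule`, `AggRule`, `DataQualityError`, and `validate_before_write` are hypothetical, and a list of dictionaries stands in for a Spark DataFrame. It only illustrates the three outcomes the episode describes: fail the job, drop bad records, or raise an alert.

```python
from dataclasses import dataclass
from typing import Any, Callable, Dict, List, Tuple

Row = Dict[str, Any]


class DataQualityError(Exception):
    """Raised when a rule configured with action 'fail' is violated."""


@dataclass
class RowRule:
    """Hypothetical row-level rule: check returns True for a valid row."""
    name: str
    check: Callable[[Row], bool]
    action: str  # "drop", "fail", or "alert"


@dataclass
class AggRule:
    """Hypothetical aggregation-level rule evaluated over all surviving rows."""
    name: str
    check: Callable[[List[Row]], bool]
    action: str  # "fail" or "alert"


def validate_before_write(
    rows: List[Row], row_rules: List[RowRule], agg_rules: List[AggRule]
) -> Tuple[List[Row], List[str]]:
    """Return (rows that are safe to write, names of violated rules)."""
    kept: List[Row] = []
    alerts: List[str] = []
    for row in rows:
        keep = True
        for rule in row_rules:
            if rule.check(row):
                continue
            if rule.action == "fail":
                # Abort before anything reaches the final table.
                raise DataQualityError(f"row rule failed: {rule.name}")
            alerts.append(rule.name)
            if rule.action == "drop":
                keep = False
        if keep:
            kept.append(row)
    # Aggregate rules run on the rows that survived the row-level rules.
    for rule in agg_rules:
        if not rule.check(kept):
            if rule.action == "fail":
                raise DataQualityError(f"aggregate rule failed: {rule.name}")
            alerts.append(rule.name)
    return kept, alerts


# Example input with one missing key and one negative amount.
orders = [
    {"order_id": 1, "amount": 10.0},
    {"order_id": None, "amount": 5.0},
    {"order_id": 3, "amount": -2.0},
]
row_rules = [
    RowRule("order_id_not_null", lambda r: r["order_id"] is not None, "drop"),
    RowRule("amount_non_negative", lambda r: r["amount"] >= 0, "alert"),
]
agg_rules = [AggRule("at_least_one_row", lambda rs: len(rs) >= 1, "fail")]

clean_rows, alerts = validate_before_write(orders, row_rules, agg_rules)
# Only clean_rows would be handed to the writer for the final table.
```

In this sketch the row with the missing `order_id` is dropped, the row with the negative amount is kept but recorded as an alert, and an empty result would stop the job before any write happens. A real pipeline would express the same rules against a DataFrame and route the alerts to a notification channel.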
