The Data Engineering Show podkast

Block Bad Data Before the Write with Nike’s Ashok Singamaneni

0:00
20:20
Do tyłu o 15 sekund
Do przodu o 15 sekund
Nike’s Principal Data Engineer Ashok Singamaneni joins Benjamin and Eldad to discuss his open-source data quality framework, Spark Expectations. Ashok explains how the tool, which was inspired by Databricks DLT Expectations, shifts data quality checks to before the data is written to a final table. This proactive approach uses row-level, aggregation-level, and query data quality checks to fail jobs, drop bad records, or alert teams - ultimately saving huge costs on recompute and engineering effort in mission-critical data pipelines.

Więcej odcinków z kanału "The Data Engineering Show"