is a lead engineer, author, and thought leader in the domain of data engineering. Sandy co-wrote “Advanced Analytics with PySpark”
and "Advanced Analytics with Spark”
. He led ML and data science teams at Cloudera, Remix, Clover Health, and KeepTruckin.
Sandy is currently the lead engineer on the Dagster project, an open-source data orchestration platform used in MLOps, data science, IOT and analytics. Sandy is a regular speaker at data engineering and ML conferences.
- Data engineering in a post-Hadoop world
- The move to declarative and functional programming in data engineering
- The convergence of ML, data science, and data engineering
- The role of data engineering in the modern enterprise