All content for EnergyTech Podcast is the property of Opsite Energy and is served directly from their servers
with no modification, redirects, or rehosting. The podcast is not affiliated with or endorsed by Podjoint in any way.
The ROI of Clean Data: Unlocking Value Across Drilling, Completions & Midstream - EP 051
EnergyTech Podcast
48 minutes
3 weeks ago
The ROI of Clean Data: Unlocking Value Across Drilling, Completions & Midstream - EP 051
In this episode of the Energy Tech Podcast, Mike Flores sits down with Austin Blake — Director of Data at CNX Resources — for one of our deepest, most comprehensive conversations yet on data quality, machine learning, edge computing, anomaly detection, and what real digital transformation looks like inside a major upstream operator. This conversation pulls back the curtain on how CNX is rebuilding its entire data foundation, streaming real-time operational data from the field, and applying advanced machine learning models to critical equipment like fracture fleets, flowback operations, and multi-stage compressor stations.WHAT YOU’LL LEARN IN THIS EPISODE1. Modernizing Oilfield DataHow CNX moved from siloed spreadsheets to a governed Snowflake data lake with clean, consistent, ML-ready data.2. Real-Time Edge VisibilityHow MQTT + AWS IoT Core deliver sub-second frac, flowback, and drilling data—and how offset-well monitoring helps prevent frac hits.3. Machine Learning That WorksHow CNX built practical anomaly detection models using 1-second SCADA data to improve compressor uptime and maintenance planning.4. AI vs. RealityWhy GenAI won’t fix bad data—and why traditional time-series ML still delivers the most value in oil & gas operations.5. Austin’s Career PathHow a completions engineer became CNX’s Director of Data and what he’s learned about failing fast and delivering real value.Chapters00:00 – Welcome to the Energy Tech Podcast00:16 – Introducing Austin Blake (Director of Data, CNX)00:48 – CNX’s digital transformation and Austin’s role01:30 – Fixing siloed data with a modern Snowflake architecture03:05 – Data governance, medallion layers, and eliminating Excel chaos04:05 – Austin’s background: field engineering → completions → data06:00 – Why completions lacked real-time data historically07:03 – Rise of Python, analytics, and why engineers must learn data08:30 – When CNX realized it needed to fix data before ML12:00 – Why ML fails without clean data13:30 – The well-naming problem in oil & gas15:00 – Misconceptions about AI “orchestrating” bad data16:02 – Aligning drilling, completions, production & midstream around data17:10 – CNX’s edge journey: Red Lion, vendors, and protocol challenges18:30 – Building the edge → AWS IoT Core → MQTT pipeline20:04 – Why sandbox environments accelerate innovation21:30 – Hot vs cold data paths: real-time vs long-term reporting24:00 – Real example: completions dashboards in Grafana25:20 – Preventing frack hits with offset-well observability28:00 – Demo walkthrough: real-time frack visualization32:50 – Why smaller operators can do this themselves34:00 – Building context at the edge (MQTT/Sparkplug)35:20 – Machine learning on compression fleets37:30 – Using 1-second SCADA for anomaly detection39:50 – Deploying ML models + measuring success via uptime42:00 – Austin’s personal approach: iterate fast, fail fast, deliver value44:30 – AI vs ML: what matters in oil & gas47:00 – Closing thoughts + subscribeMusic: Uygar Duzgun / “Fast Life” / courtesy of www.epidemicsound.com