Sign in
Technology
Business
Rudderstack
Each week we’ll talk to data engineers, analysts, and data scientists about their experience around building and maintaining data infrastructure, delivering data and data products, and driving better outcomes across their businesses with data.
The PRQL: Can Machine Learning Be Commoditized?
In this bonus episode, Eric and Kostas preview their upcoming live stream episode featuring Willem Pienaar of Tecton and Tristan Zajonc of Continual.
04:4819/08/2022
100: Data Quality Is Relative to Purpose with James Campbell of Superconductive
Highlights from this week’s conversation include:James’ role at Great Expectations (2:33)What Great Expectations does (5:49)How Great Expectations approaches data quality (7:01)Why a data engineer should use Great Expectations (16:41)Defining “data quality” (19:16)Translating expectations from one domain to the other (27:00)Community around Great Expectations (30:59)The user experience (33:41)Something exciting on the horizon (40:27)Interacting with marketers in a non-technical way (43:57)The Data Stack Show is a weekly podcast powered by RudderStack, the CDP for developers. Each week we’ll talk to data engineers, analysts, and data scientists about their experience around building and maintaining data infrastructure, delivering data and data products, and driving better outcomes across their businesses with data.RudderStack helps businesses make the most out of their customer data while ensuring data privacy and security. To learn more about RudderStack visit rudderstack.com.
53:5717/08/2022
The PRQL: What’s the Hardest Part About Data Quality?
Eric and Kostas preview their upcoming conversation with James Campbell at Superconductive.
04:0612/08/2022
99: State of the Data Lakehouse with Vinoth Chandar of Apache Hudi
Highlights from this week’s conversation include:Vinoth’s background and career journey (3:08)Defining “data lakehouse” (5:10)Databricks versus lake houses (13:37)The services a lakehouse needs (17:37)How to communicate technical details (26:55)Onehouse’s product vision (31:41)Lakehouse performance versus BigQuery solutions (36:44)How to deliver customer experience equally (40:17)How to start building a lakehouse (44:00)Big tech’s effect on smaller lakehouses (55:33)Skipping the data warehouse (1:04:39)The Data Stack Show is a weekly podcast powered by RudderStack, the CDP for developers. Each week we’ll talk to data engineers, analysts, and data scientists about their experience around building and maintaining data infrastructure, delivering data and data products, and driving better outcomes across their businesses with data.RudderStack helps businesses make the most out of their customer data while ensuring data privacy and security. To learn more about RudderStack visit rudderstack.com.
01:12:3810/08/2022
The PRQL: Does Lakehouse Architecture Really Mean the End of the Data Warehouse and Data Lake As We Know It?
In this bonus episode, Eric and Kostas preview their upcoming conversation with Vinoth Chandar of Apache Hudi.
05:0705/08/2022
98: Category Theory and the Mathematical Foundation of the Technologies We Use with Eric Daimler of Conexus
Highlights from this week’s conversation include:Eric’s background and career journey (3:30)Presenting to people without knowledge of AI (11:04)Why math was chosen over AI (19:03)From compilers to databases (25:42)The contribution of category theory (30:09)The Connexus customer experience (37:45)The primary user of Connexus (46:33)Interacting with 300,000 databases (51:07)When Connexus begins to add value (54:02)The best way to learn this mathematical approach (55:46)The Data Stack Show is a weekly podcast powered by RudderStack, the CDP for developers. Each week we’ll talk to data engineers, analysts, and data scientists about their experience around building and maintaining data infrastructure, delivering data and data products, and driving better outcomes across their businesses with data.RudderStack helps businesses make the most out of their customer data while ensuring data privacy and security. To learn more about RudderStack visit rudderstack.com.
01:01:3003/08/2022
The PRQL: Farm to Table Abstract Mathematics
Eric and Kostas preview their upcoming conversation with Eric Damlier of Conexus AI.
04:0129/07/2022
97: How To Build an Organization-Empowering Data Team with Emilie Schario of Amplify Partners
Highlights from this week’s conversation include:Emilie’s background and career journey (3:00)Hypergrowth at GitLab (5:23)Being close to the money in data (9:50)Big things taken from GitLab to Netlify (13:00)Defining “data organization” (17:53)The first roles you should hire for (22:06)Defining “analytics engineer” (23:44)One role to bridge different needs (27:26)Why data analysts are needed (30:51)How to avoid a kitchen sink of data (40:20)Data engineer archetype (45:48)Data roles crossing over (48:09)The Data Stack Show is a weekly podcast powered by RudderStack, the CDP for developers. Each week we’ll talk to data engineers, analysts, and data scientists about their experience around building and maintaining data infrastructure, delivering data and data products, and driving better outcomes across their businesses with data.RudderStack helps businesses make the most out of their customer data while ensuring data privacy and security. To learn more about RudderStack visit rudderstack.com.
54:0927/07/2022
The PRQL: If You Were Building a Data Team What Would Your First Hire Be?
Eric and Kostas preview their upcoming conversation with Emilie Schario from Amplify Partners.
04:1922/07/2022
96: How To Collect and Leverage Data From the Physical World with Prateek Joshi of Plutoshift
Highlights from this week’s conversation include:Prateek’s background and career journey (2:10)The lack of advanced data tools for the physical world (4:55)Dealing with data from the physical world (10:53)Stocks in the physical world (14:20)What it takes to execute this kind of project (19:05)Challenges around this infrastructure (25:56)ML tools that are useful in this environment (31:55)Physical instrumentation and environmental interaction (36:43)Current adoption of physical instrumentation (42:50)Data’s responsibility in sustainability (45:56)The Data Stack Show is a weekly podcast powered by RudderStack, the CDP for developers. Each week we’ll talk to data engineers, analysts, and data scientists about their experience around building and maintaining data infrastructure, delivering data and data products, and driving better outcomes across their businesses with data.RudderStack helps businesses make the most out of their customer data while ensuring data privacy and security. To learn more about RudderStack visit rudderstack.com.
54:3820/07/2022
The PRQL: Collecting Data in the Physical World
Eric and Kostas preview their upcoming conversation with Prateek Joshi.
03:1615/07/2022
95: How the Metrics Layer Bridges the Gap Between Data & Business with Nick Handel of Transform
Highlights from this week’s conversation include:Nick’s background and career journey (2:40)What Transform does (5:53)Metrics layer vs. metrics store (8:04)Signals vs. metrics (13:24)The user of a metric layer (14:34)Using Transform within an organization (17:05)How to fuse two sources into a metric (23:54)Currently supported databases (28:46)Community engagement (31:33)Optimizing for queries, metrics, and use cases (35:33)Technology and the human factor (40:49)Managing metrics amidst fast-paced change (46:53)Out-of-the-box metrics store (49:26)The Data Stack Show is a weekly podcast powered by RudderStack, the CDP for developers. Each week we’ll talk to data engineers, analysts, and data scientists about their experience around building and maintaining data infrastructure, delivering data and data products, and driving better outcomes across their businesses with data.RudderStack helps businesses make the most out of their customer data while ensuring data privacy and security. To learn more about RudderStack visit rudderstack.com.
57:4013/07/2022
The PRQL: Data Marts Aren’t Just for the Enterprise
Eric and Kostas preview their upcoming conversation with Nick Hansel from Transform.
03:5508/07/2022
94: Notebooks Aren’t Just for Data Scientists With Barry McCardel of Hex Technologies
Highlights from this week’s conversation include:Bary’s background and Hex (3:05)Reconciling two sides of data (9:16)Collaboration at Hex (15:10)What it takes to build something like Hex (20:02)Defining “commitment engineering” (26:01)How to begin working with Hex (30:56)Hex customers and uniqueness (40:31)The future in a world of data acquisition (45:30)Crossover between analytics and ML (51:33)Advice for data engineers (57:19)The Data Stack Show is a weekly podcast powered by RudderStack, the CDP for developers. Each week we’ll talk to data engineers, analysts, and data scientists about their experience around building and maintaining data infrastructure, delivering data and data products, and driving better outcomes across their businesses with data.RudderStack helps businesses make the most out of their customer data while ensuring data privacy and security. To learn more about RudderStack visit rudderstack.com.
01:02:3606/07/2022
The PRQL: Have You Ever Been a Part of a Company That Has Done Analytics Really Well?
Eric and Kostas preview their upcoming conversation with Barry McCardel of Hex Technologies.
03:4101/07/2022
93: There Is No Data Observability Without Lineage with Kevin Hu of Metaplane
Highlights from this week’s conversation include:Kevin’s background and career journey (1:54)Metaplane and the problem that is solves (6:47)The silence of data problems (9:53)Data physics work that requires more (13:35)Trusting data when bugs are present (19:12)Building a navigable experience (22:36)Developing anomaly detection (30:06)What Metaplane provides today (35:05)Metaplane’s plans for the future (37:45)Comparing Bigquery, Snowflake, and Redshift (40:56)Why data goes bad (48:15)Advice for data trust workers (59:24)The Data Stack Show is a weekly podcast powered by RudderStack, the CDP for developers. Each week we’ll talk to data engineers, analysts, and data scientists about their experience around building and maintaining data infrastructure, delivering data and data products, and driving better outcomes across their businesses with data.RudderStack helps businesses make the most out of their customer data while ensuring data privacy and security. To learn more about RudderStack visit rudderstack.com.
01:04:4629/06/2022
The PRQL: What Are the Similarities Between VCs and Tilapia?
Eric and Kostas preview their upcoming conversation with Kevin Hu of Metaplane.
03:2524/06/2022
92: Building a Decentralized Storage System for Media File Collaboration with Tejas Chopra
Highlights from this week’s conversation include:Tejas’ background and career journey (2:49, 43:04)Digital collaboration with Netflix Drive (7:57)A formal version control component (23:44)Centralized store vs. local affairs (31:05)The different skill sets a data engineer needs (37:38)How to get into data engineering (40:57)New technologies coming into day-to-day work (44:39)The Data Stack Show is a weekly podcast powered by RudderStack, the CDP for developers. Each week we’ll talk to data engineers, analysts, and data scientists about their experience around building and maintaining data infrastructure, delivering data and data products, and driving better outcomes across their businesses with data.RudderStack helps businesses make the most out of their customer data while ensuring data privacy and security. To learn more about RudderStack visit rudderstack.com.
55:3622/06/2022
The PRQL: What is Netflix Cloud?
Eric and Kostas preview their upcoming conversation with Tejas Chopra of Netflix.
02:5117/06/2022
91: The Future of Streaming Data with Stripe, Deephaven, Materialize, and Benthos
Highlights from this week’s conversation include:How we should think about batch versus streaming (6:02)Defining “streaming ETL” (9:34)A brief history of streaming processing platforms (22:07)The birth and evolution of Benthos (28:41)What led Jeff to build a new tool (34:29)Why you shouldn’t share all the data (37:23)Making streaming technologies approachable to engineers (42:09)Breaking out of traditional terminology (52:58)The Data Stack Show is a weekly podcast powered by RudderStack, the CDP for developers. Each week we’ll talk to data engineers, analysts, and data scientists about their experience around building and maintaining data infrastructure, delivering data and data products, and driving better outcomes across their businesses with data.RudderStack helps businesses make the most out of their customer data while ensuring data privacy and security. To learn more about RudderStack visit rudderstack.com.
01:00:2015/06/2022
The PRQL: Can Streaming Simplify Your Data Flows?
Eric and Kostas preview their upcoming livestream panel talking about all things streaming. Don't miss next week's episode with experts from Stripe, Deephaven, Materialize and Benthos
02:3410/06/2022
90: The Modern Data Stack Has a Join Problem with Ahmed Elsamadisi of Narrator AI
Highlights from this week’s conversation include:Ahmed’s background and career journey (2:27)Why the modern data stack “sucks” (4:53)The limitations of progress (9:13)Showing data with only 11 columns (11:55)Managing one table that rules them all (19:02)Viewing the world as timestamped activities (32:40)When this model becomes harder to use (35:15)The two parts you need in a company (44:41)Those who use Narrator (48:32)The Data Stack Show is a weekly podcast powered by RudderStack, the CDP for developers. Each week we’ll talk to data engineers, analysts, and data scientists about their experience around building and maintaining data infrastructure, delivering data and data products, and driving better outcomes across their businesses with data.RudderStack helps businesses make the most out of their customer data while ensuring data privacy and security. To learn more about RudderStack visit rudderstack.com.
56:3408/06/2022
The PRQL: Can One Table Rule Them All?
Eric and Kostas preview their upcoming episode with Ahmed Elsamadisi of Narrator AI.
05:1503/06/2022
89: Solving Microservice Orchestration Issues at Netflix with Viren Baraiya of Orkes
Highlights from this week’s conversation include:Viren’s background and career journey (2:23)Engineering challenges in Netflix transitions (6:05)How Conductor changed the process (9:30)Building a lot more microservices (16:04)Open sourcing Conductor (17:38)Defining “orchestration” (22:05)Using an orchestrator written in Java (31:04)Building a cloud service around microservices (34:59)Differentiating product experiences (37:17)Orchestration platforms in new environments (42:15)Advice for those early on in their career (46:10)The Data Stack Show is a weekly podcast powered by RudderStack, the CDP for developers. Each week we’ll talk to data engineers, analysts, and data scientists about their experience around building and maintaining data infrastructure, delivering data and data products, and driving better outcomes across their businesses with data.RudderStack helps businesses make the most out of their customer data while ensuring data privacy and security. To learn more about RudderStack visit rudderstack.com.
51:4001/06/2022
The PRQL: What are the Different Flavors of Orchestration?
Eric and Kostas preview their upcoming conversation with Viren Baraiya of Orkes.
04:0827/05/2022
88: What Is Data Observability? With Tristan Spaulding of Acceldata
Highlights from this week’s conversation include:Tristan’s background and career journey (2:43)Updating old technology (11:40)Defining “data observability” (18:44)The primary user of a data observability tool (29:56)Handling an incident (33:01)Why multipliers for data observability (37:06)Early symptoms of a data drift (43:12)Tuning in the context of data engineering (50:11)What keeps Tristan working with data (55:12)The Data Stack Show is a weekly podcast powered by RudderStack, the CDP for developers. Each week we’ll talk to data engineers, analysts, and data scientists about their experience around building and maintaining data infrastructure, delivering data and data products, and driving better outcomes across their businesses with data.RudderStack helps businesses make the most out of their customer data while ensuring data privacy and security. To learn more about RudderStack visit rudderstack.com.
01:01:4625/05/2022
The PRQL: Does Data Exist if We Do Not Observe It?
Eric and Kostas preview their upcoming conversation with Tristan Spaulding of Acceldata.
03:4120/05/2022
87: Why Is Now the Golden Age of Data Analytics? With Cindi Howson of ThoughtSpot
Highlights from this week’s conversation include:Cindi’s career journey (2:36)Major shifts in analytics (6:34)Where we are in formation of the modern analytics cloud (9:07)The process of moving into the cloud (11:01)How to accelerate the digital transformation (17:29)Common patterns amongst company cultures (19:42)Data regulations affecting change (22:34)ThoughtSpot customer base (24:06)The need to know SQL (27:42)Power users leveraging the AI Insights (31:24)Who should audit technology (32:33)The ways that education is happening best (36:28)Stuck in descriptive analytics (40:43)Changes in company structure (43:54)Defining an analytics engineer (46:57)The impact on IT as a function (50:33)Enjoying data analytics (53:06)The Data Stack Show is a weekly podcast powered by RudderStack, the CDP for developers. Each week we’ll talk to data engineers, analysts, and data scientists about their experience around building and maintaining data infrastructure, delivering data and data products, and driving better outcomes across their businesses with data.RudderStack helps businesses make the most out of their customer data while ensuring data privacy and security. To learn more about RudderStack visit rudderstack.com.
58:0818/05/2022
The PRQL: Can You Trust AI Enabled Analytics?
Eric and Kostas preview their upcoming conversation with Cindi Howson of ThoughtSpot and Host of The Data Chief Podcast.
03:1713/05/2022
86: Solving the Data Quality Problem with Bigeye, Great Expectations, Metaplane, and Lightup.ai
Highlights from this week’s conversation include:Guest introductions (1:02)Defining data quality (4:08)Forgetting to apply software best practices (8:33)Differentiating observability and data quality (17:53)Who should care about quality in the organization (26:55)Why this is still a valid conversation (35:44)The jurisdiction of various components (45:39)The Data Stack Show is a weekly podcast powered by RudderStack, the CDP for developers. Each week we’ll talk to data engineers, analysts, and data scientists about their experience around building and maintaining data infrastructure, delivering data and data products, and driving better outcomes across their businesses with data.RudderStack helps businesses make the most out of their customer data while ensuring data privacy and security. To learn more about RudderStack visit rudderstack.com.
58:4011/05/2022
85: You Can Stop Doing Data Fire Drills with Barr Moses of Monte Carlo
Highlights from this week’s conversation include:Barr’s background and career journey (2:12)Trust: a technical or human problem? (9:47)Behind the name “Monte Carlo” (15:41)Defining data accuracy and reliability (17:36)How much can be done with standardization (22:27)How to avoid frustration when generating data about data (25:49)Defining “resolution” (28:59)Understanding the concept of SLAs (33:25)Building a company for a category that doesn’t exist yet (37:40)What it looks like to use Monte Carlo (44:07)The best part about working with data teams (47:28)The Data Stack Show is a weekly podcast powered by RudderStack, the CDP for developers. Each week we’ll talk to data engineers, analysts, and data scientists about their experience around building and maintaining data infrastructure, delivering data and data products, and driving better outcomes across their businesses with data.RudderStack helps businesses make the most out of their customer data while ensuring data privacy and security. To learn more about RudderStack visit rudderstack.com.
51:3904/05/2022
The PRQL: Be Careful, Young Padawan, When Comparing Software Observability and Data Observability
Eric and Kostas preview their upcoming conversation with Barr Moses of Monte Carlo.
04:0402/05/2022
Data Council Week (Ep 5): A Primer on Spatial Data With Gabriel Hidalgo of Carto
Highlights from this week’s conversation include:How Gabriel got into data (1:54)What Carto is (5:28)Location data vs spatial data (6:37)Time data vs space data (7:50)System supports for spatial data (9:50)Explaining “spatial functions” (14:19)Who uses Carto and why (15:52)What’s coming for Carto (19:15)What Gabriel does at Carto (22:22)The coolest things Carto’s done (23:52)The Data Stack Show is a weekly podcast powered by RudderStack, the CDP for developers. Each week we’ll talk to data engineers, analysts, and data scientists about their experience around building and maintaining data infrastructure, delivering data and data products, and driving better outcomes across their businesses with data.RudderStack helps businesses make the most out of their customer data while ensuring data privacy and security. To learn more about RudderStack visit rudderstack.com.
28:5929/04/2022
Data Council Week (Ep 4): The Data Council Origin Story With Pete Soderling
Highlights from this week’s conversation include:Pete’s start in data and Data Council (2:01)Learning more from failure (6:42)Shaping terminology and definitions (9:30)What investors look for in data technology (12:43)Working as a data engineer (16:32)Data Council takeaways (18:16)The Data Stack Show is a weekly podcast powered by RudderStack, the CDP for developers. Each week we’ll talk to data engineers, analysts, and data scientists about their experience around building and maintaining data infrastructure, delivering data and data products, and driving better outcomes across their businesses with data.RudderStack helps businesses make the most out of their customer data while ensuring data privacy and security. To learn more about RudderStack visit rudderstack.com.
21:3128/04/2022
Data Council Week (Ep 3): Product Analytics the Right Way With James Greenhill of PostHog
Highlights from this week’s conversation include:How James got started in data (2:42)What makes PostHog different (10:43)Why we need product analytics (13:40)Capturing and collecting data (15:17)Dealing with drift on a platform like PostHog (19:45)Starting from the metrics versus events (22:50)The Data Stack Show is a weekly podcast powered by RudderStack, the CDP for developers. Each week we’ll talk to data engineers, analysts, and data scientists about their experience around building and maintaining data infrastructure, delivering data and data products, and driving better outcomes across their businesses with data.RudderStack helps businesses make the most out of their customer data while ensuring data privacy and security. To learn more about RudderStack visit rudderstack.com.
29:2727/04/2022
Data Council Week (Ep 2): Testing and Observability Are Two Sides of the Same Coin With Ben Castleton of Great Expectations
Highlights from this week’s conversation include:Ben’s background and career journey (2:13)The birth of Great Expectations (5:02)Defining software engineering (9:38)Adopting open source products (13:04)Working in data versus healthcare (18:01)What's next for Great Expectations (20:29)The Data Stack Show is a weekly podcast powered by RudderStack, the CDP for developers. Each week we’ll talk to data engineers, analysts, and data scientists about their experience around building and maintaining data infrastructure, delivering data and data products, and driving better outcomes across their businesses with data.RudderStack helps businesses make the most out of their customer data while ensuring data privacy and security. To learn more about RudderStack visit rudderstack.com.
26:0026/04/2022
Data Council Week (Ep 1): Discussing Firebolt’s Engine With Benjamin HoppDiscussing Firebolt’s Engine With Benjamin Hopp
Highlights from this week’s conversation include:Ben’s career journey (2:55)What makes Firebolt different (3:58)Firebolt’s data product family (7:37)Table engines and Firebolt (10:57)Ben’s favorite part of ClickHouse (12:52)The experience of building an optimizer (15:19)Where Firebolt fits into architecture (17:27)Working in the data space: to love and dislike (19:51)Coming soon in the near future (24:35)The Data Stack Show is a weekly podcast powered by RudderStack, the CDP for developers. Each week we’ll talk to data engineers, analysts, and data scientists about their experience around building and maintaining data infrastructure, delivering data and data products, and driving better outcomes across their businesses with data.RudderStack helps businesses make the most out of their customer data while ensuring data privacy and security. To learn more about RudderStack visit rudderstack.com.
28:2325/04/2022
The PRQL: A Data Council Austin Quintuple
Eric and Kostas preview an upcoming mini-series for next week featuring conversations with experts at Data Council Austin.
05:3722/04/2022
The Data Stack Show Live: Solving the Data Quality Problem
Don't miss our livestream event on April 27 as we talk about all things Data Quality with some of the best in the business.
02:5621/04/2022
84: Why Are Analytics Still So Hard? With Kaycee Lai of Promethium
Highlights from this week’s conversation include:Kaycee’s background and career journey (2:34)Why analytics are hard (7:28)Defining “data management” (11:47)Defining “data virtualization” (15:57)The relationship between data virtualization and ETL (18:34)Where a company should invest first (21:40)Building without a Frankenstein stack (25:19)How Promethium solves data stack issues (27:53)Giving context to data (35:14)Cataloging: background, at Promethium, future (39:29)Who uses data catalogs (48:00)The Data Stack Show is a weekly podcast powered by RudderStack, the CDP for developers. Each week we’ll talk to data engineers, analysts, and data scientists about their experience around building and maintaining data infrastructure, delivering data and data products, and driving better outcomes across their businesses with data.RudderStack helps businesses make the most out of their customer data while ensuring data privacy and security. To learn more about RudderStack visit rudderstack.com.
56:0020/04/2022
The PRQL: Does Putting All Your Data in One Place Create More Problems Than it Solves?
Eric and Kostas preview their upcoming conversation with Kaycee Lai of Promethium.
03:0915/04/2022
83: Closing the Gap Between Business Analytics and Operational Analytics With Max Beauchemin of Preset
Highlights from this week’s conversation include:Max’s career journey and role today (2:56)Hitting the limits of traditional BI (11:06)The most influential technology (14:34)Merging with BI and visualization (17:35)Two thoughts on real-time (21:02)Defining BI (24:53)How many have actually achieved self-serve BI (29:54)How preset.io fits in the BI architecture of today (32:36)How to use preset.io to expose analytics (35:23)The analytics process to power something like embedded (42:45)Opportunities that exist right now in the BI market (44:53)Commoditization in visualizations across business models (47:58)What it felt like to create data tooling (51:34)The Data Stack Show is a weekly podcast powered by RudderStack, the CDP for developers. Each week we’ll talk to data engineers, analysts, and data scientists about their experience around building and maintaining data infrastructure, delivering data and data products, and driving better outcomes across their businesses with data.RudderStack helps businesses make the most out of their customer data while ensuring data privacy and security. To learn more about RudderStack visit rudderstack.com..
56:5913/04/2022
The PRQL: BI, Real-Time, and Data Tooling
Eric and Kostas preview their upcoming conversation with Max Beauchemin of preset.io.
04:2508/04/2022
82: Databases: The Fun Never Stops with Robert Hodges of Altinity
Highlights from this week’s conversation include:Robert’s background and career journey (2:21)How studying languages influences database work (5:13)Why Robert has been working with databases for 40+ years (7:50)Explaining the ClickHouse database (10:43)How ClickHouse is able to focus on latency (13:39)The use cases behind ClickHouse (19:19)How ClickHouse is different than other databases (25:47)Why old problems are just now getting addressed (29:04)How ClickHouse works with others against another (33:03)When to implement ClickHouse (38:53)The distance between ClickHouse and the end-user (42:24)New database technologies (47:02)The Data Stack Show is a weekly podcast powered by RudderStack, the CDP for developers. Each week we’ll talk to data engineers, analysts, and data scientists about their experience around building and maintaining data infrastructure, delivering data and data products, and driving better outcomes across their businesses with data.RudderStack helps businesses make the most out of their customer data while ensuring data privacy and security. To learn more about RudderStack visit rudderstack.com..
01:01:5806/04/2022
The PRQL: What Inspires Continued Innovation in Databases?
Eric and Kostas preview their upcoming conversation with Robert Hodges of Altinity.
03:3401/04/2022
81: Digging into Data Ops with Prukalpa Sankar of Atlan
Highlights from this week’s conversation include:Prukalpa’s background and career journey (3:16)Applying a data-driven mindset to poverty (7:21)What Atlan does (11:53)The makeup of a realistically functioning data team (15:25)How to create a company’s first data team (18:13)Defining “agile data” (22:01)The necessity of data ops (26:36)The minimum data stack needed (29:16)Data team size (31:58)Where to start when you need to make adjustments (34:51)Collaborate with different parts of the data stack (41:27)Defining the metadata plane (44:29)Lessons from facing crazy data problems (48:31)The Data Stack Show is a weekly podcast powered by RudderStack, the CDP for developers. Each week we’ll talk to data engineers, analysts, and data scientists about their experience around building and maintaining data infrastructure, delivering data and data products, and driving better outcomes across their businesses with data.RudderStack helps businesses make the most out of their customer data while ensuring data privacy and security. To learn more about RudderStack visit rudderstack.com.
55:3130/03/2022
The PRQL: Data Team Diversity & Maturing Data Ops
Eric and Kostas preview their upcoming conversation about data ops and diversity with Prupalka of Atlan.
03:1425/03/2022
80: Is Reverse-ETL Just Another Data Pipeline? With Census, Hightouch, & Workato
Highlights from this week’s conversation include:Panel introductions (2:23)What is driving the trend behind Reverse ETL? (5:24)The obstacles to building an internal Reverse ETL tool at scale (15:34)How to decide system management vs. user flexibility (20:14)Why previous products failed in creating this category (29:12)Increased demand and democratization of datastack skills via SaaS (42:03)Broader applications for Reverse ETL (47:29)Limitations of Reverse ETL (55:05)How user technical ability affects design and build roadmaps (58:14)What do you anticipate comes next for Reverse ETL? (1:02:45)The Data Stack Show is a weekly podcast powered by RudderStack, the CDP for developers. Each week we’ll talk to data engineers, analysts, and data scientists about their experience around building and maintaining data infrastructure, delivering data and data products, and driving better outcomes across their businesses with data.RudderStack helps businesses make the most out of their customer data while ensuring data privacy and security. To learn more about RudderStack visit rudderstack.com.
01:15:5123/03/2022
The PRQL: Is Reverse ETL New or Old?
Eric and Kostas preview their upcoming panel discussion on reverse ETL and the modern data stack.
03:4118/03/2022
79: All About Experimentation with Che Sharma of Eppo
Highlights from this week’s conversation include:Che’s background and career journey (4:23)Coherence between hemispheres in the human brain (6:58)Raising Airbnb above primitive AB testing technology (8:54)Economic thinking in Airbnb’s data science practice (14:24)Dealing with multiple pipelines (16:48)Eppo’s role in recognizing statistically significant data (20:01)Defining “experiment” (23:25)Types of experiments (25:57)The workflow journey (27:18)Dealing with metric silos (34:21)Why we still need to innovate today (37:03)Where experimentation can be used (39:36)How big a sample size should be (43:29)How to self-educate to get the maximum value (45:39)Bridging the gap between data engineers and data scientists (48:14)The Data Stack Show is a weekly podcast powered by RudderStack, the CDP for developers. Each week we’ll talk to data engineers, analysts, and data scientists about their experience around building and maintaining data infrastructure, delivering data and data products, and driving better outcomes across their businesses with data.RudderStack helps businesses make the most out of their customer data while ensuring data privacy and security. To learn more about RudderStack visit rudderstack.com..
56:2116/03/2022