talk-data.com

Joe Reis

Speaker

25 talks

Joe Reis is a data professional with 20 years in the data industry, known as a "recovering data scientist" and a business-minded data nerd. His experience spans statistical modeling, forecasting, machine learning, data engineering, and data architecture. He is the co-author of Fundamentals of Data Engineering (O'Reilly, 2022).

Bio from: Small Data SF 2025

Filtering by: The Joe Reis Show

Talks & appearances

It's 2025! We made it! ;)

In this podcast, I rant about why data modeling matters more than ever, AI, and why humans will seek out "human" things in 2025 and beyond.

❤️ Your support means a lot. Please like and rate this podcast on your favorite podcast platform.

🤓 My works:

📕Fundamentals of Data Engineering: https://www.oreilly.com/library/view/fundamentals-of-data/9781098108298/

🎥 Deeplearning.ai Data Engineering Certificate: https://www.coursera.org/professional-certificates/data-engineering

🔥Practical Data Modeling: https://practicaldatamodeling.substack.com/

🤓 My SubStack: https://joereis.substack.com/

It's December 31, 2024. Gordon Wong and I wrap up 2024 and chat about what we're excited about in 2025 in data and otherwise.

Matt Housley and I have a LONG chat about working in consulting, leaving your job, AI, the job market, our thoughts on what's coming in 2025, and much more.

The first time I met Amr Awadallah, he struck me as a rare person genuinely curious about the world and how technology and AI impact it.

We discuss his early roots as an entrepreneur, the founding of Cloudera and Vectara, the challenges of AI in enterprises, what makes humans unique, and much more.

I had an interesting conversation yesterday with a young gentleman upgrading my Google Fiber. While he was originally pursuing a career as a software developer, he and his friends decided against it after seeing the progress of ChatGPT over the last couple of years.

As a father of two teenage boys, I often think about the nature of work, including whether writing code will be relevant for future generations. Here, I rant about at least part (not all) of what's on my mind. This is a big topic, and you'll see me ranting more about it.

This morning, a great article came across my feed that gave me PTSD. It asks whether Iceberg is the Hadoop of the Modern Data Stack.

In this rant, I bring the discussion back to a central question you should ask about any hot technology: do you need it at all? Do you need a tool built for the top 1% of companies operating at a sufficient data scale? Or is a spreadsheet good enough?

Link: https://blog.det.life/apache-iceberg-the-hadoop-of-the-modern-data-stack-c83f63a4ebb9

Over the last few weeks, I’ve been asked many questions about building a personal brand. Perhaps it’s the uncertain job market, people wanting to branch out, or something else. I’m unsure what’s in the air right now.

In this episode, I share some thoughts on building a personal brand.

Hannes Mühleisen is the creator of DuckDB and CEO of DuckDB Labs. We finally got a chance to meet in person at the Forward Data Conference in Paris. We hit it off immediately, and at times, I felt like I was talking with my long-lost brother. Hannes is a very cool guy!

While at the conference, we recorded a chat about all things DuckDB, the challenges of data lakehouses and open table formats, local-first tech, and much more. 🦆 🐥

Dave and Johnny run Estuary, a data integration company focused on real-time ETL and ELT. We're also friends, so we decided to have a chat.

In this episode, we chat about the current state of the data integration space, running a startup while raising kids, and much more.

Bill Inmon is considered the father of the data warehouse. I just got back from spending a couple of days with Bill, and we discussed the history of the data industry and the data warehouse. On my flight back, I realized people could benefit from a short version of our conversation.

In this short chat, we discuss what a data warehouse is (and is not), Kimball and Inmon, the origins of the data warehouse, and much more.