Data Council 2023

Sessions & talks

Designing for Intelligence at GitHub Next: Patterns & Practices for Making AI powered Products

2023-05-11
Idan Gazit (GitHub Next)

ABOUT THE TALK: What does it take to design successful products around AI capabilities? Achieving acceptable reliability is often not about model improvements and fine-tuning. A holistic approach to building AI-powered products requires thinking about how we elicit context from users, how we prompt the models, how we decide to measure the goodness of results, and how the interaction models we use weave intelligence into experiences.

GitHub Next is the birthplace of products like GitHub Copilot and is currently exploring the frontiers of AI assistance for the entire software development lifecycle. In this talk, Idan Gazit shares practical lessons from building and shipping these prototypes and from working successfully with the broader business.

ABOUT THE SPEAKER: Idan Gazit is a Senior Director of Research at GitHub Next, leading the Developer Experiences team. He is a hybrid designer-developer, and can usually be found geeking out about the Web, data visualization, typography, and color. He lives in the East Bay with his family and surrounds himself with a rotating cast of half-finished projects.

ABOUT DATA COUNCIL: Data Council (https://www.datacouncil.ai/) is a community and conference series that provides data professionals with the learning and networking opportunities they need to grow their careers.

Make sure to subscribe to our channel for the most up-to-date talks from technical professionals on data related topics including data infrastructure, data engineering, ML systems, analytics and AI from top startups and tech companies.

FOLLOW DATA COUNCIL: Twitter: https://twitter.com/DataCouncilAI LinkedIn: https://www.linkedin.com/company/datacouncil-ai/

Building a Business Review Program from Scratch | GlossGenius

2023-05-11
Katie Bauer (GlossGenius)

ABOUT THE TALK: Regular, metrics-focused business review meetings are a common practice in all types of companies, and getting an effective one up and running is a common task for new data leaders. There are many blog posts describing what an ideal end state looks like, but few discussions of building up to that state from an MVP version. In this talk, Katie Bauer shares how her team built their executive metrics review process at GlossGenius, where it is today, and where they would like to take it next.

ABOUT THE SPEAKER: Katie Bauer is a data leader who focuses on delivering actionable analysis, scalable data infrastructure, and building strong stakeholder relationships. She is currently Head of Data at GlossGenius and has previously led and built data teams at Reddit and Twitter.

The Power of Cross Community Collaboration

2023-05-11
Kyle Eaton (Great Expectations), Maggie Hays (DataHub)

ABOUT THE TALK: Do you want to expand your reach, gain credibility, and enhance the value of an open-source product & community? Whether you’re building a community around your own product or are an individual contributor looking for ways to contribute to your favorite OSS project, it’s time to focus on cross-community partnerships!

In this talk, Kyle Eaton and Maggie Hays detail the core tenets of an effective partnership, including effective co-product development & marketing, leveraging expertise within your community, measuring success, and pitfalls to avoid. Learn what makes for an effective partnership and a roadmap for getting started.

ABOUT THE SPEAKERS: Kyle Eaton is the Growth Lead at Great Expectations. While at Great Expectations, Kyle scaled the Great Expectations community from 0 to 9000+ Slack users, pioneered the Developer Relations team, and led the initiative for data ecosystem partnerships. Prior to Great Expectations, Kyle worked as a Lead UX Designer in health care, finance, and travel for nearly a decade.

Maggie Hays is the Community Product Manager for DataHub and part of the Founding Team at Acryl Data. She is passionate about building resources that allow data to be accessible, intuitive, and impactful for a wide spectrum of end-users so organizations can fully realize the power of data-backed decisions.

How Investors Think About Data [Founders Fund, Sequoia, Bain, Zero Prime Ventures]

2023-05-11
Lauren Reeder (Sequoia), Slater Stich (Bain Capital Ventures), Leigh Marie Braswell (Founders Fund)

ABOUT THE TALK: Pete Soderling welcomes several investors from top firms Founders Fund, Sequoia and Bain Capital Ventures to discuss their thoughts on investing in both data infrastructure and AI. They discuss the latest trends in investing in data/ML/AI companies, what they look for in early stage engineering-led teams and what they're most excited about for the future.

ABOUT THE SPEAKERS: Moderator Pete Soderling is the founder of Data Council and Zero Prime Ventures (formerly Data Community Fund).

Leigh Marie Braswell is a principal at Founders Fund focused on data & ML infrastructure and applications. Before joining Founders Fund, she was an early engineer & the first product manager at Scale AI, where she originally built and later led product development for the LiDAR/3D annotation products, used by many autonomous vehicles, robots, and AR/VR companies as a core step in their machine learning lifecycles.

Lauren Reeder is a Partner at Sequoia, where she focuses on data, infrastructure, and climate tech. She works closely with several Sequoia companies, including Statsig, Census, and Deno. Previously, Lauren served as Director of Product at Segment. She started her career in software engineering and did some early-stage investing on the side for a number of years before joining Sequoia.

Slater Stich is a Partner at Bain Capital Ventures. He was previously the founder of Activation Fund and Principal at Valor Equity Partners. At Bain Capital, Slater partners with early-stage founders in infrastructure, with a focus on tools for data teams.

Conversation Simulator: A Real Life Case Leveraging OpenAI's API | Crisis Text Line

2023-05-11
Maddie Schults (Crisis Text Line), Mateo Garcia (Crisis Text Line)

ABOUT THE TALK: While we will never replace human-to-human interaction for crisis intervention, there are plenty of opportunities to build intelligence with AI/ML models that could greatly benefit crisis responders.

In this talk, Maddie Schults and Mateo Garcia introduce their conversation simulator, a tool built on OpenAI's API that lets them train crisis responders to support people in crisis using close-to-real-life situations, and that can help reduce anxiety for new crisis responders as they log on to the platform for the first time.

ABOUT THE SPEAKERS: Maddie Schults is the General Manager at Crisis Text Line. She is a product leader and technologist with over 20 years of experience envisioning, building and launching enterprise software products. At Crisis Text Line, Maddie is responsible for building the Global Product for crisis care intervention and its adoption globally in different countries and languages.

Mateo Garcia is Lead Data Scientist at Crisis Text Line, where he oversees all the Analytics & Data Science efforts. He is a data leader with 7+ years of industry experience scaling data teams from the ground up and building data products at different start-ups and consulting firms.

How Dashboards as Code Can Help You Develop and Validate Your Analytics | Glean

2023-05-11
Dan Eisenberg (Glean.io)

ABOUT THE TALK: Dashboards sit at the end of a long chain of ever-changing data dependencies. Building them is also a very visual process: it is hard to tell whether a dashboard is correct without an end user looking at the rendered result. This all adds up to a development process that can be slow and error-prone.

“DataOps” is a new set of code-based patterns and practices that aim to address these challenges. In this talk, Dan Eisenberg takes a deep dive into these approaches and demonstrates some ways to integrate DataOps into the BI development lifecycle at Glean.

ABOUT THE SPEAKER: Dan Eisenberg is the VP of Technology at Glean.io, a platform for data visualization and collaboration. Prior to Glean, he was a Senior Director of Engineering at Flatiron Health, where his teams designed and built systems for abstracting data from unstructured medical records at scale.

Creating the Right Developer Community for Your Company | AWS

2023-05-11
Wesley Faulkner (AWS)

ABOUT THE TALK: Wesley Faulkner explores the various types of communities and discusses how to determine the most suitable one for your company at various stages of growth. Whether you are looking to double down on your current community or expand to new platforms, Wesley provides the guidance you'll need to make informed decisions about building a strong and effective community.

ABOUT THE SPEAKER: Wesley Faulkner is a first-generation American, public speaker, and podcaster. He is a founding member of the government transparency group Open Austin and a staunch supporter of racial justice, workplace equity, and neurodiversity. His professional experience spans technology from AMD, Atlassian, Dell, IBM, and MongoDB.

Generative AI for Search | Tonita

2023-05-11
D. Sivakumar (Tonita.co)

ABOUT THE TALK: D. Sivakumar discusses the evolving -- and immensely powerful -- role that generative AI methods, especially in NLP and Vision, play in Search, broadly construed. Through a number of anecdotes and organizing principles, he highlights a handful of key challenges and promising directions.

ABOUT THE SPEAKER: D. Sivakumar (Siva) is co-founder and CEO of Tonita.co, whose mission is to bring fluent natural-language search to every search box on the Web. Prior to founding Tonita in 2021, he worked in the research organizations at Google, Yahoo!, and IBM. His research has spanned algorithms and complexity, web search, and deep learning.

Designing & Building Metric Trees

2023-05-11
Abhi Sivasailam (Flexport)

ABOUT THE TALK: Metrics are the most important primitive in the data world, and driving the use of powerful and reliable metrics is the best way data teams can add value to their enterprises. In this talk, we'll walk through how data teams can best support the metric lifecycle end to end:

  1. Designing useful metrics as part of metric trees
  2. Developing these metrics off stable and standard data contracts
  3. Operationalizing metrics to drive value
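
To make the idea of a metric tree concrete, here is a minimal sketch in Python. The metrics and their decomposition (revenue as orders times average order value, orders as visitors times conversion rate) are illustrative assumptions and are not taken from the talk; the `Metric` class is hypothetical.

```python
from dataclasses import dataclass, field
from math import prod
from typing import List, Optional


@dataclass
class Metric:
    """A node in a metric tree: an input (leaf) or the product of its children."""
    name: str
    value: Optional[float] = None  # set for leaf (input) metrics only
    children: List["Metric"] = field(default_factory=list)

    def evaluate(self) -> float:
        if not self.children:
            if self.value is None:
                raise ValueError(f"leaf metric {self.name!r} has no value")
            return self.value
        # Assumption of this sketch: every parent is the product of its children.
        return prod(child.evaluate() for child in self.children)


# Hypothetical tree: revenue = orders * avg_order_value,
# where orders = visitors * conversion_rate.
orders = Metric("orders", children=[
    Metric("visitors", value=1000.0),
    Metric("conversion_rate", value=0.05),
])
revenue = Metric("revenue", children=[orders, Metric("avg_order_value", value=40.0)])

total = revenue.evaluate()
```

Real metric trees usually mix sums, ratios, and products; restricting parents to products keeps the sketch short while still showing how an output metric is derived from the input metrics beneath it.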

ABOUT THE SPEAKER: Abhi Sivasailam is a Growth and Analytics leader who most recently led Product-Led Growth, Product Analytics, and Analytics Engineering at Flexport, where he helped to lead these and other functions through 10x growth over the past 3 years. Previously, Abhi led growth and data teams at Keap, Hustle, and Honeybook.

What it Takes to Support the World's Most Popular Open Source Communities | NumFOCUS

2023-05-11
Dr. Katrina Riehl (NumFOCUS; Snowflake; Georgetown University)

ABOUT THE TALK: This talk walks you through the structure of NumFOCUS and its programs, challenges, and vision for a sustainable, inclusive, and vibrant open source community. It takes a deep dive into sustainability endeavors, including diversity and inclusion, and shows how you can get involved in the NumFOCUS community.

ABOUT THE SPEAKER: Dr. Katrina Riehl is President of the Board of Directors at NumFOCUS, Head of the Streamlit Data Team at Snowflake, and Adjunct Lecturer at Georgetown University. For almost two decades, Katrina has worked extensively in the fields of scientific computing, machine learning, data mining, and visualization. Most notably, she has helped lead data science efforts at the University of Texas Austin Applied Research Laboratory, Apple, HomeAway (now, Vrbo), and Cloudflare.

Generative AI & the Natural Language Interface for Data | Seek AI

2023-05-11
Sarah Nagy (Seek AI)

ABOUT THE TALK: With the advancement of AI, the natural language interface for data is more valuable than ever before. This talk explores three key questions. First, what would a natural language interface for data actually look like? Second, what kind of value would it add to organizations using the Modern Data Stack? Third, what will the challenges look like when it comes to working with a natural language interface for data? Sarah Nagy will share real-world learnings from Seek's customers for each of these questions.

ABOUT THE SPEAKER: A former quant, Sarah Nagy founded Seek AI in 2021. Prior to starting Seek, Sarah most recently led the consumer data team at Citadel's Ashler Capital. Prior to joining Citadel, Sarah led the quant arms at two startups, Edison and Predata, which both successfully exited. Sarah started her career as a quant at ITG developing algorithmic trading strategies. Sarah has a Master in Finance degree from Princeton and dual Bachelor's degrees in Astrophysics and Business Economics from UCLA.

How Vercel Builds Dozens of Metrics from One Heterogenous Table

2023-05-11
Thomas Mickley-Doyle (Vercel)

ABOUT THE TALK: This talk discusses how Vercel leverages dozens of metrics created from one heterogenous table to drive business, technical, product, and operations decisions across the company. Vercel's approach has empowered technical and non-technical stakeholders to jump into their analytical discovery from the metrics table with more frequent iterations and less involvement from the data team.

Centralizing data and metadata used in creating Vercel's many metrics has increased the number of stakeholders that can participate in analytics, decreased the time needed to troubleshoot outlier events, and removed the data team as a dependency for all data-related tasks.

ABOUT THE SPEAKER: Thomas Mickley-Doyle leads analytics and data science initiatives at Vercel, scaling insights across engineering, product, and design. He focuses on making data modeling, analytics, and decision-making more accessible for all users.

From 1 to IPO: Growing the Data Team and Data Culture at GitLab

2023-05-11
Taylor Murphy (Meltano)

ABOUT THE TALK: When Taylor Murphy joined GitLab, they had just raised their Series C, had about 200 people, and he was the only person "doing data." Over the next 3 years, the company would 6x its total headcount and be on target to IPO, which it did in 2021, all while the demand for data and insights grew exponentially. This talk will detail that growth journey with a particular focus on how they built the data culture across the organization. Taylor will share what went well and what he would repeat, and he'll be honest about what he would do differently if he could go back in time and do it all again.

ABOUT THE SPEAKER: Taylor Murphy is the Head of Product and Data at Meltano, an open source data platform that enables collaboration, efficiency, and visibility. Taylor has been deeply involved in leading and building data-informed teams his entire career.

At Concert Genetics he scaled the Data Operations team to enable the management of hundreds of thousands of genetic tests and millions of claims records.

At GitLab, he was the first data hire and focused on building and scaling the data organization as the company headed towards its IPO.

How to Build a Streaming Database in Three Challenging Steps | Materialize

2023-05-11
Frank McSherry (Materialize)

ABOUT THE TALK: A streaming database is a potentially intimidating product to build. Frank McSherry, Chief Scientist at Materialize, breaks down the manageable parts, through three foundational choices that fit together well. Frank also talks about the trade-offs, and how their simplifications lead to a much more manageable streaming database.

ABOUT THE SPEAKER: Frank McSherry is Chief Scientist at Materialize, where he (and others) convert SQL into scale-out, streaming, and interactive dataflows. Before this, he developed the timely and differential dataflow Rust libraries (with colleagues at ETHZ), and led the Naiad research project and co-invented differential privacy while at MSR Silicon Valley. He has a PhD in computer science from the University of Washington.

Hot Takes and Tragic Mistakes: How (not) to Integrate Data People in Your App Dev Team Workflows

2023-05-11
Noelle Saldana (Crisis Text Line)

ABOUT THE TALK: Everyone wants to create new products with AI/ML inside, but you need to integrate your data scientists and data engineers into traditional development teams to do that. But what exactly do they do, and where in the process do they fit? Does their work entirely fall under software engineering, product, or something else? Are you even ready for AI/ML? Has anyone figured this out?

Data-scientist-turned-product-person Noelle Saldana shares her observations and opinions on how companies should (and shouldn't) use their data people, along with her hot takes and tragic mistakes, so you can do it the right way the first time.

ABOUT THE SPEAKER: Noelle Saldana has fifteen years of Data Science experience and is passionate about the value data brings to both products and decision-making. She has led Data Science initiatives at companies across multiple industry verticals, ranging from early startups to Fortune 500 enterprises. Her recent focus has been the intersection of product and data strategy; instrumenting data and eliminating data technical debt to enable robust Data Science and Product Analytics downstream.

Ten years of building open source standards: From Parquet to Arrow to OpenLineage | Astronomer

2023-05-11
Julien Le Dem (Astronomer)

ABOUT THE TALK: Over the last decade, Julien Le Dem has contributed several successful open source projects to the data ecosystem. In this talk, Julien shares the story of these contributions and what made their success possible, from the ideation process and early growth of the Apache Parquet columnar format to how this led to the creation of its in-memory alter ego, Apache Arrow. Julien ends by showing how this experience enabled the success of OpenLineage, an LFAI & Data project that brings observability to the data ecosystem.

ABOUT THE SPEAKER: Julien Le Dem is the Chief Architect of Astronomer and Co-Founder of Datakin. He co-created Apache Parquet and is involved in several open source projects including OpenLineage, Marquez (LFAI&Data), Apache Arrow, Apache Iceberg and a few others. Previously, he was a senior principal at WeWork, principal architect at Dremio, tech lead for Twitter’s data processing tools, and principal engineer working on content platforms at Yahoo, where he received his Hadoop initiation.

Change Data Streaming Patterns With Debezium & Apache Flink | Decodable

2023-05-11
Gunnar Morling (Decodable)

ABOUT THE TALK: Microservices are one of the big trends in software engineering of the last few years.

In this session we'll discuss and showcase how open-source change data capture (CDC) with Debezium can help developers with typical challenges they often face when working on microservices.

Learn how to:

  • Employ the outbox pattern for reliable, eventually consistent data exchange between microservices, without incurring unsafe dual writes or tight coupling
  • Gradually extract microservices from existing monolithic applications, using CDC, the strangler fig pattern and Apache Flink
  • Coordinate long-running business transactions across multiple services using CDC-based saga orchestration, ensuring such activity gets consistently applied or aborted by all participating services.
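
As a minimal sketch of the write side of the outbox pattern named in the first bullet, the following uses Python's built-in `sqlite3` module as a stand-in for a service's database. The table names (`orders`, `outbox`), the event shape, and the `place_order` helper are hypothetical and not taken from the talk. In a real deployment, a CDC tool such as Debezium would capture the outbox rows from the database's transaction log and forward them to other services; that part is not shown here.

```python
import json
import sqlite3


def create_schema(conn: sqlite3.Connection) -> None:
    conn.execute("CREATE TABLE orders (id INTEGER PRIMARY KEY, item TEXT NOT NULL)")
    conn.execute(
        "CREATE TABLE outbox ("
        "id INTEGER PRIMARY KEY AUTOINCREMENT, "
        "aggregate_id INTEGER NOT NULL, "
        "event_type TEXT NOT NULL, "
        "payload TEXT NOT NULL)"
    )


def place_order(conn: sqlite3.Connection, order_id: int, item: str) -> None:
    # One local transaction covers both writes: the order row and its event
    # are committed together or not at all, which avoids a dual write to
    # the database and a separate message broker.
    with conn:
        conn.execute("INSERT INTO orders (id, item) VALUES (?, ?)", (order_id, item))
        conn.execute(
            "INSERT INTO outbox (aggregate_id, event_type, payload) VALUES (?, ?, ?)",
            (order_id, "OrderPlaced", json.dumps({"id": order_id, "item": item})),
        )


conn = sqlite3.connect(":memory:")
create_schema(conn)
place_order(conn, 1, "keyboard")
try:
    # Duplicate primary key: the transaction rolls back, so no event is written.
    place_order(conn, 1, "mouse")
except sqlite3.IntegrityError:
    pass

events = conn.execute("SELECT aggregate_id, event_type FROM outbox").fetchall()
```

The service only ever writes to its own database; publishing the event is left to whatever reads the outbox table, which is what keeps the services loosely coupled.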

ABOUT THE SPEAKER: Gunnar Morling is a software engineer and open-source enthusiast at heart, currently working at Decodable on stream processing based on Apache Flink. In his prior role as a software engineer at Red Hat, he led the Debezium project, a distributed platform for change data capture. He is a Java Champion and has founded multiple open source projects such as JfrUnit, kcctl, and MapStruct. Gunnar is an avid blogger (morling.dev) and has spoken at a wide range of conferences like QCon, JavaOne, and Devoxx. He lives in Hamburg, Germany.

How Riot Games Uses Data to Maximize Engagement & Enjoyment

2023-05-11
Ian (Riot Games)

ABOUT THE TALK: League of Legends faces many interesting problems in the data space that are unique because it is a video game. How do you deploy and train models in a binary video game? How has the data and ML stack changed since League's inception in 2009? How do you do player-facing ML (lane detection, feeding detection, etc.) and decision science at this scale?

ABOUT THE SPEAKER: Ian is a senior software engineer at Riot Games, working on the League Data Central team. Along with his team, Ian ships machine learning and data products to millions of League of Legends and TFT players, including in-game recommendations, player behaviour models, and internal decision science, to help make the game a better place for all.

2023-05-11 Watch
video

ABOUT THE TALK: In this talk, Erik Bernhardsson will share how Modal starts thousands of large containers in seconds, and what they had to do under the surface to build this. This includes a custom file system written in Rust, their own container runtime, and their own container image builder. This talk will give you an idea of how containers work along with some of the low-level Linux details underneath. We'll also talk about how many infrastructure tools hold data teams back, and why they deserve faster and better tools.

ABOUT THE SPEAKER: Erik Bernhardsson is the founder and CEO of Modal, which is an infrastructure provider for data teams. Before Modal, Erik was the CTO at Better for six years, and previously spent seven years at Spotify, building the music recommendation system and running data teams.

CDC Stream Processing with Apache Flink

2023-05-11 Watch
video
Timo Walther (Data Artisans, Ververica, Immerok)

ABOUT THE TALK: In this talk, we highlight what it means for Apache Flink to be a general data processor that acts as a data integration hub. Looking under the hood, we demonstrate Flink's SQL engine as a changelog processor that ships with an ecosystem tailored to processing CDC data and maintaining materialized views. We will discuss the semantics of different data sources and how to perform joins or stream enrichment between them. This talk illustrates how Flink can be used with systems such as Kafka (for upsert logging), Debezium, JDBC, and others.
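The idea of a changelog processor that maintains a materialized view can be sketched in a few lines of plain Python. This is only an illustration of the concept, not Flink's implementation or API: the events below follow the general shape of a Debezium change event (`op`, `before`, `after`), while the `apply_change` helper, the `id` key, and the sample rows are hypothetical names chosen for this example.

```python
# Minimal sketch: fold a stream of CDC events into a keyed "materialized view".
# Assumption: each event carries Debezium-style fields "op", "before", "after".
# apply_change, the "id" key, and the sample rows are hypothetical.

def apply_change(view, event, key="id"):
    """Apply one change event to the view (a dict keyed by primary key)."""
    op = event["op"]
    if op in ("c", "r", "u"):
        # create, snapshot read, and update all upsert the new row image
        row = event["after"]
        view[row[key]] = row
    elif op == "d":
        # delete removes the row identified by the old row image
        view.pop(event["before"][key], None)
    else:
        raise ValueError(f"unsupported op: {op!r}")
    return view


events = [
    {"op": "c", "before": None, "after": {"id": 1, "status": "new"}},
    {"op": "c", "before": None, "after": {"id": 2, "status": "new"}},
    {"op": "u", "before": {"id": 1, "status": "new"},
     "after": {"id": 1, "status": "paid"}},
    {"op": "d", "before": {"id": 2, "status": "new"}, "after": None},
]

view = {}
for event in events:
    apply_change(view, event)

print(view)
```

After the four events, only the updated row with key 1 remains in the view. A real engine additionally has to deal with ordering, state backends, and fault tolerance, none of which this sketch attempts.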

ABOUT THE SPEAKER: Timo Walther is a long-term member of the management committee and among the top committers in the Apache Flink project. Timo worked as a software engineer at Data Artisans and led the SQL team at Ververica. He was a co-founder of Immerok, which was acquired by Confluent in 2023. In Flink, he is working on various topics in the Table & SQL ecosystem to make stream processing accessible for everyone.

How ML, NLP, & 5 mins of Playtime Help Parents, Caregivers, & Children Enjoy Life Together

2023-05-11 Watch
video
Mady Mantha (Happypillar)

ABOUT THE TALK: One in five kids has a mental or behavioral disorder, but only 15% have access to care, and the current supply of trained therapists barely covers that demand. Happypillar is a digital therapeutic app that provides evidence-proven behavioral intervention to all at scale. Learn how we combine ML, ASR, NLP, and other technologies with the expertise of our founding clinical play therapist to offer accurate and real-time personalized feedback, all with compliant security processes and the strictest privacy controls.

ABOUT THE SPEAKER: Mady Mantha is a Product and ML Engineering Leader and the Co-Founder & CTO at Happypillar. As a Director of Conversational AI at Sirius, Mady led the team that built Walmart’s conversational AI.

A Deep Dive into the dbt Manifest | Squarespace

2023-05-11 Watch
video
Aaron Richter (Squarespace)

ABOUT THE TALK: Ever noticed the manifest.json file that dbt puts into your target folder? This little file contains rich information about your dbt project that enables numerous fun use cases! These include complex deployment configurations, quality enforcement, and streamlined development workflows. This talk will go over what the manifest is and how it is produced, along with case studies of how the manifest is used across the community and in Squarespace’s data pipelines.
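As a rough illustration of the kind of use case mentioned above, the sketch below reads a manifest and reports models that have no tests attached. It is not taken from the talk. It assumes the manifest's top-level `nodes` mapping, where each node has a `resource_type` and a `depends_on.nodes` list; the project name `shop`, the sample nodes, and both helper functions are hypothetical.

```python
import json

# Hypothetical, heavily trimmed stand-in for target/manifest.json.
SAMPLE_MANIFEST = json.loads("""
{
  "nodes": {
    "model.shop.stg_orders": {
      "resource_type": "model",
      "name": "stg_orders",
      "depends_on": {"nodes": ["source.shop.raw.orders"]}
    },
    "model.shop.orders": {
      "resource_type": "model",
      "name": "orders",
      "depends_on": {"nodes": ["model.shop.stg_orders"]}
    },
    "test.shop.not_null_orders_id": {
      "resource_type": "test",
      "name": "not_null_orders_id",
      "depends_on": {"nodes": ["model.shop.orders"]}
    }
  }
}
""")


def model_dependencies(manifest):
    """Map each model's unique_id to the unique_ids it depends on."""
    return {
        unique_id: node.get("depends_on", {}).get("nodes", [])
        for unique_id, node in manifest.get("nodes", {}).items()
        if node.get("resource_type") == "model"
    }


def untested_models(manifest):
    """Return sorted unique_ids of models that no test node depends on."""
    nodes = manifest.get("nodes", {})
    tested = {
        dep
        for node in nodes.values()
        if node.get("resource_type") == "test"
        for dep in node.get("depends_on", {}).get("nodes", [])
    }
    return sorted(
        unique_id
        for unique_id, node in nodes.items()
        if node.get("resource_type") == "model" and unique_id not in tested
    )


print(model_dependencies(SAMPLE_MANIFEST))
print(untested_models(SAMPLE_MANIFEST))
```

For a real project, the inline sample would be replaced by loading `target/manifest.json` with `json.load`. A check like `untested_models` could then run in CI as a simple quality gate.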

ABOUT THE SPEAKER: Aaron Richter is a software developer with a passion for all things data. His work involves making sure data is clean and accessible, and that the tools to access it are at peak performance. Aaron is currently a data engineer at Squarespace, where he supports the company’s analytics platform. Previously, he built the data warehouse at Modernizing Medicine, and worked as a data science advocate at Saturn Cloud.

Data Warehouses are Gilded Cages What Comes Next | Motherduck

2023-05-11 Watch
video
Nicholas Ursa (Motherduck)

ABOUT THE TALK: If you squint, the data warehouse-centric analytics stack can look a lot like mainframe-era centralized control: shared distributed compute, some sandbox space if you are lucky, and no way to work locally that's equivalent. And it's been getting expensive! Yet the major vendors appear to be converging on features. Is this "The End of History" for OLAP? Your laptop is incredibly powerful and tragically underutilized. It has better I/O than servers from just a few years back, and with a solid analytic engine like DuckDB, it can handle surprising workloads.

In this talk, Nicholas Ursa will share what makes DuckDB fast and show how far single-node scale-up can take you. He will also show some ideas Motherduck is testing that blur the line between local and remote systems and shift some of that centralized control back to users.

ABOUT THE SPEAKER: Nicholas Ursa is an Engineer at Motherduck. He led data engineering teams at The New York Times and better.com after cutting his teeth in ad tech.

When to Move from Batch to Streaming and how to do it without hiring an entirely new team | Bytewax

2023-05-11 Watch
video
Zander Matheson (Bytewax)

ABOUT THE TALK: With growing demand for data pipelines and applications to go real-time, the shift can feel overwhelming. This talk demystifies the when, why, and how of moving from batch processing to real-time/stream processing. We will look at arguments for and against stream processing, common architectures, common pitfalls, and open source tools used.

ABOUT THE SPEAKER: Zander Matheson is the founder and CEO of Bytewax, an open source software company focused on enabling more developers to work with streaming data. Before starting Bytewax he worked on data infrastructure and data science at GitHub and Heroku.

Building a better world with AI, one architectural drawing at a time | mbue

2023-05-11 Watch
video
Jean-Pierre Trou (mbue), Ron Green (mbue)

ABOUT THE TALK: Mbue uses advanced computer vision and NLP technologies to read and understand architectural and technical drawings, catch flaws and other mistakes that cause delays and costly fixes, and ultimately automate the quality control process. Learn how they approach this complex problem, what they are doing to solve it, and where they're going next.

ABOUT THE SPEAKERS: Jean-Pierre Trou is the CEO and co-founder of mbue, an AI-first SaaS company focused on saving time and money and reducing liability for Architecture, Engineering and Construction (AEC) companies with automated quality control tools. mbue's web-based application uses artificial intelligence to instantly review technical drawings. Think “autocorrect” for construction documents. Jean-Pierre is also the Founding Principal of Runa Workshop, an architecture and interior design firm, and Founding Partner at Vaast, a real estate company, all based in Austin, Texas.

Ron Green is a serial tech entrepreneur and expert in artificial intelligence. Ron is co-founder and CTO of mbue and also co-founded KUNGFU.AI, an AI consultancy that helps companies build and deploy AI and machine learning solutions. Prior to KUNGFU.AI, Ron was CEO and founder of Thrive Technologies (acquired by CLOUD), ran software development at Ziften Technologies, Powered (acquired by Dachis Group), and Visible Genetics (acquired by Bayer).
