talk-data.com


Activities & events


Talk Title: "Under the Hood of LLM"

Description: In this talk we unveil the inner workings of Large Language Models (LLMs), explore the architecture of Transformers, and show how chatbots and AI agents work underneath. This talk also dives into the Model Context Protocol (MCP), the potential it offers, and how it differs from RAG and from other protocols like A2A. Join us for an in-depth look at how these powerful AI models that you already use on a daily basis really work: how they process data, generate responses, and push the boundaries of artificial intelligence.

Speaker/Bio: Radovan Kavicky | President & Principal Data Scientist @ GapData Institute https://www.linkedin.com/in/radovankavicky/ & https://radovankavicky.substack.com/

Radovan currently works as President & Principal Data Scientist at GapData Institute/GDI Institute and is an expert in XAI (Explainable AI), a field of scientific research at the intersection of mathematics and artificial intelligence. He also works as an AI Consultant (AI IXX/Dubai, UAE), applying AI and data science knowledge in business, economics and public administration.

- in April 2025 he successfully completed the prestigious AI Connect program (nominated in 2024 as the only Slovak) under the auspices of the US Department of State (Bureau of Cyberspace and Digital Policy) and the Atlantic Council GeoTech Center, which aims to help countries such as Slovakia transfer technology and knowledge in the field of AI towards responsible and successful implementation in public administration and the economy

- in April 2024 he was invited (keynote) to represent the CEE region and Europe as the first Slovak ever at the Global AI Show 2024 in Dubai (with 10k+ attendees, one of the world's largest AI events)

- previously a member of Slovak.AI and until May 2024 also worked as Data Science & AI Evangelist at AIslovakIA - National Platform for the Development of Artificial Intelligence in Slovakia, and in cooperation with MIRRI/MIRDI (Ministry of Investment, Regional Development and Informatics of the Slovak Republic) led a mission of Slovak AI researchers at the University of Cambridge

- Radovan is a member of several international and European professional organizations in AI and data science, such as: British Computer Society (Member, #BCS), Slovak Economic Association (SEA), IEEE Computer Society (Member, #IEEE), Slovak.AI (#SlovakAI), CLAIRE/CAIRNE, European AI Alliance & TAILOR network, AAAI, EurAI, AI4SK, ELLIS, AIDA

- selected lectures from the past: @Global AI Show (Dubai, 2024, Keynote) @WeAreDevelopers (Berlin, 2023) @TAILOR (Brussels, 2023) @CODECON (Bratislava, 2023, main talk) @DATAcated (New York, 2022) @PyData (Hamburg, 2022) @ML Prague (2023, 2022) @Data Science Summit (Warsaw, 2021) @PyData (Hong Kong, 2021) @ODSC Europe (London, 2021) @WeAreDevelopers (WeAreDevsLIVE, 2021) @Data Science Conference (DSG 5.0, 2019) @H2O.ai (Prague, 2019) @PyCon LT (PyData track, Vilnius, 2019) @VDSG (Linz, 2019) @TechSummit (Bratislava, 2017 & 2019) @PyData (Berlin, 2017)

LinkedIn: https://www.linkedin.com/in/radovankavicky/

Registration:

@Meetup.com group's event here (https://www.meetup.com/pydata-slovakia-bratislava/events/307439940/) & @Eventbrite registration here (https://www.eventbrite.com/e/pydata-slovakia-ba-meetup-29-radovan-kavicky-under-the-hood-of-llm-tickets-1335453710999). You can also find our event @Facebook here (https://www.facebook.com/events/2193597837725593/) and @LinkedIn here (https://www.linkedin.com/events/7320847136919564288/about/).

[Disclaimer: If you just mark "going" @Facebook event we can't guarantee your seat]

Language of the event: English


PyData Bratislava [Python Data Enthusiasts and Users, Data Scientists & Statisticians of all levels from Slovakia]

-- PyData is a group for users and developers of data analysis tools to share ideas and learn from each other. We gather to discuss how best to apply Python tools, as well as those using R and Julia, to meet the evolving challenges in data management, processing, analytics, and visualization. PyData is organized by NumFOCUS.org, a 501(c)(3) non-profit in the United States.

The PyData Code of Conduct governs this meetup. To discuss any issues or concerns relating to the code of conduct or the behavior of anyone at a PyData meetup, please contact the organizer or NumFOCUS Executive Director Leah Silen (+1512-222-5449; [email protected]).

Our Facebook group you can find here: https://www.facebook.com/groups/1813599648877946/

Our Twitter account here: https://twitter.com/PyDataBA

Our LinkedIn group here: https://www.linkedin.com/groups/13506080


Organizers: GapData Institute (https://www.gapdata.org/) (GDI) is a nonprofit nonpartisan research institution harnessing power of data & wisdom of economics for public good.

|| Data. Think. Change. ||

NumFOCUS (http://www.numfocus.org/) is a 501(c)(3) nonprofit that supports and promotes world-class, innovative, open source scientific computing. The mission of NumFOCUS is to promote sustainable high-level programming languages, open code development, and reproducible scientific research.

PyData Slovakia & Bratislava Meetup #29 [Radovan Kavicky: Under the Hood of LLM]

IAQF & Thalesians Seminar Series: Stress Testing Spillover Risk in Mutual Funds. A Seminar by Agostino Capponi.

6:00 PM Seminar Begins

7:30 PM Reception

Hybrid Event

Location: Fordham University, McNally Amphitheater, 140 West 62nd Street, New York, NY 10023

Free Registration! For Virtual Attendees: Please email [email protected] for the link

Abstract: We develop a framework to quantify the vulnerability of mutual funds to fire-sale spillover losses. We account for the first-mover incentive that results from the mismatch between the liquidity offered to redeeming investors and the liquidity of assets held by the funds. In our framework, the negative feedback loop between investors’ redemptions and price impact from asset sales leads to an aggregate change in funds’ NAV, which is determined as a fixed point of a nonlinear mapping. We show that a higher concentration of first movers increases the aggregate vulnerability of the system, as measured by the ratio between endogenous losses due to fund redemptions and exogenous losses due to initial price shocks only. When calibrated to U.S. mutual funds, our model shows that, in stressed market scenarios, spillover losses are significantly amplified through a nonlinear response to initial shocks that results from the first-mover incentive. Higher spillover losses provide a stronger incentive to redeem early, further increasing fire-sale losses and the transmission of shocks through overlapping portfolio holdings.
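The fixed-point idea in the abstract can be illustrated with a deliberately simplified toy model. The sketch below is not the speaker's model: the quadratic redemption response and the parameters `sensitivity` and `impact` are assumptions chosen only to show how a total loss can be computed as the fixed point of a nonlinear mapping, and how the ratio of endogenous to exogenous losses can grow with the size of the initial shock.

```python
def aggregate_loss(shock, sensitivity, impact, tol=1e-12, max_iter=10_000):
    """Solve loss = shock + impact * redemptions(loss) by fixed-point iteration.

    Toy assumption: redemptions respond quadratically to losses and are
    capped at 100% of fund shares.
    """
    loss = shock
    for _ in range(max_iter):
        redemptions = min(1.0, sensitivity * loss ** 2)
        new_loss = shock + impact * redemptions
        if abs(new_loss - loss) < tol:
            return new_loss
        loss = new_loss
    raise RuntimeError("fixed-point iteration did not converge")


def vulnerability(shock, sensitivity=4.0, impact=0.2):
    """Ratio of endogenous (redemption-driven) to exogenous (shock) losses."""
    total = aggregate_loss(shock, sensitivity, impact)
    return (total - shock) / shock


# Hypothetical shock sizes, expressed as fractions of NAV.
for shock in (0.05, 0.10):
    print(f"shock={shock:.2f} vulnerability={vulnerability(shock):.4f}")
```

In this toy setup the vulnerability ratio rises as the shock grows, which mirrors the amplification the abstract describes qualitatively; the numbers themselves carry no empirical meaning.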

Bio: Agostino Capponi is a Professor in the Department of Industrial Engineering and Operations Research at Columbia University, where he is also a member of the Data Science Institute and the founding director of the Columbia Center for Digital Finance and Technology. His current research interests are in financial technology, machine learning in finance, market microstructure, systemic and liquidity risk, climate finance, energy markets, and economic networks. Agostino's research has been funded by major agencies, including NSF, DARPA, DOE, IBM, GRI, INET, Ripple, Stellar, and the Ethereum Foundation. His research has been recognized with the 2018 NSF CAREER award, a JP Morgan AI Research Faculty award, and the UBRI Innovator award. His research has also been covered by various media outlets, including Bloomberg, the Financial Times, Vox, and Politico. Agostino is a fellow of the Crypto and Blockchain Economics Research Forum, and an academic fellow of Alibaba's Luohan Academy. He serves as an editor of Management Science in the Finance Department, co-editor of Mathematics and Financial Economics, and financial engineering area editor of Operations Research. He has held editorial positions at several major journals in his field, such as the SIAM Journal on Financial Mathematics, Mathematical Finance, Finance and Stochastics, Operations Research Letters, Stochastic Systems, and Stochastic Models. Agostino is a past Chair of the SIAG/FME Activity Group and of the INFORMS Finance Section, and is currently a member of the Council of the Bachelier Finance Society. Agostino is co-editor of the book Machine Learning and Data Sciences for Financial Markets: A Guide to Contemporary Practices, published in 2023 by Cambridge University Press.

Hybrid: Agostino Capponi - Stress Testing Spillover Risk in Mutual Funds.
Jim Sterne – guest @ Board Chair, Digital Analytics Association - USA

🌟 Session Overview 🌟

Session Name: Creating a Generative AI Adoption Roadmap

Speaker: Jim Sterne

Session Description: Generative AI presents tremendous opportunities, but most companies are trapped in the research cycle. We are faced with a change management conundrum like never before. Jim starts with a view of how generative AI will impact your company, your job, and your life. He then draws on lessons from the computer revolution, digital transformation, the mobile revolution, and Robotic Process Automation to deliver a rational roadmap for the adoption of generative AI capabilities. It's a transformation blueprint from executive alignment to measuring the impact of new projects. Forming an AI Council, setting policies and guidelines, rolling out training, and developing new ways to measure the business value of new projects will set you up to successfully integrate generative AI into your processes, products, and services.

🚀 About Big Data and RPA 2024 🚀

Unlock the future of innovation and automation at Big Data & RPA Conference Europe 2024! 🌟 This unique event brings together the brightest minds in big data, machine learning, AI, and robotic process automation to explore cutting-edge solutions and trends shaping the tech landscape. Perfect for data engineers, analysts, RPA developers, and business leaders, the conference offers dual insights into the power of data-driven strategies and intelligent automation. 🚀 Gain practical knowledge on topics like hyperautomation, AI integration, advanced analytics, and workflow optimization while networking with global experts. Don’t miss this exclusive opportunity to expand your expertise and revolutionize your processes—all from the comfort of your home! 📊🤖✨

📅 Yearly Conferences: Curious about the evolution of QA? Check out our archive of past Big Data & RPA sessions. Watch the strategies and technologies evolve in our videos! 🚀

🔗 Find Other Years' Videos:

2023 Big Data Conference Europe: https://www.youtube.com/playlist?list=PLqYhGsQ9iSEpb_oyAsg67PhpbrkCC59_g

2022 Big Data Conference Europe Online: https://www.youtube.com/playlist?list=PLqYhGsQ9iSEryAOjmvdiaXTfjCg5j3HhT

2021 Big Data Conference Europe Online: https://www.youtube.com/playlist?list=PLqYhGsQ9iSEqHwbQoWEXEJALFLKVDRXiP

💡 Stay Connected & Updated 💡

Don’t miss out on any updates or upcoming event information from Big Data & RPA Conference Europe. Follow us on our social media channels and visit our website to stay in the loop!

🌐 Website: https://bigdataconference.eu/, https://rpaconference.eu/

👤 Facebook: https://www.facebook.com/bigdataconf, https://www.facebook.com/rpaeurope/

🐦 Twitter: @BigDataConfEU, @europe_rpa

🔗 LinkedIn: https://www.linkedin.com/company/73234449/admin/dashboard/, https://www.linkedin.com/company/75464753/admin/dashboard/

🎥 YouTube: http://www.youtube.com/@DATAMINERLT

AI/ML Analytics Big Data Dashboard GenAI
DATA MINER Big Data Europe Conference 2020

Government Blockchain Showcase, August 16, Old Town Hall, Fairfax, VA. GBA Global: gbaglobal.org/blockchain-showcase/

Join the Government Blockchain Showcase on August 16 at Fairfax Old Town Hall in Virginia, hosted by the Government Blockchain Association, the Virginia Blockchain Council, and InFlux Technologies, to make your blockchain solution known and see the latest blockchain solutions. Does your company have a blockchain solution? Showcase it now.

The Virginia Joint Commission on Technology and Science (JCOTS) is analyzing blockchain, digital asset mining, and cryptocurrency to recommend their use in the Commonwealth. At the same time, a bill is moving through Congress (H.R. 6572 – Deploying American Blockchain Act of 2023) that orders the Secretary of Commerce to make the U.S. a leader in blockchain and digital assets. And now, the topic has become part of political campaigns, with candidates endorsing these technologies. With political pressure increasing, government administrators are being directed to act. To do this, they need to understand the technology.

Are you interested in discovering how blockchain enhances Decentralized Public Infrastructure Networks (DePINs)? Want to learn how to modernize legacy government systems to better respond to the rapidly evolving needs of constituents? Join the Virginia Blockchain Council, Flux Technologies, and the Government Blockchain Association (GBA) for the Government Blockchain Showcase!

Event Highlights:

- Explore Blockchain’s Impact on Decentralized Public Infrastructure Networks (DePINs): Learn how blockchain technology can boost the efficiency, transparency, and security of digital public infrastructure.

- Modernize Legacy Systems: Find out how to upgrade government systems to keep pace with rapid technological advancements.

- Engage with Leaders: Hear from government legislators, administration officials, and program managers as they discuss their needs and innovative solutions to public sector challenges.

- Private Sector Innovations: Listen to industry leaders describe the solutions used by governments and enterprise clients for elections & voting, decentralized data storage, identity management, payment processing, vital records management, and many others.

This event brings together government and industry pioneers to showcase how blockchain and Web3 technologies are transforming the public sector.

Join the Government Blockchain Showcase on August 16 at Fairfax Old Town Hall
Beena Ammanath – Global Head of the Deloitte AI Institute @ Deloitte

Throughout the past year, we've seen AI go from a nice-to-have to a must-have in almost every large organization’s boardroom. Leadership teams have focused more and more on deploying AI, and as a result, there's never been more pressure on the data team to deliver with AI. However, as the pressure to deliver with AI grows, the need to build safe and trustworthy experiences has also never been more important. But how do we balance innovation with building these trustworthy experiences? How do you make responsible AI practical? Who should we get into the room when scoping safe AI use-cases?

Beena Ammanath is an award-winning senior technology executive with extensive experience in AI and digital transformation. Her career has spanned leadership roles in e-commerce, finance, marketing, telecom, retail, software products, service, and industrial domains. She is also the author of the groundbreaking book, Trustworthy AI. Beena currently leads the Global Deloitte AI Institute and Trustworthy AI/Ethical Technology at Deloitte. Prior to this, she was the CTO-AI at Hewlett Packard Enterprise. A champion for women and multicultural inclusion in technology and business, Beena founded Humans for AI, a 501(c)(3) non-profit promoting diversity and inclusion in AI. Her work and contributions have been acknowledged with numerous awards and recognition such as the 2016 Women Super Achiever Award from World Women’s Leadership Congress and induction into WITI’s 2017 Women in Technology Hall of Fame. Beena was honored by UC Berkeley as 2018 Woman of the Year for Business Analytics, by the San Francisco Business Times as one of the 2017 Most Influential Women in Bay Area and by the National Diversity Council as one of the Top 50 Multicultural Leaders in Tech.

In the episode, Beena and Adel delve into the core principles of trustworthy AI, the interplay of ethics and AI in various industries, how to make trustworthy AI practical, who the primary stakeholders are for ensuring trustworthy AI, the importance of AI literacy when promoting responsible and trustworthy AI, and a lot more.

Links mentioned in the show:

- Trustworthy AI by Beena Ammanath

- Deloitte AI Institute

- Humans for AI

- Data Literacy by Design, with Valerie Logan, CEO of the Data Lodge

- [Course] Implementing AI Solutions in Business

- [Webinar - October 19th 2023] Building a Capability Roadmap for AI

AI/ML Analytics Marketing
DataFramed

Join us September 19th to discuss the hot topic of 2023: Artificial Intelligence (AI).

AI is much talked about everywhere, from the news and Parliament to boardrooms and pubs across the country. But what really is AI, and how does openness matter to it?

To get things kicked off, Matt Armstrong-Barnes from HPE will present a level set on AI, what it is, who is using it, what it is being used for, and possible directions.

Next, Jennifer Ding from the Alan Turing Institute, a contributor to the recent OpenUK AI Openness Report, will discuss the importance of openness in AI and why open approaches lead to better results.

Food and drinks will be provided and there will be plenty of time to chat with other attendees and debate with the speakers.

Thank you once again to Avanade for sponsoring the venue and providing the refreshments.

Matt Armstrong-Barnes

As a Chief Technologist at Hewlett Packard Enterprise, Matt has a passion for artificial intelligence and data science.

He has held senior management positions and been accountable for the overall architecture (including Business, Security, Integration and Technology), technical risks, IT strategy, and technology change for customers.

He holds a Degree in Computer Science, with a Master's in Artificial Intelligence. He is a Fellow of the Institution of Engineering and Technology, Chartered Fellow of the British Computer Society, Chartered IT Professional and Chair of Smart Cities at TechUK, and was awarded the title of Chartered Engineer by the Engineering Council.

Jennifer Ding

Jennifer Ding is a senior researcher at The Alan Turing Institute, co-leading the Research Application Management (RAM) team. Previously, she was a startup founder and data scientist at several public interest tech companies, creating data products for industry and government partners. She enjoys massaging data big and small, and is a co-founder of London Data Week.

Date: 19 September

Time: 6:30pm-9pm

Please see the pop-up sign in the lobby, sign yourself in, and then enter the door on the left on the ground floor.

Access to meetup

In order to provide visitor passes, we'll need you to have a full name set on your profile and you will require ID in order to sign in.

Refreshments

Drinks and food will be served at 6:30pm

Dial-in link

Microsoft Teams meeting

Join on your computer, mobile app or room device

Meeting ID: 239 431 273 213

Passcode: FER3uG

Or call in (audio only)

+44 20 3794 0298,,837403425# United Kingdom, London

Phone Conference ID: 837 403 425#

OpenUK London Meetup September - Open and AI?
Event Data Council 2023 2023-05-18
Daniel Selans – co-founder and CTO @ Streamdal.com

ABOUT THE TALK: In this talk, Dan Selans shows how his team developed a schema discovery process that automatically evolves schemas in a complex distributed system processing upwards of 100,000 messages per second.

He dives deep into the details of schema versioning, detecting schema conflicts and schema drift, determining compatibility, and normalization, all without the use of any batching processes.
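The general idea of evolving a schema one message at a time, without batching, can be sketched as follows. This is a minimal illustration under stated assumptions, not Streamdal's implementation: the `SchemaRegistry` class, the type-name schema representation, and the rule that new fields are compatible while type changes are conflicts are all hypothetical choices made for the example.

```python
from typing import Any


def infer_schema(message: dict[str, Any]) -> dict[str, str]:
    """Map each top-level field to the name of its Python type."""
    return {field: type(value).__name__ for field, value in message.items()}


class SchemaRegistry:
    """Hypothetical registry that evolves a schema per message (no batching)."""

    def __init__(self) -> None:
        self.version = 0
        self.schema: dict[str, str] = {}
        self.conflicts: list[tuple[str, str, str]] = []

    def observe(self, message: dict[str, Any]) -> str:
        observed = infer_schema(message)
        # A field whose type differs from the known schema is a conflict.
        changed = [
            (field, self.schema[field], type_name)
            for field, type_name in observed.items()
            if field in self.schema and self.schema[field] != type_name
        ]
        if changed:
            self.conflicts.extend(changed)
            return "conflict"
        # New fields are treated as a compatible, additive change.
        added = {f: t for f, t in observed.items() if f not in self.schema}
        if added:
            self.schema.update(added)
            self.version += 1
            return "evolved"
        return "unchanged"


registry = SchemaRegistry()
for msg in ({"id": 1, "name": "a"}, {"id": 2, "email": "x@example.com"}, {"id": "3"}):
    print(registry.observe(msg), registry.version)
```

A production system would also need to handle nested fields, optional values, and concurrent writers; the sketch only shows the per-message compare-and-evolve step.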

ABOUT THE SPEAKER: Daniel Selans is the co-founder and CTO of Streamdal.com, a streaming data performance monitoring company. Dan previously wrote software at companies such as InVisionApp, New Relic and DigitalOcean and before that, spent over 10 years doing integration and R&D work at data centers.

ABOUT DATA COUNCIL: Data Council (https://www.datacouncil.ai/) is a community and conference series that provides data professionals with the learning and networking opportunities they need to grow their careers.

Make sure to subscribe to our channel for the most up-to-date talks from technical professionals on data related topics including data infrastructure, data engineering, ML systems, analytics and AI from top startups and tech companies.

FOLLOW DATA COUNCIL: Twitter: https://twitter.com/DataCouncilAI LinkedIn: https://www.linkedin.com/company/datacouncil

AI/ML Analytics Data Engineering New Relic Data Streaming
Barry McCardel – Co-founder and CEO @ Hex, Drew Banin – Co-Founder @ Fishtown Analytics, Pedram Navid – CMO @ West Marin Data, Julia Schottenstein – Product Manager @ dbt Labs

ABOUT THE TALK: What are the latest trends and buzzwords in Data?

Barry McCardel welcomes panelists from Hex, dbt Labs and West Marin Data to discuss their thoughts on the latest trends and buzzwords in Data.

Learn about the latest in the world of streaming, data teams doing more with less, data meshes, innovations in different kinds of SQL, and more!

ABOUT THE SPEAKERS: Julia Schottenstein is the Product Manager at dbt Labs. Prior to this, she worked in Venture Capital as a Principal at NEA.

Drew Banin is the co-founder of dbt Labs. He has built event collection systems that scaled to billions of events per month, implemented Markov-based marketing attribution models on millions of dollars of marketing spend, and dreams in NetworkX graphs.

Barry McCardel is the CEO and co-founder of Hex. He previously worked at TrialSpark leading operations and at Palantir Technologies, where he led teams at the intersection of product development and real-world impact.

Pedram Navid is the Founder of West Marin Data. In his role he helps startups implement their data stack. He also supports them with product, marketing and community-building.

AI/ML Analytics Data Engineering dbt Marketing SQL Data Streaming
Emma Tang – Big Data Infrastructure Lead @ Stripe

ABOUT THE TALK: In this lightning talk, Emma Tang shares learnings from Stripe’s early efforts to tackle data correctness. As a financial technology company, data correctness is paramount to the operation of the company. This low tolerance for data inaccuracy poses unique constraints to how infrastructure is designed. Emma shares strategies as well as the trade-offs made in order to achieve this high level of correctness.

ABOUT THE SPEAKER: Emma Tang led Big Data Infrastructure at Stripe, helping the company build and scale data infrastructure systems to support the 14x revenue growth and 6x headcount growth during her time there.

AI/ML Analytics Big Data Data Engineering
Ahmed Elsamadisi – founder and CEO @ Narrator

ABOUT THE TALK: Modern data stacks focus on the most common use-cases and dashboards, but what about all the ad-hoc requests that come in? The current tool set does not let data analysts iterate easily with stakeholders. This talk argues that without an ad-hoc layer, data analysts are left to answer questions with hacky live SQL or to send every request through resource-intensive and expensive production processes and workflows.

An ad-hoc layer solves this by allowing data analysts to answer data questions, change their minds, and deliver data dumps or simple analyses quickly and reliably, so that they put work into production only if it needs to be reused.

ABOUT THE SPEAKER: Ahmed Elsamadisi is the founder and CEO of Narrator. Narrator enables companies to make better decisions by providing them with the ability to answer any question in under 10 minutes. Ahmed started his career building algorithms for self-driving cars and human-robot interaction. He then joined Raytheon to develop AI algorithms for missile defense, focusing on tracking and discrimination. In 2015, Ahmed joined WeWork as the first hire on their data team. He built their data engineering infrastructure and grew the team of data engineers and analysts.

AI/ML Analytics Data Engineering SQL
Ben Rogojan – Data Engineer @ Facebook

ABOUT THE TALK: Whether consulting or working as an employee, there are certain tools, patterns and practices many of us would like to see disappear in the next few years. Many of them delay projects and frustrate data engineers, yet we continue to rely on them. Whether it be transferring data via SFTP or joining teams without coding standards, some companies, even those that may be considered cutting edge, still have these patterns.

In this talk Ben Rogojan explores some of these tools, patterns and practices as well as why he hopes he doesn’t see them around in a few years.

ABOUT THE SPEAKER: Ben Rogojan has spent his career focused on helping companies develop end-to-end data solutions that are simple and maintainable. He has worked in various industries such as healthcare, finance, and e-commerce. In addition, he has worked for companies including Facebook as a data engineer. Using his broad experiences he has helped companies develop, improve, modernize, and migrate their data infrastructure.

AI/ML Analytics Data Engineering
Zack Klein – Software Engineer @ Whatnot

ABOUT THE TALK: After two years, three rounds of funding, and hundreds of new employees, Whatnot’s modern data stack has gone from not existing to processing tens of millions of events across hundreds of different event types each day.

How does their small (but mighty!) team keep up? This talk explores data contracts. It covers the use of an Interface Definition Language (Protobuf) to serve as the source of truth for event definitions, govern event construction in production, and automatically generate dbt models in the data warehouse.

ABOUT THE SPEAKER: Zack Klein is a software engineer at Whatnot, where he thoroughly enjoys building data products and narrowly avoiding breaking production each day. Previously, he worked on big data platforms at Blackstone and HBO.

AI/ML Analytics Big Data Data Contracts Data Engineering dbt DWH Modern Data Stack Protobuf
Kyle Kirwan – co-founder and CEO @ Bigeye

ABOUT THE TALK: Incident management is a key practice used by DevOps and SRE teams to keep software reliable—but it's still uncommon among data teams! Datadog says incident management can "streamline their response procedures, reducing mean time to repair (MTTR) and minimizing any impact on end users."

In this talk, Kyle Kirwan, co-founder of data observability company Bigeye, will explain the basics of incident management and how data teams can use it to reduce disruptions to analytics and machine learning applications.
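As a small illustration of the MTTR metric mentioned above, the sketch below averages the time between detection and resolution over a set of incidents. The incident records and the choice to skip still-open incidents are assumptions made for the example, not something the talk prescribes.

```python
from datetime import datetime, timedelta


def mean_time_to_repair(incidents):
    """Average (resolved - detected) over incidents that have been resolved."""
    durations = [
        resolved - detected
        for detected, resolved in incidents
        if resolved is not None
    ]
    if not durations:
        raise ValueError("no resolved incidents to average")
    return sum(durations, timedelta()) / len(durations)


# Hypothetical data incidents as (detected_at, resolved_at) pairs.
incidents = [
    (datetime(2023, 5, 1, 9, 0), datetime(2023, 5, 1, 10, 0)),    # 60 minutes
    (datetime(2023, 5, 2, 14, 0), datetime(2023, 5, 2, 14, 30)),  # 30 minutes
    (datetime(2023, 5, 3, 8, 0), None),                           # still open
]
print(mean_time_to_repair(incidents))
```

Tracking this number per pipeline or per dataset is one way a data team could tell whether its response procedures are actually improving.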

ABOUT THE SPEAKER: Kyle Kirwan is the co-founder and CEO of Bigeye. He began his career as a data scientist, went on to lead the development of Uber's internal data catalog/lineage/quality tools, and now helps data teams use data observability to improve pipeline reliability and data quality.

AI/ML Analytics BigEye Data Engineering Data Quality Datadog DevOps
Curtis Northcutt – CEO and co-founder @ Cleanlab

ABOUT THE TALK: In this talk, we discuss cleanlab open-source (github.com/cleanlab/cleanlab) and Cleanlab Studio (https://cleanlab.ai/studio). Cleanlab open-source is a fast-growing Python framework for data-centric AI that automatically detects issues in ML datasets. Cleanlab Studio is a no-code web interface used by universities and Fortune 500 companies for dataset issue detection and fixing. Cleanlab algorithms have theoretical support for improved accuracy on real-world, messy data.
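To give a feel for what detecting label issues can look like, here is a simplified heuristic in plain Python. It is loosely inspired by the idea of comparing a model's predicted probabilities against the given labels using per-class thresholds; it does not use the cleanlab API and should not be read as cleanlab's actual algorithm. The function name and the toy data are hypothetical.

```python
def flag_label_issues(labels, pred_probs):
    """Return indices whose given label disagrees with a confident prediction.

    labels: list of integer class ids.
    pred_probs: list of per-class probability lists, one per example.
    """
    classes = sorted(set(labels))
    # Per-class threshold: mean predicted probability of class k
    # over the examples that are labeled k.
    thresholds = {}
    for k in classes:
        probs_k = [p[k] for y, p in zip(labels, pred_probs) if y == k]
        thresholds[k] = sum(probs_k) / len(probs_k)

    issues = []
    for index, (y, p) in enumerate(zip(labels, pred_probs)):
        confident = [k for k in classes if p[k] >= thresholds[k]]
        if not confident:
            continue  # the model is not confident about any class
        best = max(confident, key=lambda k: p[k])
        if best != y:
            issues.append(index)
    return issues


# Toy data: example 2 is labeled 0 but the model strongly predicts class 1.
labels = [0, 0, 0, 1, 1, 1]
pred_probs = [
    [0.9, 0.1], [0.8, 0.2], [0.1, 0.9],
    [0.2, 0.8], [0.1, 0.9], [0.3, 0.7],
]
print(flag_label_issues(labels, pred_probs))
```

In practice the predicted probabilities should come from out-of-sample predictions (for example via cross-validation), otherwise a model that has memorized the noisy labels will hide the very issues being searched for.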

ABOUT THE SPEAKER: Curtis Northcutt is an American computer scientist and entrepreneur focusing on machine learning and AI to empower people. He is the CEO and co-founder of Cleanlab, an AI software company that improves machine learning model performance by automatically fixing data and label issues in real-world, messy datasets. Curtis completed his PhD at MIT where he invented Cleanlab’s algorithms for automatically finding and fixing label issues in any dataset.

AI/ML Analytics Data Engineering GitHub Python
Ivan Aguilar – Data Scientist @ Teleskope

ABOUT THE TALK: Building and curating representative datasets is crucial for accurate ML systems. Monitoring metrics post-deployment helps improve the model. Unstructured language models may face data shifts, leading to unpredictable inferences. Open-source APIs and annotation tools streamline annotation and reduce analyst workload.

This talk discusses generating datasets and real-time precision/recall splits to detect data shifts, prioritize data collection, and retrain models.
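As a rough illustration of the idea of tracking precision/recall splits over time, the sketch below groups labeled predictions into time windows and flags windows where either metric drops noticeably against a baseline window. It is a minimal sketch, not the speaker's method: the function names, the `max_drop` threshold, and the sample data are all hypothetical.

```python
from collections import defaultdict


def precision_recall(records):
    """Compute (precision, recall) from (predicted, actual) boolean pairs."""
    tp = sum(1 for p, a in records if p and a)
    fp = sum(1 for p, a in records if p and not a)
    fn = sum(1 for p, a in records if not p and a)
    precision = tp / (tp + fp) if (tp + fp) else 0.0
    recall = tp / (tp + fn) if (tp + fn) else 0.0
    return precision, recall


def metrics_by_window(events):
    """Group (window_id, predicted, actual) events and score each window."""
    buckets = defaultdict(list)
    for window, predicted, actual in events:
        buckets[window].append((predicted, actual))
    return {w: precision_recall(r) for w, r in sorted(buckets.items())}


def flag_shift(metrics, baseline_window, max_drop=0.1):
    """Return windows whose precision or recall fell more than max_drop
    below the baseline window (max_drop is an assumed, tunable threshold)."""
    base_p, base_r = metrics[baseline_window]
    return [
        w for w, (p, r) in metrics.items()
        if w != baseline_window
        and (base_p - p > max_drop or base_r - r > max_drop)
    ]


# Hypothetical annotated predictions: (window, predicted, actual)
events = [
    ("week1", True, True), ("week1", True, True),
    ("week1", False, False), ("week1", True, False),
    ("week2", True, False), ("week2", True, False),
    ("week2", False, True), ("week2", True, True),
]
metrics = metrics_by_window(events)
shifted = flag_shift(metrics, "week1")
```

Windows returned by `flag_shift` are candidates for prioritized data collection and retraining; in practice the window size and threshold would need tuning to the annotation volume available.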

ABOUT THE SPEAKER: Ivan Aguilar is a data scientist at Teleskope focused on building scalable models for detecting PII/PHI/Secrets and other compliance-related entities within customers' clouds. Prior to joining Teleskope, Ivan was an ML Engineer at Forge.AI, a Boston-based shop working on information extraction, content extraction, and other NLP-related tasks.

AI/ML Analytics API Data Collection Data Engineering NLP
Ricky Saporta – SVP of Data @ Entera

ABOUT THE TALK: During this talk, we'll make the argument that by aligning your product with data's core purpose, you increase adoption of your product and accelerate growth.

We'll propose a framework for Data Product Management that ensures this vital alignment is consistently held while catalyzing development and shortening time-to-outcome.

Along the way, we will show how to best structure your company's data org based on your current stage of growth in pursuit of improving the delivery of data products and enhancing outcomes for customers/end-users.

ABOUT THE SPEAKER: Ricky Saporta is passionate about how people learn to make great decisions. A builder of data teams, Ricky is currently serving as SVP of Data at Entera. He spent the prior four years at The Farmer's Dog as Head of Data Strategy.

AI/ML Analytics Data Engineering
Dean Pleban – CEO @ DagsHub

ABOUT THE TALK: While giving a talk to a group of up-and-coming data scientists, a question that surprised Dean Pleban was: "When you say “production”, what exactly do you mean?"

In this talk, Dean defines what production actually means. He presents a first-principles, step-by-step approach to thinking about deploying a model to production, talks about the challenges you might face in each step, and provides further reading if you want to dive deeper into each one.

ABOUT THE SPEAKER: Dean Pleban has a background combining physics and computer science. He’s worked on quantum optics and communication, computer vision, software development and design. He’s currently CEO at DagsHub, where he builds products that enable data scientists to work together and get their models to production, using popular open source tools. He’s also the host of the MLOps Podcast, where he speaks with industry experts about ML in production.

AI/ML Analytics Computer Science Data Engineering MLOps
Katie Hindson – Head of Product and Data @ Lightdash

ABOUT THE TALK: Building data tools requires us to not only think about the data team, but also about the people that the data team is serving: business users, or "non-data team people".

This talk covers why it is important to consider both of these personas when building data tools, and why doing so can be complicated. We will talk through a few principles we can use to build data products that are great for everyone (not just the data team!).

ABOUT THE SPEAKER: As a product manager with a background in data science, Katie Hindson loves building data products. Currently, she's working at Lightdash, an open-source BI tool that instantly turns your dbt project into a full-stack BI platform. Katie is really interested in the interaction between data teams, their tools, and the rest of the company - because the best data teams are the ones that can help everyone at the company make better decisions, faster.

AI/ML Analytics BI Data Engineering Data Science dbt Lightdash
Hamel Hussain – entrepreneur-in-residence @ fast.ai

ABOUT THE TALK: In this talk, Hamel Hussain discusses innovative approaches and tools for software development, their history, and future directions. He dives into the historical threads upon which these new approaches are built and talks about nbdev, a popular open-source project that implements many of these ideas. He also shares learnings from building nbdev, along with challenges and future directions.

ABOUT THE SPEAKER: Hamel Hussain is an entrepreneur-in-residence at fast.ai, where he is building new software development tools like nbdev. Prior to fast.ai, Hamel was a machine learning engineer at companies like Airbnb, GitHub, and DataRobot, and held related roles in management consulting.

AI/ML Analytics Data Engineering GitHub
Shir Chorev – co-founder and CTO @ Deepchecks

ABOUT THE TALK: As machine learning models become more common in production, organizations are recognizing the significance of continuous validation and are integrating automated testing into their CI/CD pipelines to ensure that their models remain relevant and trustworthy. However, with constantly changing data and black-box logic, testing these models can be a daunting task.

In this talk, we explore the common pitfalls of ML models and best practices for testing them. We demonstrate how to use the deepchecks open source package to validate models and data during the research and CI/CD phases.

ABOUT THE SPEAKER: Shir Chorev is the co-founder and CTO of Deepchecks, an MLOps startup for continuous validation of ML models and data. Previously, Shir worked at the Prime Minister’s Office and at Unit 8200, conducting and leading research on various machine learning and cybersecurity-related challenges.

AI/ML Analytics CI/CD Data Engineering MLOps