talk-data.com talk-data.com

Topic

GenAI

Generative AI

ai machine_learning llm

1517

tagged

Activity Trend

192 peak/qtr
2020-Q1 2026-Q1

Activities

1517 activities · Newest first

Not Another LLM Talk… Practical Lessons from Building a Real-World Adverse Media Pipeline

LLMs are magical—until they aren’t. Extracting adverse media entities might sound straightforward, but throw in hallucinations, inconsistent outputs, and skyrocketing API costs, and suddenly, that sleek prototype turns into a production nightmare.

Our adverse media pipeline monitors over 1 million articles a day, sifting through vast amounts of news to identify reports of crimes linked to financial bad actors, money laundering, and other risks. Thanks to GenAI and LLMs, we can tackle this problem in new ways—but deploying these models at scale comes with its own set of challenges: ensuring accuracy, controlling costs, and staying compliant in highly regulated industries.

In this talk, we’ll take you inside our journey to production, exploring the real-world challenges we faced through the lens of key personas: Cautious Claire, the compliance officer who doesn’t trust black-box AI; Magic Mike, the sales lead who thinks LLMs can do anything; Just-Fine-Tune Jenny, the PM convinced fine-tuning will solve everything; Reinventing Ryan, the engineer reinventing the wheel; and Paranoid Pete, the security lead fearing data leaks.

Expect practical insights, cautionary tales, and real-world lessons on making LLMs reliable, scalable, and production-ready. If you've ever wondered why your pipeline works perfectly in a Jupyter notebook but falls apart in production, this talk is for you.

Tackling Data Challenges for Scaling Multi-Agent GenAI Apps with Python

The use of multiple Large Language Models (LLMs) working together perform complex tasks, known as multi-agent systems, has gained significant traction. While orchestration frameworks like LangGraph and Semantic Kernel can streamline orchestration and coordination among agents, developing large-scale, production-grade systems can bring a host of data challenges. Issues such as supporting multi-tenancy, preserving transactional integrity and state, and managing reliable asynchronous function calls while scaling efficiently can be difficult to navigate.

Leveraging insights from practical experiences in the Azure Cosmos DB engineering team, this talk will guide you through key considerations and best practices for storing, managing, and leveraging data in multi-agent applications at any scale. You’ll learn how to understand core multi-agent concepts and architectures, manage statefulness and conversation histories, personalize agents through retrieval-augmented generation (RAG), and effectively integrate APIs and function calls.

Aimed at developers, architects, and data scientists at all skill levels, this session will show you how to take your multi-agent systems from the lab to full-scale production deployments, ready to solve real-world problems. We’ll also walk through code implementations that can be quickly and easily put into practice, all in Python.

Small Language Models (SLMs) provide a viable alternative to Large Language Models (LLMs) for developing high-performing, cost-effective, and secure generative AI solutions. CDAOs, AI architects, and data architects should attend this presentation to gain insights into the strengths and weaknesses of SLMs and discover five specific use cases where SLMs outperform LLMs.

Analytics is experiencing another monumental change. Just as visual drag and drop BI tools and augmented insights led to changes in analytics delivery, we now experience conversational interfaces, automated workflows and AI agents that cause us to rethink how analytics will be done. Join this session to learn the new technologies that are making an impact and how this will affect plans for future investment in analytics tools, platforms and solutions.

As organizations scale GenAI from concept to production, they face challenges like ensuring accuracy, explaining responses, and connecting GenAI to unique knowledge. This session shows how GraphRAG combines knowledge graphs with retrieval-augmented generation to build GenAI apps grounded in enterprise data. Learn how companies like Klarna have deployed GenAI to build chatbots grounded in knowledge graphs, improving productivity and trust, while a major gaming company achieved 10x faster insights. We'll share real examples and practical steps for successful GenAI deployment.

D&A value is not possible without data storytelling that offers a better way to engage communication findings than just BI reporting or data science notebooks. Join this session to know about the fundamentals of data storytelling and how to fill the gap between data science speakers and decision makers. It further discusses how to tell the best data storytelling and how to upscale data storytelling for future in landscape of GenAI.

CDAOs and AI leaders often struggle to get started with GenAI. Attend this session to understand the first critical components you need to build or buy: Data, AI Engineering tools, a search and retrieval system, the application, and the right types of models. With these building blocks, you can build several working GenAI prototypes to help you prove the value and justify further investments.

Productivity and operational efficiency are one of the key measures of business performance and economics. GenAI has promising capabilities of improving productivity and operational efficiency of data management function, and data governance. Organizations should explore and assess those capabilities to align it with strategic goals to improve the productivity and operational efficiency.

As AI evolves into more agentic forms, capable of autonomous decision-making and complex interactions, the readiness of your data becomes a mission-critical priority. This roundtable gathers data & analytics leaders to explore the unique challenges of preparing data ecosystems for agentic AI. Discussions will focus on overcoming barriers such as data quality gaps, governance complexities, and scalability issues, while highlighting the transformative role of technologies like generative AI, data fabrics, and metadata-driven governance

AstraZeneca has implemented a "platform" approach, which serves as a centralized repository of standardized, enterprise grade, reusable services and capabilities that are accessible to AI factories. This platform includes user interfaces, APIs that integrate AI services with enterprise systems, supporting resources like data import tools and agent orchestration services. AstraZeneca will share how, starting with a few generative AI use cases, they have successfully identified common services and capabilities, subsequently standardizing these elements to maximize their applicability through the platform. These solutions leverage technologies like GPT models, Natural Language Processing and Retrieval Augmented Generation (RAG) architecture.

D&A value is not possible without data storytelling that offers a better way to engage communication findings than just BI reporting or data science notebooks. Join this session to know about the fundamentals of data storytelling and how to fill the gap between data science speakers and decision makers. It further discusses how to tell the best data storytelling and how to upscale data storytelling for future in landscape of GenAI.

As enterprises embrace GenAI and intelligent agents, securing sensitive data—like PII, financial records, and IP—while maintaining compliance is crucial. This session explores how Skyflow helps meet modern privacy demands, including India’s DPDP Act, using polymorphic encryption, tokenization, consent management, and fine-grained access controls. See real-world architectures that show how to embed privacy into both legacy and AI-first systems, enabling innovation without compromising security or regulatory compliance.

CDAOs and AI leaders are grappling with two crucial questions: 1. What public cloud provider should we choose for AI and GenAI initiatives, and 2. how do we assemble the right cloud architecture to scale and deploy AI more effectively?
This session compares public cloud AI and Generative AI architectures from AWS, Azure and GCP and provides insights on their points of differentiation.

Beyond the Bill: Gaining Granular Databricks Cost Insights with Data Apps | The Data Apps Conference

Managing cloud costs requires accurate resource tagging, but maintaining completeness and accuracy is a challenge. In this session, Mitchell Ertle (Senior Partner Solutions Architect) and Josue Bogran (Data & AI Architect) demonstrate how Sigma and Databricks combine to streamline FinOps and resource management with AI-driven cost attribution and workflow automation.

Through a practical demonstration, you'll see:

Identify and classify untagged Databricks pipelines with a cost attribution app Use GenAI from Databricks to suggest tags with human-in-the-loop approval Enable bidirectional data flow between Sigma and Databricks for real-time updates Automate workflows with Sigma’s actions framework Ensure security and governance by inheriting Unity Catalog permissions Discover why this combination is powerful—Sigma provides intuitive application building while Databricks delivers computation, AI/ML capabilities, and data storage. These platforms create solutions business users can interact with directly, without technical expertise.

Whether in data engineering, finance, or operations, learn how Sigma + Databricks can automate workflows, optimize costs, and drive business impact.

➡️ Learn more about Data Apps: https://www.sigmacomputing.com/product/data-applications?utm_source=youtube&utm_medium=organic&utm_campaign=data_apps_conference&utm_content=pp_data_apps


➡️ Sign up for your free trial: https://www.sigmacomputing.com/go/free-trial?utm_source=youtube&utm_medium=video&utm_campaign=free_trial&utm_content=free_trial

sigma #sigmacomputing #dataanalytics #dataanalysis #businessintelligence #cloudcomputing #clouddata #datacloud #datastructures #datadriven #datadrivendecisionmaking #datadriveninsights #businessdecisions #datadrivendecisions #embeddedanalytics #cloudcomputing #SigmaAI #AI #AIdataanalytics #AIdataanalysis #GPT #dataprivacy #python #dataintelligence #moderndataarchitecture

Creating a business case for GenAI is hard. GenAI initiatives produce outcomes that range from everyday to game-changing and cannot be described by a single currency of value. Gartner has identified three business case types and recommends CIOs create a value portfolio that encompasses the variety of GenAI investments enterprises make. Come to this session to learn how to frame your GenAI business case and align executive expectations on value creation from GenAI.

The emergence of Agentic AI will revolutionize automation and efficiency gains cutting across businesses. However, challenges like hallucination and inconsistency hinder the immediate adoption of GenAI models. Utilizing Agentic AI to tame the inherent variability of unstructured data, which is at the core of every white collar work, is the basic step to exploit larger AgenticAI automation in enterprises. This is based on our experience of designing hundreds of automation solutions. We will showcase a few use-cases in this session.

The fourth industrial revolution is fueled by AI and data. This talk will describe how to create and consume data products leveraging capabilities of Generative AI. Empowering knowledge workers with self-service access to data is an essential capability of a modern enterprise. Robust metadata and a framework for Minimum Viable Governance (MVG) is at the heart of trusted AI. The use of Generative AI to automate creation of data products will be discussed along with critical techniques to ensure integrity of the data product design. The use of natural language interfaces for interrogating data products will also be discussed.
-Learn about the anatomy of data as a product in support of corporate AI initiatives.
-Learn about the concept of Minimum Viable Governance to drive the right balance between agility and governance.
-Learn how business users can be empowered to create data products through a scalable manufacturing process using Generative AI.