talk-data.com
Speaker
Salma Bakouk
7
talks
Salma Bakouk is the CEO and co-founder of Sifflet, the leading holistic data observability platform. Since launching the company in 2021, she has been dedicated to building trust in data for her clients. With deep expertise in developing scalable data pipelines and implementing advanced analytics solutions, Salma has positioned Sifflet as a frontrunner in the data observability space. Before founding Sifflet, she served as an Executive Director at Goldman Sachs, where she honed her skills in managing complex data and analytics projects.
Bio from: Big Data & AI Paris 2025
Filter by Event / Source
Talks & appearances
7 activities · Newest first
À l'ère de la production d'IA, les enjeux liés à la qualité et à la fiabilité des données sont plus critiques que jamais, surtout avec l'augmentation constante du volume des actifs de données.
Face à ce constat, les équipes data sont confrontées à un défi de taille : comment fournir des données prêtes pour l’IA sans augmenter leurs effectifs ? C’est là qu’intervient l’observabilité des données agentique.
Rejoignez cette session pour découvrir cette nouvelle ère :
- Comment les agents vont révolutionner et accélérer les workflows d’observabilité des données
- Pourquoi les agents fourniront une qualité de données contextualisée selon les équipes, et pourquoi cela est important
- Comment les interfaces en langage naturel vont encore démocratiser l’observabilité des données à l’échelle de l’organisation
Vous repartirez avec une vision claire de la façon dont le marché de l’observabilité des données intègre les capacités de l’IA pour accompagner les entreprises dans leur quête d’analytique de pointe et de données prêtes pour l’IA.
AI is only as strong as the data beneath it. Yet most enterprises still rely on fragmented tools and reactive processes that undermine trust. The result: innovation that looks impressive in demos but collapses under real-world pressure. In this keynote, Salma Bakouk, CEO of Sifflet, argues that metadata, not models, is the true foundation for the AI era. By building a metadata control plane enriched with agentic observability, enterprises can move from reactive patchwork to proactive intelligence. In this keynote, she offers a provocative vision of where the market is heading, what traditional approaches are getting wrong, and why the winners of the AI economy will be those who treat trust not as insurance, but as infrastructure.
Salma Bakouk (CEO of Sifflet) and I discuss the evolving data and AI landscape, the rise of data observability in the age of AI, balancing personal and professional life as a founder, and much more.
Industry leaders from Adaptavist and Givaudan will share insights into their data stack and maturity, the importance of data observability, and who should own it internally. They will discuss their journey to securing buy-in for data observability, offer advice for organizations beginning their journey, and explore the future role of automation and AI in this space.
In this talk, Pete from Penguin Random House will introduce his 3 E's theory, highlighting the critical steps of Establishing a data structure, Enabling teams, and Exploiting results for optimal impact. He'll delve into how trust in both data and process, underpinned by data observability, is essential for transforming data skepticism into meaningful collaboration.
Summary The problems that are easiest to fix are the ones that you prevent from happening in the first place. Sifflet is a platform that brings your entire data stack into focus to improve the reliability of your data assets and empower collaboration across your teams. In this episode CEO and founder Salma Bakouk shares her views on the causes and impacts of "data entropy" and how you can tame it before it leads to failures.
Announcements
Hello and welcome to the Data Engineering Podcast, the show about modern data management When you’re ready to build your next pipeline, or want to test out the projects you hear about on the show, you’ll need somewhere to deploy it, so check out our friends at Linode. With their new managed database service you can launch a production ready MySQL, Postgres, or MongoDB cluster in minutes, with automated backups, 40 Gbps connections from your application hosts, and high throughput SSDs. Go to dataengineeringpodcast.com/linode today and get a $100 credit to launch a database, create a Kubernetes cluster, or take advantage of all of their other services. And don’t forget to thank them for their continued support of this show! Modern data teams are dealing with a lot of complexity in their data pipelines and analytical code. Monitoring data quality, tracing incidents, and testing changes can be daunting and often takes hours to days or even weeks. By the time errors have made their way into production, it’s often too late and damage is done. Datafold built automated regression testing to help data and analytics engineers deal with data quality in their pull requests. Datafold shows how a change in SQL code affects your data, both on a statistical level and down to individual rows and values before it gets merged to production. No more shipping and praying, you can now know exactly what will change in your database! Datafold integrates with all major data warehouses as well as frameworks such as Airflow & dbt and seamlessly plugs into CI workflows. Visit dataengineeringpodcast.com/datafold today to book a demo with Datafold. RudderStack helps you build a customer data platform on your warehouse or data lake. Instead of trapping data in a black box, they enable you to easily collect customer data from the entire stack and build an identity graph on your warehouse, giving you full visibility and control. Their SDKs make event streaming from any app or website easy, and their extensive library of integrations enable you to automatically send data to hundreds of downstream tools. Sign up free at dataengineeringpodcast.com/rudder Data teams are increasingly under pressure to deliver. According to a recent survey by Ascend.io, 95% in fact reported being at or over capacity. With 72% of data experts reporting demands on their team going up faster than they can hire, it’s no surprise they are increasingly turning to automation. In fact, while only 3.5% report having current investments in automation, 85% of data teams plan on investing in automation in the next 12 months. 85%!!! That’s where our friends at Ascend.io come in. The Ascend Data Automation Cloud provides a unified platform for data ingestion, transformation, orchestration, and observability. Ascend users love its declarative pipelines, powerful SDK, elegant UI, and extensible plug-in architecture, as well as its support for Python, SQL, Scala, and Java. Ascend automates workloads on Snowflake, Databricks, BigQuery, and open source Spark, and can be deployed in AWS, Azure, or GCP. Go to dataengineeringpodcast.com/ascend and sign up for a free trial. If you’re a data engineering podcast listener, you get credits worth $5,000 when you become a customer. Your host is Tobias Macey and today I’m interviewing Salma Bakouk about achieving data reliability and reducing entropy within your data stack with sifflet
Interview
Introduction How did you get involved in the area of data management? Can you describe what Sifflet is and the st