talk-data.com
Activities & events
| Title & Speakers | Event |
|---|---|
|
Unity Catalog & Governance at Scale with Databricks
2025-11-13 · 12:00
SPECIAL GUEST: Kiran Sreekumar, Databricks Architect. Data without governance is not just unmanageable, it’s a liability. As organisations scale their data platforms across teams, regions, and clouds, the need for consistent governance becomes critical. In this session, we’ll explore how Databricks Unity Catalog provides a unified approach to governance, enabling:
We’ll share real-world examples of how enterprises are leveraging Unity Catalog to:
Whether you’re a data engineer, architect, or governance lead, this session will provide practical insights into building a trusted data foundation with Unity Catalog. Why attend? ✅ Understand how to operationalise governance at scale ✅ See how leading organisations manage compliance in real-world scenarios ✅ Learn best practices to unlock secure collaboration and faster delivery Join us to discover how to turn governance from a blocker into an enabler of data and AI success. |
Unity Catalog & Governance at Scale with Databricks
|
|
Geospatial Data on Databricks
2025-11-05 · 12:00
Most organizations capture huge volumes of spatial data, including addresses, coordinates, routes, and catchments, but struggle to operationalize it at scale. Traditional GIS (Geographic Information Systems) tools are powerful but isolated; unlocking value requires integrating spatial analytics directly within your data platform. In this session, we’ll cover: - Geospatial fundamentals on Databricks: understanding geometry vs. geography, coordinate systems, and H3 grids. - Scaling challenges: combining spatial and business data, processing millions of coordinates efficiently, and maintaining real-time freshness. - Databricks capabilities: how Spatial SQL, Lakeflow, and Unity Catalog enable native spatial processing, federated access, and governed sharing across teams. - Applied use cases: from network optimisation to asset tracking and location-based insights across industries. We'll finish with a live demo showing how raw coordinates become actionable intelligence within the Lakehouse. Why attend: - Learn how to bring geospatial analytics natively into Databricks. - Discover best practices for scaling spatial workloads efficiently. - Understand how Unity Catalog underpins governance and reusability. - See real-world examples and a live demo in action. Join us to learn how Databricks unifies spatial and analytical workloads, delivering governed, high-performance geospatial insight at enterprise scale. This session will be delivered by Unifeye's CDO and Databricks Champion Bianca Stratulat and Senior Data Engineers Jordan Begg and Hasnat Abdul. |
Geospatial Data on Databricks
|
|
Powering Secure and Scalable Data Governance at PepsiCo With Unity Catalog Open APIs
2025-06-12 · 00:20
Dipankar Kushari
– Lead Specialist Solutions Architect
@ Databricks
,
Sudipta Das
– Enterprise Data Operations Director
@ PepsiCo
PepsiCo, given its scale, has numerous teams leveraging different tools and engines to access data and perform analytics and AI. To streamline governance across this diverse ecosystem, PepsiCo unifies its data and AI assets under an open and enterprise-grade governance framework with Unity Catalog. In this session, we'll explore real-world examples of how PepsiCo extends Unity Catalog’s governance to all its data and AI assets, enabling secure collaboration even for teams outside Databricks. Learn how PepsiCo architects permissions using service principals and service accounts to authenticate with Unity Catalog, building a multi-engine architecture with seamless and open governance. Attendees will gain practical insights into designing a scalable, flexible data platform that unifies governance across all teams while embracing openness and interoperability. |
|
|
Sponsored by: Impetus Technologies | Future-Ready Data at Scale: How Shutterfly Modernized for GenAI-Driven Personalization
2025-06-12 · 00:20
Catalina Toba
– Director, Data Engineering
@ Shutterfly
,
Chris Raub
– Sales Director - US West
@ Impetus
As a leading personalized product retailer, Shutterfly needed a modern, secure, and performant data foundation to power GenAI-driven customer experiences. However, their existing stack was creating roadblocks in performance, governance, and machine learning scalability. In partnership with Impetus, Shutterfly embarked on a multi-phase migration to Databricks Unity Catalog. This transformation not only accelerated Shutterfly’s ability to provide AI-driven personalization at scale but also improved governance, reduced operational overhead, and laid a scalable foundation for GenAI innovation. Join experts from Databricks, Impetus, and Shutterfly to discover how this collaboration enabled faster data-driven decision-making, simplified compliance, and unlocked the agility needed to meet evolving customer demands in the GenAI era. Learn from their journey and take away best practices for your own modernization efforts. |
|
|
Stop Guessing, Spend Where It Counts: Data-Driven Decisions for High-Impact Investments on Databricks
2025-06-11 · 23:10
Clara MacAvoy
– Software Engineer
@ Databricks
,
Bruce Wong
– Sr. Director, Data Platform
@ Databricks
Struggling with runaway cloud costs as your organization grows? Join us for an inside look at how Databricks’ own Data Platform team tackled escalating spend in some of the world’s largest workspaces — saving millions of dollars without sacrificing performance or user experience. We’ll share how we harnessed powerful features like System Tables, Workflows, Unity Catalog, and Photon to monitor and optimize resource usage, all while using data-driven decisions to improve efficiency and ensure we invest in the areas that truly drive business impact. You’ll hear about the real-world challenges we faced balancing governance with velocity and discover the custom tooling and best practices we developed to keep costs in check. By the end of this session, you’ll walk away with a proven roadmap for leveraging Databricks to control cloud spend at scale. |
|
|
Unlocking Enterprise Potential: Key Insights from P&G's Deployment of Unity Catalog at Scale
2025-06-11 · 21:00
Kinga Morawska
– Engineering Manager
@ P&G
This session will explore P&G's implementation of Databricks Unity Catalog (UC) to enhance data governance, reduce data redundancy, and improve the developer experience through the enablement of a Lakehouse architecture. The presentation will cover: - The distinction between data treated as a product and standard application data, highlighting how UC's structure maximizes the value of data in P&G's data lake. - Real-life examples from two years of using Unity Catalog, demonstrating benefits such as improved governance, reduced waste, and enhanced data discovery. - Challenges related to disaster recovery and external data access, along with our collaboration with Databricks to address these issues. Sharing our experience can provide valuable insights for organizations planning to adopt Unity Catalog on an enterprise scale. |
|
|
Learning from Goldman Sachs' Legend Lakehouse for Data Governance
2025-06-11 · 20:50
George Wu
– Vice President
@ Goldman Sachs
,
Abhishek Narang
– Managing Director & Technology Fellow
@ Goldman Sachs
Data is the backbone of modern decision-making, but centralizing it is only the tip of the iceberg. Entitlements, secure sharing and just-in-time availability are critical challenges to any large-scale platform. Join Goldman Sachs as we reveal how our Legend Lakehouse, coupled with Databricks, overcomes these hurdles to deliver high-quality, governed data at scale. By leveraging an open table format (Apache Iceberg) and open catalog format (Unity Catalog), we ensure platform interoperability and vendor neutrality. Databricks Unity Catalog then provides a robust entitlement system that aligns with our data contracts, ensuring consistent access control across producer and consumer workspaces. Finally, Legend functions, integrating with Databricks User Defined Functions (UDF), offer real-time data enrichment and secure transformations without exposing raw datasets. Discover how these components unite to streamline analytics, bolster governance and power innovation. |
|
|
Sponsored by: Informatica | Modernize analytics and empower AI in Databricks with trusted data using Informatica
2025-06-11 · 19:40
Rik Tamm-Daniels
– GVP Ecosystems and Technology
@ Informatica
,
Ajay GOLLAPALLI
– Director
@ Informatica
As enterprises continue their journey to the cloud, data warehouse and data management modernization is essential to optimize analytics and drive business outcomes. Minimizing modernization timelines is important for reducing risk and shortening time to value – and ensuring enterprise data is clean, curated and governed is imperative to enable analytics and AI initiatives. In this session, learn how Informatica's Intelligent Data Management Cloud (IDMC) empowers analytics and AI on Databricks by helping data teams: · Develop no-code/low-code data pipelines that ingest, transform and clean data at enterprise scale · Improve data quality and extend enterprise governance with Informatica Cloud Data Governance and Catalog (CDGC) and Unity Catalog · Accelerate pilot-to-production with Mosaic AI |
|
|
Managing Databricks at Scale
2025-06-11 · 19:40
Vikas Ranjan
– Senior Manager, Network Data & AI
@ T-Mobile
T-Mobile’s leadership in 5G innovation and its rapid growth in the fixed wireless business have led to an exponential increase in data, reaching hundreds of terabytes daily. This session explores how T-Mobile uses Databricks to manage this data efficiently, focusing on scalable architecture with Delta Lake, auto-scaling clusters, performance optimization through data partitioning and caching, and comprehensive data governance with Unity Catalog. Additionally, it covers cost management, collaborative tools, and AI-driven productivity tools, highlighting how these strategies empower T-Mobile to innovate, streamline operations, and maximize data impact across network optimization, community support, energy management, and more. |
|
|
Sponsored by: Hexaware | Global Data at Scale: Powering Front Office Transformation with Databricks
2025-06-11 · 19:20
Bindu Birur
– Head of Analytics & Data Engineering Delivery
@ KPMG
Join KPMG for an engaging session on how we transformed our data platform and built a cutting-edge Global Data Store (GDS), a game-changing data hub for our Front Office Transformation (FOT). Discover how we seamlessly unified data from various member firms, turning it into a dynamic engine that enables our business to leverage our Front Office ecosystem for smarter analytics and decision-making. Learn about our unique approach that rapidly integrates diverse datasets into the GDS and our hub-and-spoke model, which connects member firms’ data lakes and enables secure, high-speed collaboration via Delta Sharing. Hear how we are leveraging Unity Catalog to help ensure data governance, compliance, and straightforward data lineage. We’ll share strategies for risk management, security (fine-grained access, encryption), and scaling a cloud-based data ecosystem. |
|
|
Advanced Data Access Control for the Exabyte Era: Scaling with Purpose
2025-06-11 · 00:20
Arpan Ghosh
– Engineering Manager
@ Databricks
,
Shuting Zhang
– Software Engineer
@ Databricks
As data-driven companies scale from small startups to global enterprises, managing secure data access becomes increasingly complex. Traditional access control models fall short at enterprise scale, where dynamic, purpose-driven access is essential. In this talk, we explore how our “Just-in-Time” Purpose-Based Access Control (PBAC) platform addresses the evolving challenges of data privacy and compliance, maintaining least privilege while ensuring productivity. Using features like Unity Catalog, Delta Sharing & Databricks Apps, the platform delivers real-time, context-aware data governance. Leveraging JIT PBAC keeps your data secure, your engineers productive, your legal & security teams happy and your organization future-proof in the ever-evolving compliance landscape. |
|
|
Sponsored by: Alation | Better Together: Enterprise Catalog with Databricks & Alation at American Airlines
2025-06-10 · 23:30
Anuradha Maradapu
– Manager, Data Governance & Engineering
@ American Airlines
In the era of data-driven enterprises, true democratization requires more than just access: it demands context, trust, and governance at scale. In this session, discover how to seamlessly integrate Databricks Unity Catalog with Alation’s Enterprise Data Catalog to deliver: - End-to-End Lineage Storytelling: Unify technical and business views into a single, cohesive narrative that resonates with both technical engineers and non-technical stakeholders across business domains. - Accelerated and Democratized Insights: Automate metadata stitching to reduce time-to-insight, enabling analysts to answer critical business questions faster and drive multi-domain collaboration. - Empowered, Trustworthy Discovery: Equip business users with a unified platform, populated with rich documentation and usage signals, so they can find, understand, and confidently use trusted data assets. |
|
|
Trust You Can Measure: Data Quality Standards in The Lakehouse
2025-06-10 · 20:50
Amit Pahwa
– Staff Software Engineer
@ Databricks
,
Sergiy Kanyshchev
– Staff Software Engineer
@ Databricks
Do you trust your data? If you’ve ever struggled to figure out which datasets are reliable, well-governed, or safe to use, you’re not alone. At Databricks, our own internal lakehouse faced the same challenge—hundreds of thousands of tables, but no easy way to tell which data met quality standards. In this talk, the Databricks Data Platform team shares how we tackled this problem by building the Data Governance Score—a way to systematically measure and surface trust signals across the entire lakehouse. You’ll learn how we leverage Unity Catalog, governed tags, and enforcement to drive better data decisions at scale. Whether you're a data engineer, platform owner, or business leader, you’ll leave with practical ideas on how to raise the bar for data quality and trust in your own data ecosystem. |
|
|
Empowering Fundraising With AI: A Journey With Databricks Mosaic AI
2025-06-10 · 18:30
Amina Alavi
– Director of Data Science
@ Doctors Without Borders
Artificial Intelligence (AI) is more than a corporate tool; it’s a force for good. At Doctors Without Borders/Médecins Sans Frontières (MSF), we use AI to optimize fundraising, ensuring that every dollar raised directly supports life-saving medical aid worldwide. With Databricks, Mosaic AI and Unity Catalog, we analyze donor behavior, predict giving patterns and personalize outreach, increasing contributions while upholding ethical AI principles. This session will showcase how AI maximizes fundraising impact, enabling faster crisis response and resource allocation. We’ll explore predictive modeling for donor engagement, secure AI governance with Unity Catalog and our vision for generative AI in fundraising, leveraging AI-assisted storytelling to deepen donor connections. AI is not just about efficiency; it’s about saving lives. Join us to see how AI-driven fundraising is transforming humanitarian aid on a global scale. |
|
|
ThredUp’s Journey with Databricks: Modernizing Our Data Infrastructure
2025-06-10 · 15:00
Aniket Mane
– VP, Data platform & Enterprise Apps Engg
@ ThredUp Inc.
,
Chintan Patel
– Data Engineering Manager
@ ThredUp
Building an AI-ready data platform requires strong governance, performance optimization, and seamless adoption of new technologies. At ThredUp, our Databricks journey began with a need for better data management and evolved into a full-scale transformation powering analytics, machine learning, and real-time decision-making. In this session, we’ll cover: - Key inflection points: Moving from legacy systems to a modernized Delta Lake foundation. - Unity Catalog’s impact: Improving governance, access control, and data discovery. - Best practices for onboarding: Ensuring smooth adoption for engineering and analytics teams. - What’s next: Serverless SQL and conversational analytics with Genie. Whether you’re new to Databricks or scaling an existing platform, you’ll gain practical insights on navigating the transition, avoiding pitfalls, and maximizing AI and data intelligence. |
|
|
Sponsored by: Avanade | Accelerating Adoption of Modern Analytics and Governance at Scale
2023-07-26 · 21:11
To unlock all the competitive advantage Databricks offers your organization, you might need to update your strategy and methodology for the platform. With more than 1,000 Databricks projects completed globally in the last 18 months, we are going to share our insights on the best building blocks to target as you search for efficiency and competitive advantage. These building blocks include enterprise metadata and data management services, a data management foundation, and data services and products that enable business units to fully use their data and analytics at scale. In this session, Avanade data leaders will highlight how Databricks’ modern data stack fits the Azure PaaS and SaaS ecosystem (such as Microsoft Fabric), how Unity Catalog metadata supports automated data operations scenarios, and how we are helping clients measure the business impact and value of modern analytics and governance. Talk by: Alan Grogan and Timur Mehmedbasic Here’s more to explore: State of Data + AI Report: https://dbricks.co/44i2HBp Databricks named a Leader in 2022 Gartner® Magic Quadrant™ CDBMS: https://dbricks.co/3phw20d Connect with us: Website: https://databricks.com Twitter: https://twitter.com/databricks LinkedIn: https://www.linkedin.com/company/databricks Instagram: https://www.instagram.com/databricksinc Facebook: https://www.facebook.com/databricksinc |
|
|
Enabling Data Governance at Enterprise Scale Using Unity Catalog
2023-07-26 · 21:11
Amgen has invested in building modern, cloud-native enterprise data and analytics platforms over the past few years with a focus on tech rationalization, data democratization, overall user experience, increased reusability, and cost-effectiveness. One of these platforms is our Enterprise Data Fabric, which focuses on pulling in data across functions and providing capabilities to integrate and connect the data and govern access. For a while, we have been trying to set up robust data governance capabilities that are simple and easy to manage through Databricks. There were a few tools in the market that solved a few immediate needs, but none solved the problem holistically. For use cases like maintaining governance on highly restricted data domains like Finance and HR, a long-term solution native to Databricks and addressing the limitations below was deemed important: The way these tools were set up allowed the overriding of a few security policies
To address these challenges, and for large-scale enterprise adoption of our governance capability, we started working on UC integration with our governance processes, with the aim of realizing the following tech benefits:
Today, using UC, we have to implement fine-grained access control & governance for the restricted data of Amgen. We are in the process of devising a realistic migration & change management strategy across the enterprise. Talk by: Lakhan Prajapati and Jaison Dominic |
|
|
Best Practices for Setting Up Databricks SQL at Enterprise Scale
2023-07-26 · 21:05
Paul Roome
– Senior Staff Product Manager
@ Databricks
,
Jeremy Lewallen
– Product Manager
@ Databricks
,
Siddharth Bhai
– Product Management
@ Databricks
,
Samrat Ray
– Senior Staff Product Manager
@ Databricks
To learn more, visit the Databricks Security and Trust Center: https://www.databricks.com/trust In this session, we will talk about the best practices for setting up Databricks to run at large enterprise scale with thousands of users, departmental security and governance, and end-to-end lineage from ingestion to BI tools. We’ll showcase the power of Unity Catalog and Databricks SQL as the core of your modern data stack and how to achieve data, environment, and financial governance while empowering your users to quickly find and access the data they need. Talk by: Siddharth Bhai, Paul Roome, Jeremy Lewallen, and Samrat Ray |
|
|
Sponsored by: Privacera | Applying Advanced Data Security Governance with Databricks Unity Catalog
2023-07-26 · 21:04
This talk explores the application of advanced data security and access control integrated with Databricks Unity Catalog through Privacera. Learn about the capabilities of Databricks with Unity Catalog and Privacera, see real-world use cases demonstrating data security and access control best practices, and find out how to plan for and implement enterprise data security governance at scale across your entire Databricks Lakehouse. Talk by: Don Bosco Durai |
|
|
Databricks As Code: Effectively Automate a Secure Lakehouse Using Terraform for Resource Provisioning
2023-07-25 · 23:12
At Rivian, we have automated more than 95% of our Databricks resource provisioning workflows using an in-house Terraform module, affording us a lean admin team to manage over 750 users. In this session, we will cover the following elements of our approach and how others can benefit from improved team efficiency.
Talk by: Jason Shiverick and Vadivel Selvaraj |
|