talk-data.com
Event
ETL Strategies in Microsoft Fabric - Pipelines, Dataflows & Notebooks
Activities tracked
0
๐ฏ The Problem: Your ETL jobs take 6 hours. Business users want data "as fresh as possible." You're juggling Dataflows, Notebooks, Pipelines, and now Mirroring - but which one is actually right for which scenario? ๐ก This Evening: We'll cut through the confusion with a clear decision framework. You'll see each tool in action and understand when to use Spark vs. Dataflows vs. Mirroring - with real-world examples. โฑ๏ธ Save yourself: Weeks of painful refactoring. Choose the right ETL approach from day one. What you'll learn: โ๏ธ Spark Environments - Managing libraries and configurations\, and WHY environment isolation matters for production ๐ Spark Job Definitions - Automating Spark jobs at scale\, and WHY scheduled jobs beat manual notebook runs ๐ Data Factory Pipelines - Enterprise orchestration with activities\, and WHY Fabric Pipelines are simpler than classic ADF ๐ช Database Mirroring - Near real-time replication from SQL Server\, Cosmos DB\, Snowflake\, and WHY mirroring beats traditional CDC approaches Decision Framework:
- Simple transformations โ Dataflow Gen2
- Complex logic, ML โ Notebook โ Spark Job
- Orchestration, dependencies โ Data Pipeline
- Real-time sync from source โ Mirroring
Who should attend: Data Engineers building ETL pipelines, DBAs managing data integration, Architects designing data platforms Agenda:
- 18:30 - Welcome & Networking
- 18:45 - Environments & Spark Jobs
- 19:10 - Data Factory Pipelines Deep Dive
- 19:35 - Database Mirroring - The Game Changer
- 19:50 - ETL Decision Framework
- 19:55 - Q&A and Discussion
- 20:00 - Networking
Sessions & talks
Showing 1โ0 of 0 ยท Newest first
No individual activities are attached to this event yet.