talk-data.com talk-data.com

Topic

Data Modelling

data_governance data_quality metadata_management

355

tagged

Activity Trend

18 peak/qtr
2020-Q1 2026-Q1

Activities

355 activities · Newest first

Universal Data Modeling

Most data professionals work with multiple datasets scattered across teams, systems, and formats. But without a clear modeling strategy, the result is often chaos: mismatched schemas, fragile pipelines, and a constant fight to make sense of the noise. This essential guide offers a better way by introducing a practical framework for designing high-quality data models that work across platforms while supporting the growing demands of AI, analytics, and real-time systems. Author Jun Shan bridges the gap between disconnected modeling approaches and the need for a unified, system-agnostic methodology. Whether you're building a new data platform or rethinking legacy infrastructure, Universal Data Modeling gives you the clarity, patterns, and tools to model data that's consistent, resilient, and ready to scale. Connect conceptual, logical, and physical modeling phases with confidence Apply best-fit techniques across relational, semistructured, and NoSQL formats Improve data quality, clarity, and maintainability across your organization Support modern design paradigms like data mesh and data products Translate domain knowledge into models that empower teams Build flexible, scalable models that stand the test of technology change

Microsoft Power BI Quick Start Guide - Fourth Edition

Bring your data to life with the ultimate beginner's guide to Power BI, now featuring Microsoft Fabric, Copilot, and full-color visuals to make learning data modeling, storytelling, and dashboards easier and faster than ever Key Features Build data literacy and gain confidence using Power BI through real-world, beginner-friendly examples Learn to shape, clean, and model data using Power BI Desktop and Power Query, with zero experience required Build vibrant, accurate reports and dashboards with real-world modeling examples Book Description Updated with the latest innovations in Power BI, including integration with Microsoft Fabric for seamless data unification and Copilot for AI-powered guidance. This comprehensive guide empowers you to build compelling reports and dashboards from the ground up. Whether you're new to Power BI or stepping into a data role, this book provides a friendly, approachable introduction to business intelligence and data storytelling You'll start with the Power BI Desktop interface and its core functionality, then move into shaping and cleaning your data using the Power Query Editor. From designing intuitive data models to writing your first DAX formulas, you’ll develop practical skills that apply directly to real-world scenarios. he book emphasizes how to use visualizations and narrative techniques to turn numbers into meaningful insights The chapters focus on hands-on, real-world examples—like analyzing sales trends, tracking KPIs, and cleaning messy data. You'll learn to build and refresh reports, scale your Power BI setup, and enhance your solutions using Microsoft Fabric and Copilot. Fabric unifies analytics across your organization, while Copilot speeds up your workflow with AI-driven insights and report suggestions By the end of the book, you’ll have the confidence and experience to turn raw data into insightful, impactful dashboards What you will learn Understand why data literacy matters in decision-making and careers Connect to data using import, DirectQuery, and live connection modes Clean and transform data using Power Query Editor and dataflows Design reports with visuals that support clear data storytelling Apply row-level security to enforce access and data protection Manage and monitor Power BI cloud for scalability and teamwork Use AI tools like Copilot to speed up prep and generate insights Learn Microsoft Fabric basics to enable unified data experiences Who this book is for This book is ideal for anyone looking to build a solid foundation in Power BI, regardless of prior experience. Whether you're just starting out or stepping into a new role that involves data, you'll find clear, approachable guidance throughout. The step-by-step tutorials and real-world examples make it easy to follow along—even if it’s your first time working with business intelligence tools

Financial Modeling and Reporting with Microsoft Power BI

Design powerful financial reports in Power BI by building models, measures, and dashboards tailored for real-world accounting and analytics Key Features Build a complete financial data model from ledgers, journals, and budgets Master DAX for income statements, KPIs, and performance analysis Learn Power BI Paginated and AI tools for printable and predictive reporting Purchase of the print or Kindle book includes a free PDF eBook Book Description Power BI for Financial Reporting is the definitive guide to designing high-performance, flexible, and insightful financial reports using Power BI. This book empowers finance and BI professionals to create everything from trial balances to enterprise-wide performance dashboards with ease and precision. The book starts by helping you define your reporting goals and data sources, mapping these needs to Power BI’s capabilities. You’ll then build a core financial data model—covering ledger transactions, charts of accounts, and multi-company support. As you proceed, you’ll integrate complex DAX measures, handle foreign exchange and journal entries, and extend your model with budgeting and inventory data. Each chapter builds toward a comprehensive suite of reports, complete with visual best practices and tested metrics. You’ll learn to streamline datasets using Power Query, test for data integrity, and generate printable reports via Power BI Paginated. The final chapters dive into using AI, predictive analytics, and Microsoft Fabric to future-proof your reporting. Whether you're consolidating data across systems or evolving your reports for changing business needs, this hands-on guide ensures you’re prepared to meet the demands of modern finance. What you will learn Build core financial models from ledgers and accounts Create Trial Balance and Income Statements using DAX Optimize Power BI with Power Query and data transformation Add budgets, targets, and KPIs to performance dashboards Integrate inventory data for nuanced stock reporting Produce printable reports using Power BI Paginated Apply AI for report generation and predictive analytics Test, tune, and evolve reports for secure, scalable use Who this book is for This book is for finance professionals, accountants, financial analysts, and BI developers who want to leverage Power BI to improve, automate, and future-proof their financial reporting. Whether consolidating data from ERPs, building reports across entities, or exploring advanced Power BI features, this book equips readers with practical skills and strategic insight.

SAP ABAP 7.5 Optimization for HANA: AMDP, CDS and Native SQL for Peak Performance

In the evolving landscape of SAP development, performance is no longer just a nice-to-have—it's a necessity. With the power of SAP HANA and the enhancements introduced in ABAP 7.5, developers are now equipped to rethink how applications are built, executed, and optimized. This book is your guide to that transformation. We begin by understanding the core shift: moving data-intensive operations directly into the HANA database. When implemented correctly, this "code pushdown" philosophy dramatically reduces data transfer and processing overhead. AMDP (ABAP Managed Database Procedures), our in-database processing engine, enables us to write complex logic directly in SQLScript, harnessing HANA’s parallel processing capabilities. We focus on crafting efficient AMDP procedures by adopting set-based operations and minimizing unnecessary data movement. Next, we explore Core Data Services (CDS) Views, our go-to data modeling tool. CDS Views are not just simple database views; they act as semantic layers that define how our applications interact with data. We learn to create optimized CDS Views by leveraging associations, annotations, and table functions, enabling us to build reusable, high-performance data models. These views simplify complex queries, improve data consistency, and enhance application flexibility. We then turn to Native SQL, our direct line to the HANA database. While AMDP and CDS Views provide powerful abstractions, Native SQL offers ultimate control for specialized tasks. We embed Native SQL within AMDP procedures to access database-specific features and fine-tune performance for critical operations. Along the way, we apply best practices for writing efficient queries, with a strong focus on indexing, join strategies, and precise data filtering. Throughout this journey, we emphasize the importance of rigorous testing and proactive monitoring. Just like a race car undergoes extensive testing before hitting the track, our ABAP applications require careful validation to ensure accuracy and optimal performance. We explore techniques for unit testing AMDP procedures, validating CDS Views, and monitoring query performance. We also look at strategies for detecting and addressing potential bottlenecks before they affect end users. SAP ABAP 7.5 Optimization for HANA is not just about writing faster code—it’s about fundamentally rethinking how we develop applications. By embracing code pushdown, leveraging AMDP, CDS Views, and Native SQL, and implementing robust testing and monitoring strategies, we build ABAP applications that are not only faster, but also more scalable, maintainable, and adaptable to the ever-evolving demands of modern business. You Will: Learn how to implement the "code pushdown" philosophy, moving data-intensive operations directly into the HANA database to reduce data transfer and processing overhead Understand to create optimized CDS Views, leveraging associations, annotations, and table functions to build reusable, high-performance data models that simplify complex queries and improve data consistency. Explore how to write complex logic directly in SQLScript using AMDP, harnessing HANA's parallel processing capabilities, and using Native SQL for specialized tasks, accessing database-specific features to optimize performance. This Book is For: ABAP Developers, SAP Consultants and Architects and IT Managers and Technical Leads

Microsoft Power BI Step by Step

Are you ready to turn your data into powerful insights and make smarter business decisions? Microsoft Power BI Step by Step is your hands-on guide to mastering one of todays most in-demand business intelligence tools. Written by certified Power BI experts Nuric Ugarte and José Rafael Escalanteleaders in the Power BI community, with years of real-world consulting and teaching experiencethis book takes you from your very first steps in Power BI all the way to advanced data modeling, DAX calculations, and sharing interactive reports with your team. Whether youre a business analyst, data professional, Excel power user, or IT decision-maker, youll find clear, step-by-step instructions and practical exercises that make learning Power BI approachable and effective. Youll learn how to connect to a wide range of data sources, clean and transform your data, create stunning visualizations, and collaborate securely in the Power BI Service. Plus, youll discover how to use the latest features, including Copilot, to streamline your workflow and get answers faster. If you want to build your data skills, impress your organization, and unlock the full potential of Power BI, this is the book you need to get thereone step at a time.

Hands-On Software Engineering with Python - Second Edition

Grow your software engineering discipline, incorporating and mastering design, development, testing, and deployment best practices examples in a realistic Python project structure. Key Features Understand what makes Software Engineering a discipline, distinct from basic programming Gain practical insight into updating, refactoring, and scaling an existing Python system Implement robust testing, CI/CD pipelines, and cloud-ready architecture decisions Book Description Software engineering is more than coding; it’s the strategic design and continuous improvement of systems that serve real-world needs. This newly updated second edition of Hands-On Software Engineering with Python expands on its foundational approach to help you grow into a senior or staff-level engineering role. Fully revised for today’s Python ecosystem, this edition includes updated tooling, practices, and architectural patterns. You’ll explore key changes across five minor Python versions, examine new features like dataclasses and type hinting, and evaluate modern tools such as Poetry, pytest, and GitHub Actions. A new chapter introduces high-performance computing in Python, and the entire development process is enhanced with cloud-readiness in mind. You’ll follow a complete redesign and refactor of a multi-tier system from the first edition, gaining insight into how software evolves—and what it takes to do that responsibly. From system modeling and SDLC phases to data persistence, testing, and CI/CD automation, each chapter builds your engineering mindset while updating your hands-on skills. By the end of this book, you'll have mastered modern Python software engineering practices and be equipped to revise and future-proof complex systems with confidence. What you will learn Distinguish software engineering from general programming Break down and apply each phase of the SDLC to Python systems Create system models to plan architecture before writing code Apply Agile, Scrum, and other modern development methodologies Use dataclasses, pydantic, and schemas for robust data modeling Set up CI/CD pipelines with GitHub Actions and cloud build tools Write and structure unit, integration, and end-to-end tests Evaluate and integrate tools like Poetry, pytest, and Docker Who this book is for This book is for Python developers with a basic grasp of software development who want to grow into senior or staff-level engineering roles. It’s ideal for professionals looking to deepen their understanding of software architecture, system modeling, testing strategies, and cloud-aware development. Familiarity with core Python programming is required, as the book focuses on applying engineering principles to maintain, extend, and modernize real-world systems.

The Definitive Guide to DAX: Mastering the semantic model expression language for Microsoft Power BI, Fabric, and Excel, 3rd Edition

Seasoned Experts Alberto Ferrari and Marco Russo Help You Master DAX for Superior Business Intelligence Solutions The Definitive Guide to DAX is an authoritative resource for mastering the DAX language, which is pivotal for creating measures for semantic models in Microsoft Power BI, Fabric, Analysis Services, and Excel. With years of experience since the inception of Power Pivot, the authors offer a comprehensive reference that navigates through the unique and complex concepts of DAX, making it an essential tool for BI professionals. This third edition updates you with the latest features and provides a framework for writing efficient DAX code, enhancing your ability to customize calculations in data models. You will gain the skills necessary to understand and apply advanced DAX concepts, structure code in user-defined functions, and leverage new calendars for time intelligence calculations. By reading this book, you will: Understand the foundational concepts of DAX and its unique language structure Master the use of evaluation contexts to enhance data analysis Utilize CALCULATE and CALCULATETABLE functions effectively Implement variables to simplify complex DAX expressions Leverage classic and new calendar based time intelligence functions Work with iterators to perform advanced calculations Apply visual calculations to improve report interactivity Create and manage calculation groups for dynamic reporting Handle complex hierarchies and relationships in data models Author DAX queries for testing and troubleshooting About This Book For BI professionals and Excel power users eager to deepen their understanding of DAX and enhance their data modeling capabilities with advanced techniques For data analysts and IT specialists seeking to optimize their use of Microsoft Power BI, Fabric, Analysis Services, and Excel for more efficient and insightful data analysis

AWS re:Invent 2025 - Advanced data modeling for Amazon ElastiCache (DAT438)

This session delves into the intricacies of Amazon ElastiCache data modeling using the purpose-built Valkey data types to optimize application performance and scalability. Explore the use of strings, sets, sorted sets, hashes, bitmaps, and geospatial indexes to model complex relationships and solve use cases such as caching, session store, feature store, real-time analytics, geospatial applications, and rate limiters.

Learn more: More AWS events: https://go.aws/3kss9CP

Subscribe: More AWS videos: http://bit.ly/2O3zS75 More AWS events videos: http://bit.ly/316g9t4

ABOUT AWS: Amazon Web Services (AWS) hosts events, both online and in-person, bringing the cloud computing community together to connect, collaborate, and learn from AWS experts. AWS is the world's most comprehensive and broadly adopted cloud platform, offering over 200 fully featured services from data centers globally. Millions of customers—including the fastest-growing startups, largest enterprises, and leading government agencies—are using AWS to lower costs, become more agile, and innovate faster.

AWSreInvent #AWSreInvent2025 #AWS

AWS re:Invent 2025 - Advanced data modeling with Amazon DynamoDB (DAT414)

Amazon DynamoDB is a popular choice for modern applications because it’s a serverless database that provides single-digit millisecond performance at any scale. Optimizing your usage of DynamoDB requires a different approach to data modeling than traditional relational databases. In this session, AWS Data Hero Alex DeBrie shows you advanced techniques to help you get the most out of DynamoDB. Learn how to “think in DynamoDB” by learning the DynamoDB foundations and principles for data modeling. Further, learn practical strategies and DynamoDB features to handle difficult use cases in your application.

Learn more: More AWS events: https://go.aws/3kss9CP

Subscribe: More AWS videos: http://bit.ly/2O3zS75 More AWS events videos: http://bit.ly/316g9t4

ABOUT AWS: Amazon Web Services (AWS) hosts events, both online and in-person, bringing the cloud computing community together to connect, collaborate, and learn from AWS experts. AWS is the world's most comprehensive and broadly adopted cloud platform, offering over 200 fully featured services from data centers globally. Millions of customers—including the fastest-growing startups, largest enterprises, and leading government agencies—are using AWS to lower costs, become more agile, and innovate faster.

AWSreInvent #AWSreInvent2025 #AWS

AWS re:Invent 2025 - Data modeling core concepts for Amazon DynamoDB (DAT311)

Join this session to learn the core concepts of Amazon DynamoDB data modeling. Explore best practices for common access patterns used by DynamoDB customers for applications that need consistent, fast performance at any scale. Developers experienced with DynamoDB can learn best practices and trade-offs to make when deciding on single-table and multi-table designs, indexing strategies, and more.

Learn more: More AWS events: https://go.aws/3kss9CP

Subscribe: More AWS videos: http://bit.ly/2O3zS75 More AWS events videos: http://bit.ly/316g9t4

ABOUT AWS: Amazon Web Services (AWS) hosts events, both online and in-person, bringing the cloud computing community together to connect, collaborate, and learn from AWS experts. AWS is the world's most comprehensive and broadly adopted cloud platform, offering over 200 fully featured services from data centers globally. Millions of customers—including the fastest-growing startups, largest enterprises, and leading government agencies—are using AWS to lower costs, become more agile, and innovate faster.

AWSreInvent #AWSreInvent2025 #AWS

Power BI for Finance

Build effective data models and reports in Power BI for financial planning, budgeting, and valuations with practical templates, logic, and step-by-step guidance. Free with your book: DRM-free PDF version + access to Packt's next-gen Reader Key Features Engineer optimal star schema data models for financial planning and analysis Implement common financial logic, calendars, and variance calculations Create dynamic, formatted reports for income statements, balance sheets, and cash flow Purchase of the print or Kindle book includes a free PDF eBook Book Description Martin Kratky brings his global experience of over 20 years as co-founder of Managility and creator of Acterys to empower CFOs and accountants with Power BI for Finance through this hands-on guide to streamlining and enhancing financial processes. Starting with the foundation of every effective BI solution, a well-designed data model, the book shows you how to structure star schemas and integrate common financial data sources like ERP and accounting systems. You’ll then learn to implement key financial logic using DAX and M, covering calendars, KPIs, and variance calculations. The book offers practical advice on creating clear and compliant financial reports, such as income statements, balance sheets, and cash flows with visual design and formatting best practices. With dedicated chapters on advanced workflows, you’ll learn how to handle multi-currency setups, perform group consolidations, and implement planning models like rolling forecasts, annual budgets, and sales and operations planning (S&OP). As you advance, you’ll gain insights from real-world case studies covering company valuations, Excel integration, and the use of write-back methods with Dynamics Business Performance Planning and Acterys. The concluding chapters highlight how AI and Copilot enhance financial analytics. Email sign-up and proof of purchase required What you will learn Apply multi-currency handling and group consolidation techniques in Power BI Model discounted cash flow and company valuation scenarios Design and manage write-back workflows with Dynamics BPP and Acterys Integrate Excel and Power BI using live connections and cube formulas Utilize AI, Copilot, and LLMs to enhance automation and insight generation Create complete finance-focused dashboards for sales and operations planning Who this book is for This book is for finance professionals including CFOs, FP&A managers, controllers, and certified accountants who want to enhance reporting, planning, and forecasting using Power BI. Basic familiarity with Power BI and financial concepts is recommended to get the most out of this hands-on guide.

Building Machine Learning Systems with a Feature Store

Get up to speed on a new unified approach to building machine learning (ML) systems with a feature store. Using this practical book, data scientists and ML engineers will learn in detail how to develop and operate batch, real-time, and agentic ML systems. Author Jim Dowling introduces fundamental principles and practices for developing, testing, and operating ML and AI systems at scale. You'll see how any AI system can be decomposed into independent feature, training, and inference pipelines connected by a shared data layer. Through example ML systems, you'll tackle the hardest part of ML systems--the data, learning how to transform data into features and embeddings, and how to design a data model for AI. Develop batch ML systems at any scale Develop real-time ML systems by shifting left or shifting right feature computation Develop agentic ML systems that use LLMs, tools, and retrieval-augmented generation Understand and apply MLOps principles when developing and operating ML systems

To be clear - I'm not saying that analytics and data engineering are a fad. I'm not saying the data teams are doomed to fade away, or that the old fundamentals of data modeling are wrong, or that the urge to quantify everything is a mistake. I'm saying that things seem pretty good, right now. But, you know. Like Charles Schwab constantly says, past performance is no guarantee of future results. So someone else might say all of that in the future - because, as John Maynard Keynes said, in the long run, we are all dead.

For years, data engineering was a story of predictable "pipelines": move data from point A to point B. But AI just hit the reset button on our entire field. Now, we're all staring into the void, wondering what's next. While the fundamentals haven't changed, data remains challenging in the traditional areas of data governance, data management, and data modeling, which still present challenges. Everything else is up for grabs. This talk will cut through the noise and explore the future of data engineering in an AI-driven world. We'll examine how team structures will evolve, why agentic workflows and real-time systems are becoming non-negotiable, and how our focus must shift from building dashboards and analytics to architecting for automated action. The reset button has been pushed. It's time for us to invent the future of our industry.

Start with a dataset in Motherduck and build a production-ready analytics app using Omni’s semantic model and APIs. We’ll cover practical data modeling techniques, share lessons learned from building AI features, and walk through how to give AI the context it needs to answer questions accurately. You’ll leave with a working app and the skills to build your next one.

Microsoft Power Platform Solutions Architect's Handbook - Second Edition

Dive into 'Microsoft Power Platform Solution Architect's Handbook' to master the art of designing and delivering enterprise-grade solutions using Microsoft's cutting-edge Power Platform. Through a mix of practical examples and hands-on tutorials, this book equips you to harness tools like AI, Copilot, and DevOps for building innovative, scalable applications tailored to enterprise needs. What this Book will help me do Acquire the knowledge to effectively utilize AI tools such as Power Platform Copilot and ChatGPT to enhance application intelligence. Understand and apply enterprise-grade solution architecture principles for scalable and secure application development. Gain expertise in integrating heterogenous systems with Power Platform Pipes and third-party APIs. Develop proficiency in creating and maintaining reusable Dataverse data models. Learn to establish and manage a Center of Excellence to govern and scale Power Platform solutions. Author(s) Hugo Herrera is an experienced solution architect specializing in the Microsoft Power Platform with a deep focus on integrating AI and cloud-native strategies. With years of hands-on experience in enterprise software development and architectural design, Hugo brings real-world insights into his writing, emphasizing practical application of advanced concepts. His approach is clear, structured, and aimed at empowering readers to excel. Who is it for? This book is tailored for IT professionals like solution architects, enterprise architects, and technical consultants who are looking to elevate their capabilities in Power Platform development. It is also suitable for individuals with an intermediate understanding of Power Platform seeking to spearhead enterprise-level digital transformation projects. Ideal readers are those ready to deepen their integration, data modeling, and AI usage skills within the Microsoft ecosystem, particularly for enterprise applications.

This course will focus on leveraging the semantic layer to build and consume metrics from dbt. We will start with an initial dbt project and leverage dbt Copilot to create the necessary logic to power your semantic layer. After defining and building these assets, we will configure partner tools to utilize single source of truth reporting: driving collaboration across data consumers in your team. Prerequisites: The prerequisites for this course include: dbt Fundamentals, specifically data modeling and model configurations What to bring: You will need to bring your own laptop to complete the hands-on exercises. We will provide all the other sandbox environments for dbt and a data platform.

In this course, learn how to manage and monitor data platform costs using dbt's built-in tools. We’ll cover how to surface warehouse usage data, set up basic monitoring, and apply rule-based recommendations to optimize performance. You’ll also explore how cost insights fit naturally into the developer workflow—equipping you to make smarter decisions without leaving dbt. This course is for analytics engineers, data analysts, and data platform owners who have a foundational understanding of dbt and want to build more cost-effective data pipelines. Using these cost management and orchestration strategies, the internal dbt Labs Analytics team achieved significant savings: Our cloud compute bill was reduced by 9% by simply implementing dbt Fusion and state-aware orchestration. By understanding the impact of models on platform costs, the team reduced the number of models built in scheduled jobs by 35% and shaved 20% off of job execution times. After this course, you will be able to: Articulate how dbt development patterns impact data platform costs. Configure dbt Cloud to monitor warehouse compute spend. Use the dbt Cost Management dashboard to identify high-cost models and jobs. Apply specific optimization techniques, from materializations to advanced data modeling patterns, to reduce warehouse costs. Implement proactive strategies like dbt Fusion and state-aware orchestration to prevent future cost overruns. Prerequisites for this course include: dbt fundamentals What to bring: You will need to bring your own laptop to complete the hands-on exercises. We will provide all the other sandbox environments for dbt and data platform. Duration: 2 hours Fee: $200 Trainings and certifications are not offered separately and must be purchased with a Coalesce pass Trainings and certifications are not available for Coalesce Online passes

This is the introductory course for developers jumping into dbt! We will dive into data modeling, sources, data tests, documentation, and deployment. As an instructor-led course, you’ll have the chance to learn with peers, ask questions, and get live coaching and feedback. After this course, you will be able to: Explain the foundational concepts of dbt Build data models and a DAG to visualize dependencies Configure tests and add documentation to your models Deploy your dbt project to refresh data models on a schedule Prerequisites for this course: Intermediate SQL knowledge What to bring: You must bring your own laptop to complete the hands-on exercises. We will provide all the other sandbox environments for dbt and data platform. Duration: 4 hours Fee: $400 Trainings and certifications are not offered separately and must be purchased with a Coalesce pass Trainings and certifications are not available for Coalesce Online passes

Thomas in't Veld, founder of Tasman Analytics, joined Yuliia and Dumke to discuss why data projects fail: teams obsess over tooling while ignoring proper data modeling and business alignment. Drawing from building analytics for 70-80 companies, Thomas explains why the best data model never changes unless the business changes, and how his team acts as "data therapists" forcing marketing and sales to agree on fundamental definitions. He shares his controversial take that data modeling sits more in analysis than engineering. Another hot take: analytics engineering is merging back into data engineering, and why showing off your DAG at meetups completely misses the point - business understanding is the critical differentiator, not your technology stack.