talk-data.com talk-data.com

Filter by Source

Select conferences and events

People (218 results)

See all 218 →

Companies (3 results)

Back Market 1 speaker
Data Science and Machine Learning Engineer
BlackRock 2 speakers
PhD, Director of Data Science
Black Duck 1 speaker
Strategic Account Manager
Showing 8 results

Activities & events

Title & Speakers Event
Next at Night 2025-04-11 · 02:00
Tate Renner – Special performance , The Killers – Featured performers , Wyclef Jean – Special performance

Doors open at 6:30 pm, the show starts promptly at 7 pm.

Get ready to rock out to the iconic sounds of The Killers, with special performances by the legendary Wyclef Jean and rising star Tate Renner. Next at Night is an exclusive concert for Next attendees, and a perfect way to unwind after a day of learning. Come and experience the energy of Las Vegas under the neon lights at Allegiant Stadium.

The Killers are a Las Vegas-based four-piece who formed in 2002, featuring the talents of singer/keyboardist Brandon Flowers, drummer Ronnie Vannucci, guitarist Dave Keuning, and bassist Mark Stoermer. The band has received countless accolades for their artistic achievement, including multiple Grammy nominations, American Music Award nominations, MTV Video Music Awards, NME Awards, and more.

Wyclef Jean is a Grammy Award-winning musician and producer with decades of experience in the music industry, known for his work with the Fugees and his successful solo career. Beyond music, he’s a versatile entrepreneur involved in podcasts, jingles, and scoring for TV shows like Showtime’s “The Chi.” Wyclef is also dedicated to social causes, partnering with organizations like TIAA to promote financial literacy and empower underserved youth.

Tate Renner is a rising singer/songwriter with a unique blend of country and rock influences, shaped by his upbringing in Texas and Chicago. He gained national attention on Season 26 of NBC’s “The Voice” as a member of Reba McEntire’s team. Now based in Nashville, Tate is dedicated to writing, recording, and performing his original music, with his debut single “Scars” produced by country music legend Clint Black.

Google Cloud Next '25

Summary

Architectural decisions are all based on certain constraints and a desire to optimize for different outcomes. In data systems one of the core architectural exercises is data modeling, which can have significant impacts on what is and is not possible for downstream use cases. By incorporating column-level lineage in the data modeling process it encourages a more robust and well-informed design. In this episode Satish Jayanthi explores the benefits of incorporating column-aware tooling in the data modeling process.

Announcements

Hello and welcome to the Data Engineering Podcast, the show about modern data management RudderStack helps you build a customer data platform on your warehouse or data lake. Instead of trapping data in a black box, they enable you to easily collect customer data from the entire stack and build an identity graph on your warehouse, giving you full visibility and control. Their SDKs make event streaming from any app or website easy, and their extensive library of integrations enable you to automatically send data to hundreds of downstream tools. Sign up free at dataengineeringpodcast.com/rudderstack- Your host is Tobias Macey and today I'm interviewing Satish Jayanthi about the practice and promise of building a column-aware data architecture through intentional modeling

Interview

Introduction How did you get involved in the area of data management? How has the move to the cloud for data warehousing/data platforms influenced the practice of data modeling?

There are ongoing conversations about the continued merits of dimensional modeling techniques in modern warehouses. What are the modeling practices that you have found to be most useful in large and complex data environments?

Can you describe what you mean by the term column-aware in the context of data modeling/data architecture?

What are the capabilities that need to be built into a tool for it to be effectively column-aware?

What are some of the ways that tools like dbt miss the mark in managing large/complex transformation workloads? Column-awareness is obviously critical in the context of the warehouse. What are some of the ways that that information can be fed into other contexts? (e.g. ML, reverse ETL, etc.) What is the importance of embedding column-level lineage awareness into transformation tool vs. layering on top w/ dedicated lineage/metadata tooling? What are the most interesting, innovative, or unexpected ways that you have seen column-aware data modeling used? What are the most interesting, unexpected, or challenging lessons that you have learned while working on building column-aware tooling? When is column-aware modeling the wrong choice? What are some additional resources that you recommend for individuals/teams who want to learn more about data modeling/column aware principles?

Contact Info

LinkedIn

Parting Question

From your perspective, what is the biggest gap in the tooling or technology for data management today?

Closing Announcements

Thank you for listening! Don't forget to check out our other shows. Podcast.init covers the Python language, its community, and the innovative ways it is being used. The Machine Learning Podcast helps you go from idea to production with machine learning. Visit the site to subscribe to the show, sign up for the mailing list, and read the show notes. If you've learned something or tried out a project from the show then tell us about it! Email [email protected]) with your story. To help other people find the show please leave a review on Apple Podcasts and tell your friends and co-workers

Links

Coalesce

Podcast Episode

Star Schema Conformed Dimensions Data Vault

The intro and outro music is from The Hug by The Freak Fandango Orchestra / CC BY-SA Sponsored By: Rudderstack: Rudderstack

RudderStack provides all your customer data pipeli

AI/ML CDP Cloud Computing Data Engineering Data Lake Data Management Data Modelling Data Vault dbt DWH ETL/ELT dimensional modeling Python Data Streaming
Mark Van de Wiel – guest @ Fivetran , Tobias Macey – host

Summary Data integration from source systems to their downstream destinations is the foundational step for any data product. With the increasing expecation for information to be instantly accessible, it drives the need for reliable change data capture. The team at Fivetran have recently introduced that functionality to power real-time data products. In this episode Mark Van de Wiel explains how they integrated CDC functionality into their existing product, discusses the nuances of different approaches to change data capture from various sources.

Announcements

Hello and welcome to the Data Engineering Podcast, the show about modern data management When you’re ready to build your next pipeline, or want to test out the projects you hear about on the show, you’ll need somewhere to deploy it, so check out our friends at Linode. With their new managed database service you can launch a production ready MySQL, Postgres, or MongoDB cluster in minutes, with automated backups, 40 Gbps connections from your application hosts, and high throughput SSDs. Go to dataengineeringpodcast.com/linode today and get a $100 credit to launch a database, create a Kubernetes cluster, or take advantage of all of their other services. And don’t forget to thank them for their continued support of this show! You wake up to a Slack message from your CEO, who’s upset because the company’s revenue dashboard is broken. You’re told to fix it before this morning’s board meeting, which is just minutes away. Enter Metaplane, the industry’s only self-serve data observability tool. In just a few clicks, you identify the issue’s root cause, conduct an impact analysis⁠—and save the day. Data leaders at Imperfect Foods, Drift, and Vendr love Metaplane because it helps them catch, investigate, and fix data quality issues before their stakeholders ever notice they exist. Setup takes 30 minutes. You can literally get up and running with Metaplane by the end of this podcast. Sign up for a free-forever plan at dataengineeringpodcast.com/metaplane, or try out their most advanced features with a 14-day free trial. Mention the podcast to get a free "In Data We Trust World Tour" t-shirt. RudderStack helps you build a customer data platform on your warehouse or data lake. Instead of trapping data in a black box, they enable you to easily collect customer data from the entire stack and build an identity graph on your warehouse, giving you full visibility and control. Their SDKs make event streaming from any app or website easy, and their state-of-the-art reverse ETL pipelines enable you to send enriched data to any cloud tool. Sign up free… or just get the free t-shirt for being a listener of the Data Engineering Podcast at dataengineeringpodcast.com/rudder. Data teams are increasingly under pressure to deliver. According to a recent survey by Ascend.io, 95% in fact reported being at or over capacity. With 72% of data experts reporting demands on their team going up faster than they can hire, it’s no surprise they are increasingly turning to automation. In fact, while only 3.5% report having current investments in automation, 85% of data teams plan on investing in automation in the next 12 months. 85%!!! That’s where our friends at Ascend.io come in. The Ascend Data Automation Cloud provides a unified platform for data ingestion, transformation, orchestration, and observability. Ascend users love its declarative pipelines, powerful SDK, elegant UI, and extensible plug-in architecture, as well as its support for Python, SQL, Scala, and Java. Ascend automates workloads on Snowflake, Databricks, BigQuery, and open source Spark, and can be deployed in AWS, Azure, or GCP. Go to dataengineeringpodcast.com/ascend and sign up for a free trial. If you’re a data engineering podcast listener, you get credits worth $5,000 when you become a customer. Your host is Tobias Macey and today I’m interviewing Mark Van de Wiel about Fivetran’s implementation of chang

Analytics AWS Azure BigQuery CDP Cloud Computing Dashboard Data Engineering Data Lake Data Management Data Quality Databricks ETL/ELT Fivetran GCP Java Kubernetes MongoDB MySQL postgresql Python Scala Snowflake Spark SQL Data Streaming
Val Kroll – host , Julie Hoyer – host , Michael Helbling – host , Tim Wilson – host @ Analytics Power Hour - Columbus (OH , Moe Kiss – host , Barr Moses – CEO and co-founder @ Monte Carlo

You know that sinking feeling: the automated report went out first thing Monday morning, and your Slack messages have been blowing up ever since because revenue flatlined on Saturday afternoon! You frantically start digging in (spilling your coffee in the process!) while you're torn between hoping that it's "just a data issue" (which would be good for the company but a black mark on the data team) and that it's a "real issue with the site" (not good for the business, but at least your report was accurate!). Okay. So, maybe you've never had that exact scenario, but we've all dealt with data breakages occurring in various unexpected nooks and crannies of our data ecosystem. It can be daunting to make a business case to invest in monitoring and observing all the various data pipes and tables to proactively identify data issues. But, as our data gets broader and deeper and more business-critical, can we afford not to? On this episode, we were joined by Barr Moses, co-founder and CEO of Monte Carlo to chat about practical strategies and frameworks for monitoring data and reducing data downtime! For complete show notes, including links to items mentioned in this episode and a transcript of the show, visit the show page.

Monte Carlo
The Analytics Power Hour
Mark Sears – CEO @ CloudFactory , Tobias Macey – host

Summary Successful machine learning and artificial intelligence projects require large volumes of data that is properly labelled. The challenge is that most data is not clean and well annotated, requiring a scalable data labeling process. Ideally this process can be done using the tools and systems that already power your analytics, rather than sending data into a black box. In this episode Mark Sears, CEO of CloudFactory, explains how he and his team built a platform that provides valuable service to businesses and meaningful work to developing nations. He shares the lessons learned in the early years of growing the business, the strategies that have allowed them to scale and train their workforce, and the benefits of working within their customer’s existing platforms. He also shares some valuable insights into the current state of the art for machine learning in the real world.

Announcements

Hello and welcome to the Data Engineering Podcast, the show about modern data management When you’re ready to build your next pipeline, or want to test out the projects you hear about on the show, you’ll need somewhere to deploy it, so check out our friends at Linode. With 200Gbit private networking, scalable shared block storage, and a 40Gbit public network, you’ve got everything you need to run a fast, reliable, and bullet-proof data platform. If you need global distribution, they’ve got that covered too with world-wide datacenters including new ones in Toronto and Mumbai. And for your machine learning workloads, they just announced dedicated CPU instances. Go to dataengineeringpodcast.com/linode today to get a $20 credit and launch a new server in under a minute. And don’t forget to thank them for their continued support of this show! Integrating data across the enterprise has been around for decades – so have the techniques to do it. But, a new way of integrating data and improving streams has evolved. By integrating each silo independently – data is able to integrate without any direct relation. At CluedIn they call it “eventual connectivity”. If you want to learn more on how to deliver fast access to your data across the enterprise leveraging this new method, and the technologies that make it possible, get a demo or presentation of the CluedIn Data Hub by visiting dataengineeringpodcast.com/cluedin. And don’t forget to thank them for supporting the show! You listen to this show to learn and stay up to date with what’s happening in databases, streaming platforms, big data, and everything else you need to know about modern data management.For even more opportunities to meet, listen, and learn from your peers you don’t want to miss out on this year’s conference season. We have partnered with organizations such as O’Reilly Media, Dataversity, and the Open Data Science Conference. Coming up this fall is the combined events of Graphorum and the Data Architecture Summit. The agendas have been announced and super early bird registration for up to $300 off is available until July 26th, with early bird pricing for up to $200 off through August 30th. Use the code BNLLC to get an additional 10% off any pass when you register. Go to dataengineeringpodcast.com/conferences to learn more and take advantage of our partner discounts when you register. Go to dataengineeringpodcast.com to subscribe to the show, sign up for the mailing list, read the show notes, and get in touch. To help other people find the show please leave a review on iTunes and tell your friends and co-workers Join the community in the new Zulip chat workspace at dataengineeringpodcast.com/chat Your host is Tobias Macey and today I’m interviewing Mark Sears about Cloud Factory, masters of the art and science of labeling data for Machine Learning and more

Interview

Introduction How did you get involved in the area of data management? Can you start by explaining what CloudFactory is and the story behind it? What are some of the common requirements

AI/ML Analytics Big Data Cloud Computing Data Engineering Data Management Data Science Data Streaming
Data Engineering Podcast
Event How Music Charts 2019-07-12
Vance Joy – Singer-songwriter , Mark Mulligan – Analyst @ Midia Research , Jason Joven – host @ Chartmetric , AC/DC – Rock band @ AC/DC , Steve Boom – Head of Amazon Music @ Amazon Music

Highlights  Who says music is all about young people and streaming? Amazon Music and American radio would beg to differ, and we’ll check out a couple of Australian artists who are doing well on them.Mission   Good morning, it’s Jason here at Chartmetric with your 3-minute Data Dump where we upload charts, artists and playlists into your brain so you can stay up on the latest in the music data world.We’re on the socials at “chartmetric”, that’s Chartmetric, no “S ”- follow us on Instagram, Twitter, Facebook or LinkedIn, and talk to us! We’d love to hear from you.FYI, we’re scaling back to 2 episodes per week, why? Because we’re working on some special projects that we will certainly tell you about over the next few months, but we need to make the time to do them! So don’t worry, your phone isn’t playing games with your heart….it’s just us and the Backstreet Boys.Having said all that….DateThis is your Data Dump for Friday, July 12th, 2019.Vance Joy and AC/DC on Amazon Music and US RadioThe Financial Times reported yesterday on the rise of Amazon Music, and how it has experienced a 70 percent growth in subscribers in the past year.The head of Amazon Music- Steve Boom (that’s a great name for a music guy)-  noted that all the other platforms were playing for the younger crowds, but not older consumers. Apparently 14 percent of subscribers to Amazon Music are aged 55 or older, compared with just 5 percent of Spotify’s customers, according to Midia Research’s Mark Mulligan.Now on the radio side of things, Music Business Worldwide reported that AM/FM US radio consumption is growing! Take that, streaming.Radio reached more folks than any other entertainment platform in 2019, according to Nielsen’s Audio Today 2019 report.272M Americans fire up their radios each week, that is 7M more listeners than 2016...and why? Because Americans love their cars, and radios are just there.Now to help illustrate that with actual artists, we’ll turn to two of Australia’s biggest ones, relative newcomer Vance Joy and classic rock gods AC/DC.Vance Joy, the pop/folk singer-songwriter from Melbourne is currently on19 Amazon editorial playlists, including the contextual playlists Rise and Shine, Road Trip: Folk and a chart-like playlist: Best Folk Songs of 2017.His massive hit “Riptide” is actually NOT the most playlisted on the platform, it’s actually another one of his records, “Lay It On Me”, placing in 9 of those 19 Amazon Music playlists.On the 300 influential American radio stations we cover, Joy had as many as 506 spins in the week of Sept 24th 2018, and the week of July 1st, it was down to 91.But it’s all good because the state of Wisconsin LOVES Vance Joy, as his songs have been 1% of all the tracks that state’s radio stations have played since September. Pretty impressive.Now for all-time rock greats AC/DC, straight out of Sydney:They are on 14 Amazon editorial playlists, including the #2 slot on Classic Rock for Lifting, the #5 spot for Pre-Game Grilling, and the #1 spot for 80s Hard Rock Workout...who’s feeling some testosterone?AC/DC hits like “You Shook Me All Night Long” and “Back in Black” seem to resonate most in Boston, Massachusetts and Gainesville, Florida…...but what’s really good to remember is that in case your phone runs out of battery, you can find either of these artists or others by flicking on the old car radio, or simply asking Alexa to do it for you.Outro That’s it for your Daily Data Dump for Friday, July 12th, 2019. This is Jason from Chartmetric.Free accounts are at chartmetric.comAnd article links and show notes are at: podcast.chartmetric.comHappy Friday, and we’ll see you next week! 

Singer Data Streaming
Jason Joven – host @ Chartmetric

HighlightsAmazon Music’s Brand New Music playlist features a few new Friday releases similar to Spotify and Apple, we’ve got the quick rundown for youMission   Good morning, it’s Jason here at Chartmetric with your 3-minute Data Dump where we upload charts, artists and playlists into your brain so you can stay up on the latest in the music data world.DateThis is your Data Dump for Monday May 20th 2019.New Music Friday Monday: Amazon’s Brand New Music playlistIt’s New Music Friday Monday and we’re looking at Amazon’s Brand New Music playlist that updated on Friday before the weekend.Brand New Music is essentially the tech giant’s version of Spotify’s New Music Friday, currently featuring 70 tracks from big stars to legacy acts to up and comers.In the #1 playlist position is the DJ Khaled’s latest track featuring John Legend and the late Nipsey Hussle on “Higher”. Halsey’s “Nightmare” pulls up in the #2 spot while The Black Keys make their comeback with the new single “Go”, coming into the #3 position.The three major labels have the most singles on the list this week, with UMG having 14, Sony at 11 and Warner Music at 7. However, there is no shortage of independent labels on the list either, with 4AD, 300 Entertainment and AOMG all representing with one track each.Genre-wise, about 30% of the playlist’s tracks feature the “pop” genre tag, the most out of all genres, with Madonna, Mark Ronson and Charli XCX feat. Lizzo in that group. “Hip-hop/rap” takes 2nd place with another 20% of the list’s sound with none other than Wu-Tang Clan debuting a new record to accompany their recently-released Showtime documentary series.EDM and rock have 6 and 5% of the tracks respectively to round things out, with Keith Urban, Lady Antebellum and Maddie & Tae contributing to the “country” genre tags for a total of 4% of the list.Brand New Music’s artists this week are mostly from English-speaking countries, almost 85% of it, which fits in nicely with the fact that 72% of the smart speaker market last year was solely based in the US and the UK.Between Spotify’s top 10 tracks on the New Music Friday and Brand New Music’s top 10, only 4 of the tracks overlap, coincidentally in the 1st, 2nd, 5th and 7th position.And if we throw in Apple Music’s equivalent playlist- Best of the Week- we have the 1st and 2nd position in common with the other two.So if you pulled up all three Global Release Day playlists on any of the platforms, you’re guaranteed to hear DJ Khaled and then Halsey, in that order.In all three, you’ll also end up hearing Lana Del Rey’s Sublime cover of “Doin’ Time”  before you hit track 6.There seems to be some slight experimentation going on: for example, you’ll hear Tyler the Creator on all platforms, but his track “WHAT’S GOOD” on Spotify and Amazon, while hearing “EARFQUAKE” on Apple.On the Latin front, Colombian rapper Maluma debuts a purely Spanish track “11 PM” on Amazon, while he shows off both Spanish and English vocals on another track, “Tu Vecina (feat. Ty Dolla $ign)” on Apple Music.Check out some of these interesting playlist differences for yourself before this Friday comes around!OutroThat’s it for your Daily Data Dump for Monday May 20th 2019. This is Jason from Chartmetric.Free accounts are at app.chartmetric.com/signupAnd article links and show notes are at: podcast.chartmetric.com.Happy Monday, see you tomorrow!

Gnuplot in Action is the first comprehensive introduction to gnuplot—from the basics to the power features and beyond. Besides providing a tutorial on gnuplot itself, it demonstrates how to apply and use gnuplot to extract intelligence from data. Particular attention is paid to tricky or poorly-explained areas. You will learn how to apply gnuplot to actual data analysis problems. This book looks at different types of graphs that can be generated with gnuplot and will discuss when and how to use them to extract actual information from data. About the Technology Statistical data is only as valuable as your ability to analyze, interpret, and present it in a meaningful way. Gnuplot is the most widely used program to plot and visualize data for Unix/Linux systems and it is also popular for Windows and the Mac. It's open-source (as in free!), actively maintained, stable, and mature. It can deal with arbitrarily large data sets and is capable of producing high-quality, publication-ready graphics. So far, the only comprehensive documentation available about gnuplot is the online reference documentation, which makes it both hard to get started and almost impossible to get a complete overview over all of its features. If you've never tried gnuplot—or have found it tough to get your arms around—read on. About the Book One of gnuplot's main advantages is that it requires no programming skills nor knowledge of advanced mathematical or statistical concepts. Gnuplot in Action assumes you have no previous knowledge of either gnuplot or statistics and data analysis. The books starts out with basic gnuplot concepts, then describes in depth how to get a graph ready for final presentation and to make it look "just right" by including arrows, labels, and other decorations. Next the book looks at advanced concepts, such as multi-dimensional graphs and false-color plots—powerful features for special purposes. The author also describes advanced applications of gnuplot, such as how to script gnuplot so that it can run unattended as a batch job, and how to call gnuplot from within a CGI script to generate graphics for dynamic websites on demand. What's Inside Creating graphs with gnuplot Data transformations and filters Preparing/polishing graphs for final presentation Publishing graphs in print or on the Web Using gnuplot's power features Gnuplot scripting and programming Types of graphs and when to use them Techniques of graphical analysis How to build, install, and develop for gnuplot Command and Option reference organized by concept About the Reader Gnuplot in Action makes gnuplot easy for anyone who needs to do data analysis, but doesn't have an education in analytical tools and methods. It's perfect for DBAs, programmers, and performance engineers; business analysts and MBAs; and Six-Sigma Black Belts and process engineers. About the Author Philipp K. Janert is Chief Consultant at Principal Value, LLC. He has been a gnuplot user for more than 15 years and regards it as one of the indispensable tools in his toolbox. He has worked for small start-ups and in large corporate environments, both in the US and overseas, including several years at Amazon.com, where he initiated and led several projects to improve Amazon's order fulfillment processes. Philipp K. Janert has written about software and software development for the O'Reilly Network, IBM developerWorks, IEEE Software, and Linux Magazine. He holds a Ph.D. in Theoretical Physics from the University of Washington. Visit his website at www.principal-value.com. Quotes Knee-deep in data? This is your guidebook to exploring it with gnuplot. - Austin King, Mozilla Sparkles with insight about visualization, image perception, and data exploration. - Richard B. Kreckel, GiNaC.de Incredibly useful for beginners - indispensable for advanced users. - Mark Pruett, Systems Architect Dominion Bridges the gap between gnupolt's reference manual and real-world problems. - Mitchell Johnson, Border Stylo A Swiss Army knife for plotting data. - Nishanth Sastry, Computer Laboratory, University of Cambridge/IBM

data data-science data-science-tasks data-visualization gnuplot IBM Linux Unix
O'Reilly Data Science Books
Showing 8 results