talk-data.com talk-data.com

Filter by Source

Select conferences and events

People (526 results)

See all 526 →

Activities & events

Title & Speakers Event
Event Data Skeptic 2025-11-23
Kyle Polich – host , Florian Atzenhofer-Baumgartner – PhD student @ Graz University of Technology

In this episode of Data Skeptic, we explore the fascinating intersection of recommender systems and digital humanities with guest Florian Atzenhofer-Baumgartner, a PhD student at Graz University of Technology. Florian is working on Monasterium.net, Europe's largest online collection of historical charters, containing millions of medieval and early modern documents from across the continent. The conversation delves into why traditional recommender systems fall short in the digital humanities space, where users range from expert historians and genealogists to art historians and linguists, each with unique research needs and information-seeking behaviors. Florian explains the technical challenges of building a recommender system for cultural heritage materials, including dealing with sparse user-item interaction matrices, the cold start problem, and the need for multi-modal similarity approaches that can handle text, images, metadata, and historical context. The platform leverages various embedding techniques and gives users control over weighting different modalities—whether they're searching based on text similarity, visual imagery, or diplomatic features like issuers and receivers. A key insight from Florian's research is the importance of balancing serendipity with utility, collection representation to prevent bias, and system explainability while maintaining effectiveness. The discussion also touches on unique evaluation challenges in non-commercial recommendation contexts, including Florian's "research funnel" framework that considers discovery, interaction, integration, and impact stages. Looking ahead, Florian envisions recommendation systems becoming standard tools for exploration across digital archives and cultural heritage repositories throughout Europe, potentially transforming how researchers discover and engage with historical materials. The new version of Monasterium.net, set to launch with enhanced semantic search and recommendation features, represents an important step toward making cultural heritage more accessible and discoverable for everyone.  

C#/.NET Funnel
Aditya Chichani – senior machine learning engineer @ Walmart , Kyle Polich – host

In this episode of Data Skeptic's Recommender Systems series, Kyle sits down with Aditya Chichani, a senior machine learning engineer at Walmart, to explore the darker side of recommendation algorithms. The conversation centers on shilling attacks—a form of manipulation where malicious actors create multiple fake profiles to game recommender systems, either to promote specific items or sabotage competitors. Aditya, who researched these attacks during his undergraduate studies at SPIT before completing his master's in computer science with a data science specialization at UC Berkeley, explains how these vulnerabilities emerge particularly in collaborative filtering systems. From promoting a friend's ska band on Spotify to inflating product ratings on e-commerce platforms, shilling attacks represent a significant threat in an industry where approximately 4% of reviews are fake, translating to $800 billion in annual sales in the US alone. The discussion delves deep into collaborative filtering, explaining both user-user and item-item approaches that create similarity matrices to predict user preferences. However, these systems face various shilling attacks of increasing sophistication: random attacks use minimal information with average ratings, while segmented attacks strategically target popular items (like Taylor Swift albums) to build credibility before promoting target items. Bandwagon attacks focus on highly popular items to connect with genuine users, and average attacks leverage item rating knowledge to appear authentic. User-user collaborative filtering proves particularly vulnerable, requiring as few as 500 fake profiles to impact recommendations, while item-item filtering demands significantly more resources. Aditya addresses detection through machine learning techniques that analyze behavioral patterns using methods like PCA to identify profiles with unusually high correlation and suspicious rating consistency. However, this remains an evolving challenge as attackers adapt strategies, now using large language models to generate more authentic-seeming fake reviews. His research with the MovieLens dataset tested detection algorithms against synthetic attacks, highlighting how these concerns extend to modern e-commerce systems. While companies rarely share attack and detection data publicly to avoid giving attackers advantages, academic research continues advancing both offensive and defensive strategies in recommender systems security.

AI/ML Computer Science Data Science Cyber Security
Caitlin Moorman – guest @ Hex

AI is transforming data teams - maybe you're anxious, excited, or kind of bummed. I'll share how I went from AI skeptic to enthusiast, how analytics engineering is evolving, and how data teams can finally deliver on promises we've been making for years.

AI/ML Analytics Analytics Engineering
dbt Coalesce 2025
Event Data Skeptic 2025-04-21
The Small World Hypothesis 2025-04-21 · 05:01
Kyle Polich – host

Kyle discusses the history and proof for the small world hypothesis.

Networks of the Mind 2025-02-18 · 20:13
Kyle Polich – host , Yoed Kennet – assistant professor @ Technion – Israel Institute of Technology

A man goes into a bar… This is the beginning of a riddle that our guest, Yoed Kennet, an assistant professor at the Technion's Faculty of Data and Decision Sciences, uses to measure creativity in subjects. In our talk, Yoed speaks about how to combine cognitive science and network science to explore the complexities and decode the mysteries of the human mind. The listeners will learn how network science provides tools to map and analyze human memory, revealing how problem-solving and creativity emerge from changes in semantic memory structures. Key insights include the role of memory restructuring during moments of insight, the connection between semantic networks and creative thinking, and how understanding these processes can improve problem-solving and analogical reasoning. Real-life applications span enhancing creativity in the workplace, building tools to combat cognitive rigidity in aging, and improving learning strategies by fostering richer, more flexible mental networks.


Want to listen ad-free?  Try our Graphs Course?  Join Data Skeptic+ for $5 / month of $50 / year https://plus.dataskeptic.com

Kyle Polich – host , Gerrit van den Burg – Postdoctoral Researcher @ The Alan Turing Institute

Gerrit van den Burg, Postdoctoral Researcher at The Alan Turing Institute, joins us today to discuss his work "An Evaluation of Change Point Detection Algorithms."

Time Series for Good 2021-11-01 · 13:00
Kyle Polich – host , Bahman Rostami-Tabar – Senior Lecturer in Management Science @ Cardiff University

Bahman Rostami-Tabar, Senior Lecturer in Management Science at Cardiff University, joins us today to talk about his work "Forecasting and its Beneficiaries."

Kyle Polich – host , Henning Lange – Postdoctoral Scholar in Applied Math @ University of Washington , Alex Mallen – Computer Science student @ University of Washington

Alex Mallen, Computer Science student at the University of Washington, and Henning Lange, a Postdoctoral Scholar in Applied Math at the University of Washington, join us today to share their work "Deep Probabilistic Koopman: Long-term Time-Series Forecasting Under Periodic Uncertainties."

Computer Science
Kyle Polich – host , Fotios Petropoulos – Professor of Management Science @ University of Bath

Fotios Petropoulos, Professor of Management Science at the University of Bath in The U.K., joins us today to talk about his work "Fast and Frugal Time Series Forecasting."

Kyle Polich – host , Manie Tadayon – PhD graduate @ University of California, Los Angeles (UCLA)

Manie Tadayon, a PhD graduate from the ECE department at University of California, Los Angeles, joins us today to talk about his work "Comparative Analysis of the Hidden Markov Model and LSTM: A Simulative Approach."

Kyle Polich – host , Sankeerth Rao Karingula – ML Researcher @ Palo Alto Networks

Sankeerth Rao Karingula, ML Researcher at Palo Alto Networks, joins us today to talk about his work "Boosted Embeddings for Time Series Forecasting."

Works Mentioned Boosted Embeddings for Time Series Forecasting by Sankeerth Rao Karingula, Nandini Ramanan, Rasool Tahmasbi, Mehrnaz Amjadi, Deokwoo Jung, Ricky Si, Charanraj Thimmisetty, Luisa Polania Cabrera, Marjorie Sayer, Claudionor Nunes Coelho Jr https://www.linkedin.com/in/sankeerthrao/ https://twitter.com/sankeerthrao3  https://lod2021.icas.cc/ 

AI/ML
David Daly – Performance Engineer @ MongoDB , Kyle Polich – host

David Daly, Performance Engineer at MongoDB, joins us today to discuss "The Use of Change Point Detection to Identify Software Performance Regressions in a Continuous Integration System". Works Mentioned The Use of Change Point Detection to Identify Software Performance Regressions in a Continuous Integration System by David Daly, William Brown, Henrik Ingo, Jim O'Leary, David BradfordSocial Media David's Website David's Twitter Mongodb

CI/CD MongoDB
Kyle Polich – host , Samya Tajmouati – PhD student in Data Science @ University of Science of Kenitra, Morocco

Samya Tajmouati, a PhD student in Data Science at the University of Science of Kenitra, Morocco, joins us today to discuss her work Applying K-Nearest Neighbors to Time Series Forecasting: Two New Approaches.

Data Science
Ultra Long Time Series 2021-09-13 · 13:00
Kyle Polich – host , Dr. Feng Li – Associate Professor of Statistics @ Central University of Finance and Economics

Dr. Feng Li, (@f3ngli) is an Associate Professor of Statistics in the School of Statistics and Mathematics at Central University of Finance and Economics in Beijing, China. He joins us today to discuss his work Distributed ARIMA Models for Ultra-long Time Series.

MiniRocket 2021-09-06 · 13:00
Kyle Polich – host , Angus Dempster – PhD Student @ Monash University

Angus Dempster, PhD Student at Monash University in Australia, comes on today to talk about MINIROCKET: A Very Fast (Almost) Deterministic Transform for Time Series Classification, a fast deterministic transform for time series classification. MINIROCKET reformulates ROCKET, gaining a 75x improvement on larger datasets with essentially the same performance. In this episode, we talk about the insights that realized this speedup as well as use cases.

ARiMA is not Sufficient 2021-08-30 · 13:00
Chongshou Li – Associate Professor @ Southwest Jiaotong University , Kyle Polich – host

Chongshou Li, Associate Professor at Southwest Jiaotong University in China, joins us today to talk about his work Why are the ARIMA and SARIMA not Sufficient.

Comp Engine 2021-08-23 · 13:00
Kyle Polich – host , Ben Fulcher – Senior Lecturer @ University of Sydney, School of Physics

Ben Fulcher, Senior Lecturer at the School of Physics at the University of Sydney in Australia, comes on today to talk about his project Comp Engine. Follow Ben on Twitter: @bendfulcher For posts about time series analysis : @comptimeseries comp-engine.org

Detecting Ransomware 2021-08-16 · 13:00
Kyle Polich – host , Nitin Pundir – PhD candidate @ University of Florida; Florida Institute for Cybersecurity Research

Nitin Pundir, PhD candidate at University Florida and works at the Florida Institute for Cybersecurity Research, comes on today to talk about his work "RanStop: A Hardware-assisted Runtime Crypto-Ransomware Detection Technique." FICS Research Lab - https://fics.institute.ufl.edu/  LinkedIn - https://www.linkedin.com/in/nitin-pundir470/

GANs in Finance 2021-08-09 · 13:00
Kyle Polich – host , Florian Eckerli – recent graduate @ Zurich University of Applied Sciences

Florian Eckerli, a recent graduate of Zurich University of Applied Sciences, comes on the show today to discuss his work Generative Adversarial Networks in Finance: An Overview.

Predicting Urban Land Use 2021-08-02 · 13:00
Kyle Polich – host , Daniel Omeiza – Doctoral student, Computer Science Department @ University of Oxford

Today on the show we have Daniel Omeiza, a doctoral student in the computer science department of the University of Oxford, who joins us to talk about his work Efficient Machine Learning for Large-Scale Urban Land-Use Forecasting in Sub-Saharan Africa.

AI/ML Computer Science