talk-data.com
Activities & events
| Title & Speakers | Event |
|---|---|
|
Designing Recommender Systems for Digital Humanities
2025-11-23 · 16:50
Kyle Polich
– host
,
Florian Atzenhofer-Baumgartner
– PhD student
@ Graz University of Technology
In this episode of Data Skeptic, we explore the fascinating intersection of recommender systems and digital humanities with guest Florian Atzenhofer-Baumgartner, a PhD student at Graz University of Technology. Florian is working on Monasterium.net, Europe's largest online collection of historical charters, containing millions of medieval and early modern documents from across the continent. The conversation delves into why traditional recommender systems fall short in the digital humanities space, where users range from expert historians and genealogists to art historians and linguists, each with unique research needs and information-seeking behaviors. Florian explains the technical challenges of building a recommender system for cultural heritage materials, including dealing with sparse user-item interaction matrices, the cold start problem, and the need for multi-modal similarity approaches that can handle text, images, metadata, and historical context. The platform leverages various embedding techniques and gives users control over weighting different modalities—whether they're searching based on text similarity, visual imagery, or diplomatic features like issuers and receivers. A key insight from Florian's research is the importance of balancing serendipity with utility, collection representation to prevent bias, and system explainability while maintaining effectiveness. The discussion also touches on unique evaluation challenges in non-commercial recommendation contexts, including Florian's "research funnel" framework that considers discovery, interaction, integration, and impact stages. Looking ahead, Florian envisions recommendation systems becoming standard tools for exploration across digital archives and cultural heritage repositories throughout Europe, potentially transforming how researchers discover and engage with historical materials. 
The new version of Monasterium.net, set to launch with enhanced semantic search and recommendation features, represents an important step toward making cultural heritage more accessible and discoverable for everyone. |
|
|
Shilling Attacks on Recommender Systems
2025-11-05 · 14:11
Aditya Chichani
– senior machine learning engineer
@ Walmart
,
Kyle Polich
– host
In this episode of Data Skeptic's Recommender Systems series, Kyle sits down with Aditya Chichani, a senior machine learning engineer at Walmart, to explore the darker side of recommendation algorithms. The conversation centers on shilling attacks—a form of manipulation where malicious actors create multiple fake profiles to game recommender systems, either to promote specific items or sabotage competitors. Aditya, who researched these attacks during his undergraduate studies at SPIT before completing his master's in computer science with a data science specialization at UC Berkeley, explains how these vulnerabilities emerge particularly in collaborative filtering systems. From promoting a friend's ska band on Spotify to inflating product ratings on e-commerce platforms, shilling attacks represent a significant threat in an industry where approximately 4% of reviews are fake, translating to $800 billion in annual sales in the US alone. The discussion delves deep into collaborative filtering, explaining both user-user and item-item approaches that create similarity matrices to predict user preferences. However, these systems face various shilling attacks of increasing sophistication: random attacks use minimal information with average ratings, while segmented attacks strategically target popular items (like Taylor Swift albums) to build credibility before promoting target items. Bandwagon attacks focus on highly popular items to connect with genuine users, and average attacks leverage item rating knowledge to appear authentic. User-user collaborative filtering proves particularly vulnerable, requiring as few as 500 fake profiles to impact recommendations, while item-item filtering demands significantly more resources. Aditya addresses detection through machine learning techniques that analyze behavioral patterns using methods like PCA to identify profiles with unusually high correlation and suspicious rating consistency. 
However, this remains an evolving challenge as attackers adapt strategies, now using large language models to generate more authentic-seeming fake reviews. His research with the MovieLens dataset tested detection algorithms against synthetic attacks, highlighting how these concerns extend to modern e-commerce systems. While companies rarely share attack and detection data publicly to avoid giving attackers advantages, academic research continues advancing both offensive and defensive strategies in recommender systems security. |
|
|
AI, anxiety, and the future of data teams: A playbook for real leadership
2025-10-15 · 22:00
Caitlin Moorman
– guest
@ Hex
AI is transforming data teams - maybe you're anxious, excited, or kind of bummed. I'll share how I went from AI skeptic to enthusiast, how analytics engineering is evolving, and how data teams can finally deliver on promises we've been making for years. |
dbt Coalesce 2025 |
|
The Small World Hypothesis
2025-04-21 · 05:01
Kyle Polich
– host
Kyle discusses the history of the small world hypothesis and the evidence for it. |
|
|
Networks of the Mind
2025-02-18 · 20:13
Kyle Polich
– host
,
Yoed Kennet
– assistant professor
@ Technion – Israel Institute of Technology
A man goes into a bar… This is the beginning of a riddle that our guest, Yoed Kennet, an assistant professor at the Technion's Faculty of Data and Decision Sciences, uses to measure creativity in subjects. In our talk, Yoed speaks about how to combine cognitive science and network science to explore the complexities and decode the mysteries of the human mind. Listeners will learn how network science provides tools to map and analyze human memory, revealing how problem-solving and creativity emerge from changes in semantic memory structures. Key insights include the role of memory restructuring during moments of insight, the connection between semantic networks and creative thinking, and how understanding these processes can improve problem-solving and analogical reasoning. Real-life applications span enhancing creativity in the workplace, building tools to combat cognitive rigidity in aging, and improving learning strategies by fostering richer, more flexible mental networks. Want to listen ad-free? Try our Graphs Course? Join Data Skeptic+ for $5 / month or $50 / year https://plus.dataskeptic.com |
|
|
Change Point Detection Algorithms
2021-11-08 · 14:14
Kyle Polich
– host
,
Gerrit van den Burg
– Postdoctoral Researcher
@ The Alan Turing Institute
Gerrit van den Burg, Postdoctoral Researcher at The Alan Turing Institute, joins us today to discuss his work "An Evaluation of Change Point Detection Algorithms." |
|
|
Time Series for Good
2021-11-01 · 13:00
Kyle Polich
– host
,
Bahman Rostami-Tabar
– Senior Lecturer in Management Science
@ Cardiff University
Bahman Rostami-Tabar, Senior Lecturer in Management Science at Cardiff University, joins us today to talk about his work "Forecasting and its Beneficiaries." |
|
|
Long Term Time Series Forecasting
2021-10-25 · 13:00
Kyle Polich
– host
,
Henning Lange
– Postdoctoral Scholar in Applied Math
@ University of Washington
,
Alex Mallen
– Computer Science student
@ University of Washington
Alex Mallen, Computer Science student at the University of Washington, and Henning Lange, a Postdoctoral Scholar in Applied Math at the University of Washington, join us today to share their work "Deep Probabilistic Koopman: Long-term Time-Series Forecasting Under Periodic Uncertainties." |
|
|
Fast and Frugal Time Series Forecasting
2021-10-17 · 20:13
Kyle Polich
– host
,
Fotios Petropoulos
– Professor of Management Science
@ University of Bath
Fotios Petropoulos, Professor of Management Science at the University of Bath in the U.K., joins us today to talk about his work "Fast and Frugal Time Series Forecasting." |
|
|
Causal Inference in Educational Systems
2021-10-11 · 13:00
Kyle Polich
– host
,
Manie Tadayon
– PhD graduate
@ University of California, Los Angeles (UCLA)
Manie Tadayon, a PhD graduate from the ECE department at University of California, Los Angeles, joins us today to talk about his work "Comparative Analysis of the Hidden Markov Model and LSTM: A Simulative Approach." |
|
|
Boosted Embeddings for Time Series
2021-10-04 · 13:00
Kyle Polich
– host
,
Sankeerth Rao Karingula
– ML Researcher
@ Palo Alto Networks
Sankeerth Rao Karingula, ML Researcher at Palo Alto Networks, joins us today to talk about his work "Boosted Embeddings for Time Series Forecasting." Works Mentioned: "Boosted Embeddings for Time Series Forecasting" by Sankeerth Rao Karingula, Nandini Ramanan, Rasool Tahmasbi, Mehrnaz Amjadi, Deokwoo Jung, Ricky Si, Charanraj Thimmisetty, Luisa Polania Cabrera, Marjorie Sayer, and Claudionor Nunes Coelho Jr. https://www.linkedin.com/in/sankeerthrao/ https://twitter.com/sankeerthrao3 https://lod2021.icas.cc/ |
|
|
Change Point Detection in Continuous Integration Systems
2021-09-27 · 13:00
David Daly
– Performance Engineer
@ MongoDB
,
Kyle Polich
– host
David Daly, Performance Engineer at MongoDB, joins us today to discuss "The Use of Change Point Detection to Identify Software Performance Regressions in a Continuous Integration System". Works Mentioned: "The Use of Change Point Detection to Identify Software Performance Regressions in a Continuous Integration System" by David Daly, William Brown, Henrik Ingo, Jim O'Leary, and David Bradford. |
|
|
Applying k-Nearest Neighbors to Time Series
2021-09-20 · 13:00
Kyle Polich
– host
,
Samya Tajmouati
– PhD student in Data Science
@ University of Science of Kenitra, Morocco
Samya Tajmouati, a PhD student in Data Science at the University of Science of Kenitra, Morocco, joins us today to discuss her work Applying K-Nearest Neighbors to Time Series Forecasting: Two New Approaches. |
|
|
Ultra Long Time Series
2021-09-13 · 13:00
Kyle Polich
– host
,
Dr. Feng Li
– Associate Professor of Statistics
@ Central University of Finance and Economics
Dr. Feng Li (@f3ngli) is an Associate Professor of Statistics in the School of Statistics and Mathematics at Central University of Finance and Economics in Beijing, China. He joins us today to discuss his work Distributed ARIMA Models for Ultra-long Time Series. |
|
|
MiniRocket
2021-09-06 · 13:00
Kyle Polich
– host
,
Angus Dempster
– PhD Student
@ Monash University
Angus Dempster, PhD Student at Monash University in Australia, comes on today to talk about "MINIROCKET: A Very Fast (Almost) Deterministic Transform for Time Series Classification." MINIROCKET reformulates ROCKET, gaining a 75x speedup on larger datasets with essentially the same accuracy. In this episode, we talk about the insights that made this speedup possible, as well as use cases. |
|
|
ARiMA is not Sufficient
2021-08-30 · 13:00
Chongshou Li
– Associate Professor
@ Southwest Jiaotong University
,
Kyle Polich
– host
Chongshou Li, Associate Professor at Southwest Jiaotong University in China, joins us today to talk about his work Why are the ARIMA and SARIMA not Sufficient. |
|
|
Comp Engine
2021-08-23 · 13:00
Kyle Polich
– host
,
Ben Fulcher
– Senior Lecturer
@ University of Sydney, School of Physics
Ben Fulcher, Senior Lecturer at the School of Physics at the University of Sydney in Australia, comes on today to talk about his project Comp Engine. Follow Ben on Twitter: @bendfulcher. For posts about time series analysis: @comptimeseries. comp-engine.org |
|
|
Detecting Ransomware
2021-08-16 · 13:00
Kyle Polich
– host
,
Nitin Pundir
– PhD candidate
@ University of Florida; Florida Institute for Cybersecurity Research
Nitin Pundir, a PhD candidate at the University of Florida who works at the Florida Institute for Cybersecurity Research, comes on today to talk about his work "RanStop: A Hardware-assisted Runtime Crypto-Ransomware Detection Technique." FICS Research Lab - https://fics.institute.ufl.edu/ LinkedIn - https://www.linkedin.com/in/nitin-pundir470/ |
|
|
GANs in Finance
2021-08-09 · 13:00
Kyle Polich
– host
,
Florian Eckerli
– recent graduate
@ Zurich University of Applied Sciences
Florian Eckerli, a recent graduate of Zurich University of Applied Sciences, comes on the show today to discuss his work Generative Adversarial Networks in Finance: An Overview. |
|
|
Predicting Urban Land Use
2021-08-02 · 13:00
Kyle Polich
– host
,
Daniel Omeiza
– Doctoral student, Computer Science Department
@ University of Oxford
Today on the show we have Daniel Omeiza, a doctoral student in the computer science department of the University of Oxford, who joins us to talk about his work Efficient Machine Learning for Large-Scale Urban Land-Use Forecasting in Sub-Saharan Africa. |
|