talk-data.com talk-data.com

Topic

LLM

Large Language Models (LLM)

nlp ai machine_learning

1405

tagged

Activity Trend

158 peak/qtr
2020-Q1 2026-Q1

Activities

1405 activities · Newest first

Send us a text Welcome to the cozy corner of the tech world where ones and zeros mingle with casual chit-chat. Datatopics Unplugged is your go-to spot for relaxed discussions around tech, news, data, and society. Dive into conversations that should flow as smoothly as your morning coffee (but don't), where industry insights meet laid-back banter. Whether you're a data aficionado or just someone curious about the digital age, pull up a chair, relax, and let's get into the heart of data, unplugged style! In This Episode: OpenAI's Election Strategy: A deep dive into how OpenAI is approaching the 2024 worldwide elections. How OpenAI is approaching 2024 worldwide electionsTech Quirks: Exploring the idiosyncrasies of the Continuity Camera in Chrome. Is it a magic trick or just poor engineering?The Rise of Bluesky: Unpacking the significance of Bluesky's newly launched RSS feeds. Bluesky launches RSS feedsHot Takes on Data: Tackling the bold statement - Every data transform is technical debt. An exploration of what technical debt really means in the data world. Every data transform is technical debtJoin hosts Murilo and Bart as they navigate these intriguing topics, bringing their unique perspectives and a touch of humor to the table. Intro music courtesy of fesliyanstudios.com.

Link to Episode 165 on WebsiteDiscuss this episode, leave a comment, or ask a question (on GitHub)Twitter ADSP: The PodcastConor HoekstraBryce Adelstein LelbachShow Notes Date Recorded: 2024-01-18 Date Released: 2024-01-19 ADSP Episode 164: Are We Going to Run Out of Credit Cards?ADSP Episode 163: Sean Parent on Tilt Five, Metrowerks & Be Inc.Acquired Podcast S10E5: Nvidia Part I: The GPU Company (1993-2006)Acquired Podcast S10E6: Nvidia Part II: The Machine Learning Company (2006-2022)Acquired Podcast S13E3: Nvidia Part III: The Dawn of the AI Era (2022-2023)Acquired Podcast: NVIDIA CEO Jensen HuangADSP Episode 150: Is C++ Dying?Rust Programming LanguageHylo Programming LanguageCircle C++ CompilerCarbon Programming LanguageMLIRMojoGoC++ Community SurveyLambdaDaysRun For the Fun of It Episode 14: YüBaí Joins BlackToe 🥳, 5k & Half PBs 🔥 & 2024 Goals 🎯CppNorthIntro Song Info Miss You by Sarah Jansen https://soundcloud.com/sarahjansenmusic Creative Commons — Attribution 3.0 Unported — CC BY 3.0 Free Download / Stream: http://bit.ly/l-miss-you Music promoted by Audio Library https://youtu.be/iYYxnasvfx8

Send us a text Welcome to the cozy corner of the tech world where ones and zeros mingle with casual chit-chat. Datatopics Unplugged is your go-to spot for relaxed discussions around tech, news, data, and society. Dive into conversations that should flow as smoothly as your morning coffee (but don't), where industry insights meet laid-back banter. Whether you're a data aficionado or just someone curious about the digital age, pull up a chair, relax, and let's get into the heart of data, unplugged style! In this episode, we're joined by Ben Mellaerts, an expert Data Strategist, to unravel the intricacies of some of the hottest topics in the tech world. Rabbit R1 AI Action Model: A deep dive into the hype and reality of the Rabbit R1, a potential game-changer in AI.The Verge reports on Rabbit R1's release and featuresThe R1's rapid sell-out post-CESNY Times vs. OpenAI: Exploring the implications of the NY Times lawsuit against OpenAI, a case that could shape the future of AI in media.The New York Times' perspective on the lawsuitChatGPT Store: Unpacking the potential of OpenAI's ChatGPT store and its impact on the tech and data landscape.Intro music courtesy of fesliyanstudios.com.

How to Become a Data Analyst

Start a brand-new career in data analytics with no-nonsense advice from a self-taught data analytics consultant In How to Become a Data Analyst: My Low-Cost, No Code Roadmap for Breaking into Tech, data analyst and analytics consultant Annie Nelson walks you through how she took the reins and made a dramatic career change to unlock new levels of career fulfilment and enjoyment. In the book, she talks about the adaptability, curiosity, and persistence you’ll need to break free from the 9-5 grind and how data analytics—with its wide variety of skills, roles, and options—is the perfect field for people looking to refresh their careers. Annie offers practical and approachable data portfolio-building advice to help you create one that’s manageable for an entry-level professional but will still catch the eye of employers and clients. You’ll also find: Deep dives into the learning journey required to step into a data analytics role Ways to avoid getting lost in the maze of online courses and certifications you can find online—while still obtaining the skills you need to be competitive Explorations of the highs and lows of Annie’s career-change journey and job search—including what was hard, what was easy, what worked well, and what didn’t Strategies for using ChatGPT to help you in your job search A must-read roadmap to a brand-new and exciting career in data analytics, How to Become a Data Analyst is the hands-on tutorial that shows you exactly how to succeed.

We talked about:

Atita’s background How NLP relates to search Atita’s experience with Lucidworks and OpenSource Connections Atita’s experience with Qdrant and vector databases Utilizing vector search Major changes to search Atita has noticed throughout her career RAG (Retrieval-Augmented Generation) Building a chatbot out of transcripts with LLMs Ingesting the data and evaluating the results Keeping humans in the loop Application of vector databases for machine learning Collaborative filtering Atita’s resource recommendations

Links:

LinkedIn: https://www.linkedin.com/in/atitaarora/
Twitter: https://x.com/atitaarora Github: https://github.com/atarora Human-in-the-Loop Machine Learning: https://www.manning.com/books/human-in-the-loop-machine-learning Relevant Search: https://www.manning.com/books/relevant-search Let's learn about Vectors: https://hub.superlinked.com/ Langchain: https://python.langchain.com/docs/get_started/introduction Qdrant blog: https://blog.qdrant.tech/ OpenSource Connections Blog: https://opensourceconnections.com/blog/

Free ML Engineering course: http://mlzoomcamp.com Join DataTalks.Club: https://datatalks.club/slack.html Our events: https://datatalks.club/events.html

For those who celebrate or acknowledge it, Christmas is now in the rearview mirror. Father Time has a beard that reaches down to his toes, and he's ready to hand over the clock to an absolutely adorable little Baby Time when 2024 rolls in. That means it's time for our annual set of reflections on the analytics and data science industry. Somehow, the authoring of this description of the show was completely unaided by an LLM, although the show did include quite a bit of discussion around generative AI. It also included the announcement of a local LLM based on all of our podcast episodes to date (updated with each new episode going forward!), which you can try out here! The discussion was wide-ranging beyond AI: Google Analytics 4, Marketing Mix Modelling (MMM), the technical/engineering side of analytics versus the softer skills of creative analytical thought and engaging with stakeholders, and more, as well as a look ahead to 2024! For complete show notes, including links to items mentioned in this episode and a transcript of the show, visit the show page.

Send us a text Welcome to the cozy corner of the tech world where ones and zeros mingle with casual chit-chat. Datatopics Unplugged is your go-to spot for relaxed discussions around tech, news, data, and society. Dive into conversations that should flow as smoothly as your morning coffee (but don’t), where industry insights meet laid-back banter. Whether you’re a data aficionado or just someone curious about the digital age, pull up a chair, relax, and let’s get into the heart of data, unplugged style! In this episode, we’re joined by Maryam, an Analytics Engineer with a passion for challenges and a knack for curiosity. From sewing to yoga, Maryam brings a unique perspective to our tech-centric discussions. Analytics Engineer Insights: Maryam discusses her role, the rise of Analytics Engineers, and their essential tools. Read more about Analytics Engineering.The Emerging Role of AI Translator: Exploring the link between Analytics Engineers and AI Translators, and the skills required in these evolving fields. Learn about AI Translator.Mistral AI’s New Developments: Analyzing Mistral AI’s latest model and its implications for the industry. Discover Mistral AI’s update.ChatGPT – A Double-Edged Sword: Discussing the impacts of ChatGPT on the AI landscape and the pace of innovation. Reflect on ChatGPT’s impact.ChatGPT & Job Applications: A fresh take on how ChatGPT is influencing job applications and hiring processes.Engineering Management Insights: Exploring whether becoming an Engineering Manager is a path worth considering.Intro music courtesy of fesliyanstudios.com.

We streamed live!

There are a few caveats to using generative AI tools, those caveats have led to a few tips that have quickly become second nature to those that use LLMs like ChatGPT. The main one being: have the domain knowledge to validate the output in order to avoid hallucinations. Hallucinations are one of the weak spots for LLMs due to the nature of the way they are built, as they are trained to correlate data in order to predict what might come next in an incomplete sequence. Does this mean that we’ll always have to be wary of the output of AI products, with the expectation that there is no intelligent decision-making going on under the hood? Far from it. Causal AI is bound by reason—rather than looking at correlation, these exciting systems are able to focus on the underlying causal mechanisms and relationships. As the AI field rapidly evolves, Causal AI is an area of research that is likely to have a huge impact on a huge number of industries and problems.  Paul Hünermund is an Assistant Professor of Strategy and Innovation at Copenhagen Business School. In his research, Dr. Hünermund studies how firms can leverage new technologies in the space of machine learning and artificial intelligence such as Causal AI for value creation and competitive advantage. His work explores the potential for biases in organizational decision-making and ways for managers to counter them. It thereby sheds light on the origins of effective business strategies in markets characterized by a high degree of technological competition and the resulting implications for economic growth and environmental sustainability.  His work has been published in The Journal of Management Studies, the Econometrics Journal, Research Policy, Journal of Product Innovation Management, International Journal of Industrial Organization, MIT Sloan Management Review, and Harvard Business Review, among others.  In the full episode, Richie and Paul explore Causal AI, its differences when compared to other forms of AI, use cases of Causal AI in fields like drug development, marketing, manufacturing, and defense. They also discuss how Causal AI contributes to better decision-making, the role of domain experts in getting accurate results, what happens in the early stages of Causal AI adoption, exciting new developments within the Causal AI space and much more.  Links Mentioned in the Show: Causal Data Science in BusinessCausal AI by causaLensIntro to Causal AI Using the DoWhy Library in PythonLesson: Inference (causal) models

Python 3 and Data Visualization Using ChatGPT /GPT-4

This book is designed to show readers the concepts of Python 3 programming and the art of data visualization. It also explores cutting-edge techniques using ChatGPT/GPT-4 in harmony with Python for generating visuals that tell more compelling data stories. Chapter 1 introduces the essentials of Python, covering a vast array of topics from basic data types, loops, and functions to more advanced constructs like dictionaries, sets, and matrices. In Chapter 2, the focus shifts to NumPy and its powerful array operations, leading into data visualization using prominent libraries such as Matplotlib. Chapter 6 includes Seaborn's rich visualization tools, offering insights into datasets like Iris and Titanic. Further, the book covers other visualization tools and techniques, including SVG graphics, D3 for dynamic visualizations, and more. Chapter 7 covers information about the main features of ChatGPT and GPT-4, as well as some of their competitors. Chapter 8 contains examples of using ChatGPT in order to perform data visualization, such as charts and graphs that are based on datasets (e.g., the Titanic dataset). Companion files with code, datasets, and figures are available for downloading. From foundational Python concepts to the intricacies of data visualization, this book is ideal for Python practitioners, data scientists, and anyone in the field of data analytics looking to enhance their storytelling with data through visuals. It's also perfect for educators seeking material for teaching advanced data visualization techniques.

Está no ar nossa o Data Hackers News ! Os assuntos mais quentes da semana, com as principais notícias da área de Dados, IA e Tecnologia, que você também encontra na nossa Newsletter semanal, agora no Podcast do Data Hackers !

Aperta o play e ouça agora, o Data Hackers News dessa semana !

Para saber sobre essa e outras noticias, se inscreva na Newsletter semanal:

⁠⁠⁠⁠⁠⁠⁠⁠https://www.datahackers.news/⁠⁠⁠⁠⁠⁠⁠⁠

Conheça nossos comentaristas do Data Hackers News:

⁠⁠⁠⁠Monique Femme⁠⁠⁠⁠⁠ ⁠⁠⁠⁠⁠Paulo Vasconcellos⁠⁠⁠⁠⁠

Demais canais do Data Hackers:

⁠⁠⁠⁠⁠⁠⁠⁠Site⁠⁠⁠⁠⁠⁠⁠⁠ ⁠⁠⁠⁠⁠⁠⁠⁠Linkedin⁠⁠⁠⁠⁠⁠⁠⁠ ⁠⁠⁠⁠⁠⁠⁠⁠Instagram⁠⁠⁠⁠⁠⁠⁠⁠ ⁠⁠⁠⁠⁠⁠⁠⁠Tik Tok⁠⁠⁠⁠⁠⁠⁠⁠ ⁠⁠⁠⁠⁠⁠⁠⁠You Tube⁠⁠⁠⁠⁠⁠⁠⁠

Matérias comentadas:

Demonstração do Google Gemini foi ensaiada;

Grupo brasileiro lança rival do ChatGPT em Português;

Criadora do Stable Diffusion pode ser vendida.

Já aproveita para nos seguir no Spotify, Apple Podcasts, ou no seu player de podcasts favoritos !

Over the past year, we’ve seen a full hype cycle of hysteria and discourse surrounding generative AI. It almost seems difficult to think back to a time when no one had used ChatGPT. We are in the midst of the fourth industrial revolution, and technology is moving rapidly. Better performing and more capable models are being released at a stunning rate, and with the growing presence of multimodal AI, can we expect another whirlwind year that vastly changes the state of play within AI again? Who might be able to provide insight into what is to come in 2024? Craig S. Smith is an American journalist, former executive of The New York Times, and host of the podcast Eye on AI. Until January 2000, he wrote for The Wall Street Journal, most notably covering the rise of the religious movement Falun Gong in China. He has reported for the Times from more than 40 countries and has covered several conflicts, including the 2001 invasion of Afghanistan, the 2003 war in Iraq, and the 2006 Israeli-Lebanese war. He retired from the Times in 2018 and now writes about artificial intelligence for the Times and other publications. He was a special Government employee for the National Security Commission on Artificial Intelligence until the commission's end in October 2021.  In the episode, Richie and Craig explore the 2023 advancements in generative AI, such as GPT-4, and the evolving roles of companies like Anthropic and Meta, practical AI applications for research and image generation, challenges in large language models, the promising future of world models and AI agents, the societal impacts of AI, the issue of misinformation, computational constraints, and the importance of AI literacy in the job market, the transformative potential of AI in various sectors and much more.  Links Mentioned in the Show: Eye on AIWayveAnthropicCohereMidjourneyYann Lecun

Send us a text Welcome to the cozy corner of the tech world where ones and zeros mingle with casual chit-chat. Datatopics Unplugged is your go-to spot for relaxed discussions around tech, news, data, and society.

Dive into conversations that should flow as smoothly as your morning coffee (but don't), where industry insights meet laid-back banter. Whether you're a data aficionado or just someone curious about the digital age, pull up a chair, relax, and let's get into the heart of data, unplugged style!

In today's episode Murilo & Bart discuss:

AI and Software Insights Introducing Gemini: Google's latest AI modelGoogle BlogFireship Dev TweetTechCrunch Article on Gemini DemoCommunication and Collaboration in Tech 6 tiny wording tweaks to level up your communication as a software engineerCareer CutlerMLOps and Model Development Navigating the chaos: why you don’t need another MLOps toolOpenLayer BlogChatGPT's performance on Julia vs. Python and R for LLM Code GenerationStochastic LifestyleEmerging Tech and Fun Finds JSONB in SQLiteSQLite ForumWizard Zines for a touch of geekinessWizard ZinesSports Illustrated's AI author sagaThe Verge ArticleMonaspace: A superfamily of fonts for codeMonaspaceHot Takes Paper: You Want My Password or a Dead Patient?Cohost ArticleIntro music courtesy of fesliyanstudios.com Check out the episode on YouTube.

In this talk we will have a look at Haystack, an open source LLM framework, and how we can use it to create custom, private search systems on our own data. We will look at how we can build retrieval augmented generative pipelines for our Notion pages, and how Haystack can help you create custom tooling for larger NLP applications.

Bobur Umurzokov: Querying Live Data With LLM App

Unlock the secrets of querying live data with Bobur Umurzokov as he presents 'Querying Live Data With LLM App.' 🌐🤖 Discover how to build your own AI app in just 30 lines of code, harnessing the power of OpenAI's API and Pathway Python libraries. 🚀 Explore a revolutionary approach to handling real-time, ever-changing data for information retrieval, content recommendation, and dynamic chatbots! 📈📚 #LiveData #AIApp #openai

✨ H I G H L I G H T S ✨

🙌 A huge shoutout to all the incredible participants who made Big Data Conference Europe 2023 in Vilnius, Lithuania, from November 21-24, an absolute triumph! 🎉 Your attendance and active participation were instrumental in making this event so special. 🌍

Don't forget to check out the session recordings from the conference to relive the valuable insights and knowledge shared! 📽️

Once again, THANK YOU for playing a pivotal role in the success of Big Data Conference Europe 2023. 🚀 See you next year for another unforgettable conference! 📅 #BigDataConference #SeeYouNextYear

Está no ar nossa o Data Hackers News ! Os assuntos mais quentes da semana, com as principais notícias da área de Dados, IA e Tecnologia, que você também encontra na nossa Newsletter semanal, agora no Podcast do Data Hackers !

Aperta o play e ouça agora, tudo o que rolou na linha do tempo da treta envolvendo a OpenAI e o Sam Altman !!

Para saber sobre essa e outras noticias, se inscreva na Newsletter semanal:

⁠⁠⁠⁠⁠⁠⁠https://www.datahackers.news/⁠⁠⁠⁠⁠⁠⁠

Conheça nossos comentaristas do Data Hackers News:

⁠⁠⁠⁠⁠⁠⁠⁠⁠Monique Femme⁠⁠⁠⁠⁠⁠⁠⁠⁠ ⁠Gabriel Lages⁠ ⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠

Demais canais do Data Hackers:

⁠⁠⁠⁠⁠⁠⁠Site⁠⁠⁠⁠⁠⁠⁠ ⁠⁠⁠⁠⁠⁠⁠Linkedin⁠⁠⁠⁠⁠⁠⁠ ⁠⁠⁠⁠⁠⁠⁠Instagram⁠⁠⁠⁠⁠⁠⁠ ⁠⁠⁠⁠⁠⁠⁠Tik Tok⁠⁠⁠⁠⁠⁠⁠ ⁠⁠⁠⁠⁠⁠⁠You Tube⁠⁠⁠⁠⁠⁠⁠

Matérias comentadas:

⁠Lei criada com ChatGPT é aprovada em Porto Alegre

Magalu anuncia lançamento Serviço de Nuvem 

Pika: o novo software de IA para criação e edição de vídeos.

Já aproveita para nos seguir no Spotify, Apple Podcasts, ou no seu player de podcasts favoritos !

Asta Bagdonavičienė & Diana Gold: Analytics in 2030

Join Asta Bagdonavičienė and Diana Gold for a captivating discussion on 'Analytics in 2030.' 📈🤖 Explore the impact of ChatGPT and AI on the ever-evolving field of analytics. Discover how roles in analytics have transformed throughout history and gain insights into what the future holds. Stay prepared and relevant in the ever-changing landscape of data and digitalization! 🕰️🔍 #Analytics #futuretrends

✨ H I G H L I G H T S ✨

🙌 A huge shoutout to all the incredible participants who made Big Data Conference Europe 2023 in Vilnius, Lithuania, from November 21-24, an absolute triumph! 🎉 Your attendance and active participation were instrumental in making this event so special. 🌍

Don't forget to check out the session recordings from the conference to relive the valuable insights and knowledge shared! 📽️

Once again, THANK YOU for playing a pivotal role in the success of Big Data Conference Europe 2023. 🚀 See you next year for another unforgettable conference! 📅 #BigDataConference #SeeYouNextYear

David Pilato: Search: A New Era

Embark on a journey into the future of search with David Pilato as he unveils 'Search: A New Era.' 🚀🔍 Explore the evolution from traditional TF/IDF to cutting-edge machine learning and models in search technology. Dive deep into topics like vector search, OpenAI's ChatGPT integration, and the latest advancements in search methodologies, including demonstrations on generating music embeddings and more! 🎶💡 #SearchTechnology #MachineLearning #elasticsearch

✨ H I G H L I G H T S ✨

🙌 A huge shoutout to all the incredible participants who made Big Data Conference Europe 2023 in Vilnius, Lithuania, from November 21-24, an absolute triumph! 🎉 Your attendance and active participation were instrumental in making this event so special. 🌍

Don't forget to check out the session recordings from the conference to relive the valuable insights and knowledge shared! 📽️

Once again, THANK YOU for playing a pivotal role in the success of Big Data Conference Europe 2023. 🚀 See you next year for another unforgettable conference! 📅 #BigDataConference #SeeYouNextYear

Mauro Bennici: Collect Data to Fine-Tune Your Large Language Model

Unlock the power of data to supercharge your Large Language Models with Mauro Bennici! 🚀 Join his session, 'Fine-Tuning for Business Success,' and learn how to create the perfect model without breaking the bank. 📊🤖 #LLMs #DataTuning

✨ H I G H L I G H T S ✨

🙌 A huge shoutout to all the incredible participants who made Big Data Conference Europe 2023 in Vilnius, Lithuania, from November 21-24, an absolute triumph! 🎉 Your attendance and active participation were instrumental in making this event so special. 🌍

Don't forget to check out the session recordings from the conference to relive the valuable insights and knowledge shared! 📽️

Once again, THANK YOU for playing a pivotal role in the success of Big Data Conference Europe 2023. 🚀 See you next year for another unforgettable conference! 📅 #BigDataConference #SeeYouNextYear

Está no ar nossa décima edição Data Hackers News ! Os assuntos mais quentes da semana, com as principais notícias da área de Dados, IA e Tecnologia, que você também encontra na nossa Newsletter semanal, agora no Podcast do Data Hackers !

Aperta o play e ouça agora, tudo o que rolou na linha do tempo da treta envolvendo a OpenAI e o Sam Altman !!

Mas antes, não se esqueça de preencher a pesquisa State of Data 23, que está chegando ao fim: ⁠⁠⁠⁠⁠http://www.stateofdata.com.br/podcast⁠⁠⁠⁠⁠

Para saber sobre essa e outras noticias, se inscreva na Newsletter semanal:

⁠⁠⁠⁠⁠⁠https://www.datahackers.news/⁠⁠⁠⁠⁠⁠

Conheça nossos comentaristas do Data Hackers News:

⁠⁠⁠⁠⁠⁠⁠⁠Monique Femme⁠⁠⁠⁠⁠⁠⁠⁠ Gabriel Lages ⁠⁠⁠⁠⁠⁠⁠⁠

Demais canais do Data Hackers:

⁠⁠⁠⁠⁠⁠Site⁠⁠⁠⁠⁠⁠ ⁠⁠⁠⁠⁠⁠Linkedin⁠⁠⁠⁠⁠⁠ ⁠⁠⁠⁠⁠⁠Instagram⁠⁠⁠⁠⁠⁠ ⁠⁠⁠⁠⁠⁠Tik Tok⁠⁠⁠⁠⁠⁠ ⁠⁠⁠⁠⁠⁠You Tube⁠⁠⁠⁠⁠⁠

Matérias comentadas:

Linha do tempo da treta da OpenAI;

Pesquisadores alertaram conselho sobre avanço da AI antes da Demissão;

Sam Altman e Adam D’Angelo se reúnem e passam o Dia de Ação de Graças juntos.

Já aproveita para nos seguir no Spotify, Apple Podcasts, ou no seu player de podcasts favoritos !