Java

Exploring NATS: A Multi-Paradigm Connectivity Layer for Distributed Applications

2025-04-28 · Data Engineering Podcast Listen

podcast_episode

by Derek Collison (Synadia) , Tobias Macey

AI/ML Cloud Computing Data Engineering Data Management Datafold Kafka Python TIBCO Spotfire

Summary In this episode of the Data Engineering Podcast Derek Collison, creator of NATS and CEO of Synadia, talks about the evolution and capabilities of NATS as a multi-paradigm connectivity layer for distributed applications. Derek discusses the challenges and solutions in building distributed systems, and highlights the unique features of NATS that differentiate it from other messaging systems. He delves into the architectural decisions behind NATS, including its ability to handle high-speed global microservices, support for edge computing, and integration with Jetstream for data persistence, and explores the role of NATS in modern data management and its use cases in industries like manufacturing and connected vehicles.

Announcements Hello and welcome to the Data Engineering Podcast, the show about modern data managementData migrations are brutal. They drag on for months—sometimes years—burning through resources and crushing team morale. Datafold's AI-powered Migration Agent changes all that. Their unique combination of AI code translation and automated data validation has helped companies complete migrations up to 10 times faster than manual approaches. And they're so confident in their solution, they'll actually guarantee your timeline in writing. Ready to turn your year-long migration into weeks? Visit dataengineeringpodcast.com/datafold today for the details.Your host is Tobias Macey and today I'm interviewing Derek Collison about NATS, a multi-paradigm connectivity layer for distributed applications.Interview IntroductionHow did you get involved in the area of data management?Can you describe what NATS is and the story behind it?How have your experiences in past roles (cloud foundry, TIBCO messaging systems) informed the core principles of NATS?What other sources of inspiration have you drawn on in the design and evolution of NATS? (e.g. Kafka, RabbitMQ, etc.)There are several patterns and abstractions that NATS can support, many of which overlap with other well-regarded technologies. When designing a system or service, what are the heuristics that should be used to determine whether NATS should act as a replacement or addition to those capabilities? (e.g. considerations of scale, speed, ecosystem compatibility, etc.)There is often a divide in the technologies and architecture used between operational/user-facing applications and data systems. How does the unification of multiple messaging patterns in NATS shift the ways that teams think about the relationship between these use cases?How does the shared communication layer of NATS with multiple protocol and pattern adaptaters reduce the need to replicate data and logic across application and data layers?Can you describe how the core NATS system is architected?How have the design and goals of NATS evolved since you first started working on it?In the time since you first began writing NATS (~2012) there have been several evolutionary stages in both application and data implementation patterns. How have those shifts influenced the direction of the NATS project and its ecosystem?For teams who have an existing architecture, what are some of the patterns for adoption of NATS that allow them to augment or migrate their capabilities?What are some of the ecosystem investments that you and your team have made to ease the adoption and integration of NATS?What are the most interesting, innovative, or unexpected ways that you have seen NATS used?What are the most interesting, unexpected, or challenging lessons that you have learned while working on NATS?When is NATS the wrong choice?What do you have planned for the future of NATS?Contact Info GitHubLinkedInParting Question From your perspective, what is the biggest gap in the tooling or technology for data management today?Closing Announcements Thank you for listening! Don't forget to check out our other shows. Podcast.init covers the Python language, its community, and the innovative ways it is being used. The AI Engineering Podcast is your guide to the fast-moving world of building AI systems.Visit the site to subscribe to the show, sign up for the mailing list, and read the show notes.If you've learned something or tried out a project from the show then tell us about it! Email [email protected] with your story.Links NATSNATS JetStreamSynadiaCloud FoundryTIBCOApplied Physics Lab - Johns Hopkins UniversityCray SupercomputerRVCM Certified MessagingTIBCO ZMSIBM MQJMS == Java Message ServiceRabbitMQMongoDBNodeJSRedisAMQP == Advanced Message Queueing ProtocolPub/Sub PatternCircuit Breaker PatternZero MQAkamaiFastlyCDN == Content Delivery NetworkAt Most OnceAt Least OnceExactly OnceAWS KinesisMemcachedSQSSegmentRudderstackPodcast EpisodeDLQ == Dead Letter QueueMQTT == Message Queueing Telemetry TransportNATS Kafka Bridge10BaseT NetworkWeb AssemblyRedPandaPodcast EpisodePulsar FunctionsmTLSAuthZ (Authorization)AuthN (Authentication)NATS Auth CalloutsOPA == Open Policy AgentRAG == Retrieval Augmented GenerationAI Engineering Podcast EpisodeHome AssistantPodcast.init EpisodeTailscaleOllamaCDC == Change Data CapturegRPCThe intro and outro music is from The Hug by The Freak Fandango Orchestra / CC BY-SA

Data Intensive AI - Bartosz Mikulski

2025-03-21 · DataTalks.Club Listen

podcast_episode

by Bartosz Mikulski

AI/ML Data Engineering GitHub HTML LLM MLOps Spark

In this podcast episode, we talked with Bartosz Mikulski about Data Intensive AI.

About the Speaker: Bartosz is an AI and data engineer. He specializes in moving AI projects from the good-enough-for-a-demo phase to production by building a testing infrastructure and fixing the issues detected by tests. On top of that, he teaches programmers and non-programmers how to use AI. He contributed one chapter to the book 97 Things Every Data Engineer Should Know, and he was a speaker at several conferences, including Data Natives, Berlin Buzzwords, and Global AI Developer Days.

In this episode, we discuss Bartosz’s career journey, the importance of testing in data pipelines, and how AI tools like ChatGPT and Cursor are transforming development workflows. From prompt engineering to building Chrome extensions with AI, we dive into practical use cases, tools, and insights for anyone working in data-intensive AI projects. Whether you’re a data engineer, AI enthusiast, or just curious about the future of AI in tech, this episode offers valuable takeaways and real-world experiences.

0:00 Introduction to Bartosz and his background 4:00 Bartosz’s career journey from Java development to AI engineering 9:05 The importance of testing in data engineering 11:19 How to create tests for data pipelines 13:14 Tools and approaches for testing data pipelines 17:10 Choosing Spark for data engineering projects 19:05 The connection between data engineering and AI tools 21:39 Use cases of AI in data engineering and MLOps 25:13 Prompt engineering techniques and best practices 31:45 Prompt compression and caching in AI models 33:35 Thoughts on DeepSeek and open-source AI models 35:54 Using AI for lead classification and LinkedIn automation 41:04 Building Chrome extensions with AI integration 43:51 Comparing Cursor and GitHub Copilot for coding 47:11 Using ChatGPT and Perplexity for AI-assisted tasks 52:09 Hosting static websites and using AI for development 54:27 How blogging helps attract clients and share knowledge 58:15 Using AI to assist with writing and content creation

🔗 CONNECT WITH Bartosz LinkedIn: https://www.linkedin.com/in/mikulskibartosz/ Github: https://github.com/mikulskibartosz Website: https://mikulskibartosz.name/blog/

🔗 CONNECT WITH DataTalksClub Join the community - https://datatalks.club/slack.html Subscribe to our Google calendar to have all our events in your calendar - https://calendar.google.com/calendar/r?cid=ZjhxaWRqbnEwamhzY3A4ODA5azFlZ2hzNjBAZ3JvdXAuY2FsZW5kYXIuZ29vZ2xlLmNvbQ Check other upcoming events - https://lu.ma/dtc-events LinkedIn - https://www.linkedin.com/company/datatalks-club/ Twitter - https://twitter.com/DataTalksClub Website - https://datatalks.club/

Developer Experience at Uber with Gautam Korlam

2025-03-12 · The Pragmatic Engineer Listen

podcast_episode

by Gergely Orosz , Gautam Korlam (Gitar)

AI/ML Cloud Computing Marketing

Supported by Our Partners • Sentry — Error and performance monitoring for developers. • The Software Engineer’s Guidebook: Written by me (Gergely) – now out in audio form as well. — In today’s episode of The Pragmatic Engineer, I am joined by former Uber colleague, Gautam Korlam. Gautam is the Co-Founder of Gitar, an agentic AI startup that automates code maintenance. Gautam was mobile engineer no. 9 at Uber and founding engineer for the mobile platform team – and so he learned a few things about scaling up engineering teams. We talk about: • How Gautam accidentally deleted Uber’s Java monorepo – really! • Uber's unique engineering stack and why custom solutions like SubmitQueue were built in-house • Monorepo: the benefits and downsides of this approach • From Engineer II to Principal Engineer at Uber: Gautam’s career trajectory • Practical strategies for building trust and gaining social capital • How the platform team at Uber operated with a product-focused mindset • Vibe coding: why it helps with quick prototyping • How AI tools are changing developer experience and productivity • Important skills for devs to pick up to remain valuable as AI tools spread • And more! — Timestamps (00:00) Intro (02:11) How Gautam accidentally deleted Uber’s Java Monorepo (05:40) The impact of Gautam’s mistake (06:35) Uber’s unique engineering stack (10:15) Uber’s SubmitQueue (12:44) Why Uber moved to a monorepo (16:30) The downsides of a monorepo (18:35) Measurement products built in-house (20:20) Measuring developer productivity and happiness (22:52) How Devpods improved developer productivity (27:37) The challenges with cloud development environments (29:10) Gautam’s journey from Eng II to Principal Engineer (32:00) Building trust and gaining social capital (36:17) An explanation of Principal Engineer at Uber—and the archetypes at Uber (45:07) The platform and program split at Uber (48:15) How Gautam and his team supported their internal users (52:50) Gautam’s thoughts on developer productivity (59:10) How AI enhances productivity, its limitations, and the rise of agentic AI (1:04:00) An explanation of Vibe coding (1:07:34) An overview of Gitar and all it can help developers with (1:10:44) Top skills to cultivate to add value and stay relevant (1:17:00) Rapid fire round — The Pragmatic Engineer deepdives relevant for this episode: • The Platform and Program split at Uber • How Uber is measuring engineering productivity • Inside Uber’s move to the Cloud • How Uber built its observability platform • Software Architect Archetypes — See the transcript and other references from the episode at ⁠⁠https://newsletter.pragmaticengineer.com/podcast⁠⁠ — Production and marketing by ⁠⁠⁠⁠⁠⁠⁠⁠https://penname.co/⁠⁠⁠⁠⁠⁠⁠⁠. For inquiries about sponsoring the podcast, email [email protected].

Get full access to The Pragmatic Engineer at newsletter.pragmaticengineer.com/subscribe

DataTalks.Club 4th Anniversary AMA Podcast – Alexey Grigorev and Johanna Bayer

2024-10-26 · DataTalks.Club Listen

podcast_episode

by Alexey Grigorev (DataTalks.Club)

AI/ML Data Science HTML LLM

We talked about:

00:00 DataTalks.Club intro

00:00 DataTalks.Club anniversary "Ask Me Anything" event with Alexey Grigorev

02:29 The founding of DataTalks .Club

03:52 Alexey's transition from Java work to DataTalks.Club

04:58 Growth and success of DataTalks.Club courses

12:04 Motivation behind creating a free-to-learn community

24:03 Staying updated in data science through pet projects

26 :37 Hosting a second podcast and maintaining programming skills

28:56 Skepticism about LLMs and their relevance

31:53 Transitioning to DataTalks.Club and personal reflections

33:32 Memorable moments and the first event's success

36:19 Community building during the pandemic

38:31 AI's impact on data analysts and future roles

42:24 Discussion on AI in healthcare

44:37 Age and reflections on personal milestones

47:54 Building communities and personal connections

49:34 Future goals for the community and courses

51:18 Community involvement and engagement strategies

53:46 Ideas for competitions and hackathons

54:20 Inviting guests to the podcast

55:29 Course updates and future workshops

56:27 Podcast preparation and research process

58:30 Career opportunities in data science and transitioning fields

1:01 :10 Book recommendations and personal reading experiences

About the speaker:

Alexey Grigorev is the founder of DataTalks.Club.

Join our slack: https://datatalks.club/slack.html

DataOps, Observability, and The Cure for Data Team Blues - Christopher Bergh

2024-08-15 · DataTalks.Club Listen

podcast_episode

by Johanna Berer (DataTalks.Club) , Christopher Bergh (DataKitchen)

Agile/Scrum AI/ML Analytics Big Data Chef Cloud Computing Data Engineering Data Science DataOps DevOps Hadoop Microsoft +1 more