talk-data.com talk-data.com

Filter by Source

Select conferences and events

People (93 results)

See all 93 →

Companies (1 result)

Showing 3 results

Activities & events

Title & Speakers Event
PyData Delhi Meetup #45 2025-04-19 · 08:30

Schedule (2:00 PM - 6:00 PM):

1. Empowering Businesses with Uptitude's AI and Data Solutions by Marcin Dobak

2. The impact of AI on the weakest link by Ben Johnson

  1. Model Context Protocol 101 by Raman Tehlan

4. Break and Networking

5. [BoF Session] How to Open Source by Sanket Verma

6. Hiring Pitches & Closing Notes

People who would like to volunteer for PyData Delhi, please stay for a while after the meetups end!

Note: Please keep your RSVP up to date; it is a very nice thing to do! Most likely, we won't be able to entertain more than the max RSVP limit!

Logistics:

• Venue map: https://maps.app.goo.gl/g9WoTAreAD3LknZh7

• Nearest Metro Station: Sector 55-56 (Rapid Metro Line)

• Please bring a valid government ID to gain entry in the venue premises

Doors open at 1:30 PM, and talks start at 2:00 PM.


Please, unRSVP if you realise you can't make it. We're limited by space on the number of attendees, so please free up your place for your fellow community members!

Follow @pydatadelhi (http://x.com/pydatadelhi) for updates and early announcements. See you at the meetup!

We're on Telegram. Join us for early updates and discussions here (https://t.me/+yVjqRjOUUL1iYjI1)

Share your Lightning Talks! A "lightning talk" is a quick mini-presentation (5 minutes maximum) on any Python, Julia, R, tips & tricks, caveats or personal project you'd like. We'll save time for 2-3 of these each month: to save a spot, post a comment with your talk's title/topic and create an issue at (https://github.com/pydatadelhi/talks/issues). Beginner topics are always welcome!

Presentations: If you'd like to share a topic with the group at an upcoming meetup, propose a talk here (https://github.com/pydatadelhi/talks/issues) :)

Contact: Message us through the message tab on the meetup page.

PyData Delhi Meetup #45
Ben Johnson – guest @ Timber.io

On today’s episode, we’re joined by Ben Johnson Founder, CEO of Particle41, a provider of software and product development solutions crafted by world-class app development, DevOps, and data science teams. We talk about:

What components the CTO owns in a SaaS companyOptimizing the efficiency of dev teamsHow much of the CTO role is internal vs. externalHow to interview & identify a great CTO candidate

Data Science DevOps SaaS
SaaS Scaled - Interviews about SaaS Startups, Analytics, & Operations
Luke Steensen – guest @ Timber.io , Ben Johnson – guest @ Timber.io , Tobias Macey – host

Summary The first stage in every data project is collecting information and routing it to a storage system for later analysis. For operational data this typically means collecting log messages and system metrics. Often a different tool is used for each class of data, increasing the overall complexity and number of moving parts. The engineers at Timber.io decided to build a new tool in the form of Vector that allows for processing both of these data types in a single framework that is reliable and performant. In this episode Ben Johnson and Luke Steensen explain how the project got started, how it compares to other tools in this space, and how you can get involved in making it even better.

Announcements

Hello and welcome to the Data Engineering Podcast, the show about modern data management When you’re ready to build your next pipeline, or want to test out the projects you hear about on the show, you’ll need somewhere to deploy it, so check out our friends at Linode. With 200Gbit private networking, scalable shared block storage, and a 40Gbit public network, you’ve got everything you need to run a fast, reliable, and bullet-proof data platform. If you need global distribution, they’ve got that covered too with world-wide datacenters including new ones in Toronto and Mumbai. And for your machine learning workloads, they just announced dedicated CPU instances. Go to dataengineeringpodcast.com/linode today to get a $20 credit and launch a new server in under a minute. And don’t forget to thank them for their continued support of this show! You listen to this show to learn and stay up to date with what’s happening in databases, streaming platforms, big data, and everything else you need to know about modern data management.For even more opportunities to meet, listen, and learn from your peers you don’t want to miss out on this year’s conference season. We have partnered with organizations such as O’Reilly Media, Dataversity, Corinium Global Intelligence, and Data Council. Upcoming events include the O’Reilly AI conference, the Strata Data conference, the combined events of the Data Architecture Summit and Graphorum, and Data Council in Barcelona. Go to dataengineeringpodcast.com/conferences to learn more about these and other events, and take advantage of our partner discounts to save money when you register today. Your host is Tobias Macey and today I’m interviewing Ben Johnson and Luke Steensen about Vector, a high-performance, open-source observability data router

Interview

Introduction How did you get involved in the area of data management? Can you start by explaining what the Vector project is and your reason for creating it?

What are some of the comparable tools that are available and what were they lacking that prompted you to start a new project?

What strategy are you using for project governance and sustainability? What are the main use cases that Vector enables? Can you explain how Vector is implemented and how the system design has evolved since you began working on it?

How did your experience building the business and products for Timber influence and inform your work on Vector? When you were planning the implementation, what were your criteria for the runtime implementation and why did you decide to use Rust? What led you to choose Lua as the embedded scripting environment?

What data format does Vector use internally?

Is there any support for defining and enforcing schemas?

In the event of a malformed message is there any capacity for a dead letter queue?

What are some strategies for formatting source data to improve the effectiveness of the information that is gathered and the ability of Vector to parse it into useful data? When designing an event flow in Vector what are the available mechanisms for testing the overall delivery and any transformations? What options are available to operators to support visibility into the running system? In terms of deployment topologies, what ca

AI/ML Big Data Data Engineering Data Management Rust Data Streaming
Data Engineering Podcast
Showing 3 results