talk-data.com talk-data.com

Topic

web scraping

6

tagged

Activity Trend

1 peak/qtr
2020-Q1 2026-Q1

Activities

6 activities · Newest first

Async Python for Data Science: Speeding Up IO - Bound Workflows\nMost Python scripts in data science are synchronous — fetching one record at a time, waiting for APIs, or slowly scraping websites. In this talk, we’ll introduce Python’s asyncio ecosystem and show how it transforms IO - heavy data workflows. You'll see how httpx , aiofiles , and async constructs speed up tasks like web scraping and batch API calls. We’ll compare async vs threading, walk through a real - world case study, and wrap with performance benchmarks that demonstrate async's value.\nKeywords: p ython 3.x , AsyncIO, Web Scraping, API, Concurrency, Performance, Optimization

Most Python scripts in data science are synchronous — fetching one record at a time, waiting for APIs, or slowly scraping websites. In this talk, we’ll introduce Python’s asyncio ecosystem and show how it transforms IO-heavy data workflows. You'll see how httpx, aiofiles, and async constructs speed up tasks like web scraping and batch API calls. We’ll compare async vs threading, walk through a real-world case study, and wrap with performance benchmarks that demonstrate async's value.

This project aims to develop an AI-powered system that predicts the most cost-effective locations for users to book flights, ensuring they can access the cheapest possible prices. The project aims to generate a rich dataset generated through various web scraping techniques, querying flight data and prices from various locations. Utilizing a machine learning model, the system analyzes this comprehensive dataset to suggest optimal booking locations and predict potential savings for a user given a query.