talk-data.com

Topic: Python
Tags: programming_language, data_science, web_development
1446 tagged activities

Activity Trend: 185 peak/qtr (2020-Q1 to 2026-Q1)

Activities: 1446 activities · Newest first

Adam Weinstein is CEO and Co-Founder of Cursor. He previously worked at LinkedIn as a Senior Manager of Business Development and founded enGreet, a print-on-demand greeting card company that combined crowd-sourcing with social expressions. In this episode, he describes his data analytics company and offers insight into building a successful startup.


Shownotes

00:00 - Check us out on YouTube and SoundCloud!   

00:10 - Connect with Producer Steve Moore on LinkedIn & Twitter   

00:15 - Connect with Producer Liam Seston on LinkedIn & Twitter.   

00:20 - Connect with Producer Rachit Sharma on LinkedIn.

00:25 - Connect with Host Al Martin on LinkedIn & Twitter.   

00:55 - Connect with Adam Weinstein on LinkedIn.

03:55 - Find out more about Cursor.

06:45 - Learn more about Cursor's Co-Founder and CEO Adam Weinstein.

13:10 - Learn more about Big Data Analytics.

19:20 - What is Python/Jupyter Notebooks?

26:35 - Learn more about Data Fluency.

35:30 - What is a startup?

Want to be featured as a guest on Making Data Simple? Reach out to us at [email protected] and tell us why you should be next. The Making Data Simple Podcast is hosted by Al Martin, WW VP Technical Sales, IBM, where we explore trending technologies, business innovation, and leadership ... while keeping it simple & fun.

Machine Learning for Algorithmic Trading - Second Edition

Explore the intersection of machine learning and algorithmic trading with "Machine Learning for Algorithmic Trading" by Stefan Jansen. This comprehensive guide walks you through applying predictive modeling and data analysis to uncover financial signals and build systematic trading strategies. By the end, you'll be equipped to design and implement machine learning-driven trading systems.

What this Book will help me do
- Develop data-driven trading strategies using supervised, unsupervised, and reinforcement learning methods.
- Master techniques for extracting predictive features from market and alternative datasets.
- Gain expertise in backtesting and validating ML-based trading strategies in Python.
- Apply text analysis techniques like NLP to news articles and transcripts for financial insights.
- Optimize portfolio risk and returns using advanced Python libraries.

Author(s)
Stefan Jansen is a quantitative researcher and data scientist with extensive experience in developing algorithmic trading solutions. He specializes in leveraging machine learning to extract financial insights and optimize investment strategies. His practical approach to applying ML in trading is reflected in this comprehensive guide, helping readers navigate complex trading challenges.

Who is it for?
This book is crafted for Python developers, data scientists, and finance professionals looking to integrate machine learning into algorithmic trading. Ideal for those with a basic understanding of Python and ML principles, it guides readers in crafting data-driven trading strategies. It's especially useful for analysts aiming to harness diverse data types for financial applications.
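
To make the workflow concrete, here is a minimal sketch of the kind of pipeline the book describes: engineer lagged-return features, fit a classifier, and compare a long/flat signal against buy-and-hold. The price series is synthetic and the model choice is illustrative; it is not the book's own strategy or data.

```python
# Minimal sketch of an ML-driven trading signal; the price series is synthetic.
import numpy as np
import pandas as pd
from sklearn.ensemble import GradientBoostingClassifier

rng = np.random.default_rng(0)
prices = pd.Series(100 * np.exp(np.cumsum(rng.normal(0, 0.01, 1000))), name="close")
returns = prices.pct_change()

# Simple lagged-return features; real strategies use far richer factor sets.
features = pd.concat({f"ret_lag_{k}": returns.shift(k) for k in range(1, 6)}, axis=1)
target = (returns.shift(-1) > 0).astype(int)  # will the next bar close higher?

data = pd.concat([features, target.rename("up")], axis=1).dropna()
split = int(len(data) * 0.7)  # walk-forward style split: train only on the past
train, test = data.iloc[:split], data.iloc[split:]

model = GradientBoostingClassifier(random_state=0)
model.fit(train.drop(columns="up"), train["up"])

# Long when the model predicts an up move, flat otherwise.
signal = pd.Series(model.predict(test.drop(columns="up")), index=test.index)
strategy_ret = signal * returns.shift(-1).loc[test.index]
print("strategy cumulative return:", (1 + strategy_ret.fillna(0)).prod() - 1)
print("buy-and-hold cumulative return:", (1 + returns.loc[test.index].fillna(0)).prod() - 1)
```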

The Data Analysis Workshop

The Data Analysis Workshop teaches you how to analyze and interpret data to solve real-world business problems effectively. By working through practical examples and datasets, you'll gain actionable insights into modern analytic techniques and build your confidence as a data analyst.

What this Book will help me do
- Understand and apply fundamental data analysis concepts and techniques to tackle diverse datasets.
- Perform rigorous hypothesis testing and analyze group differences within datasets.
- Create informative data visualizations using Python libraries like Matplotlib and Seaborn.
- Understand and use correlation metrics to identify relationships between variables.
- Leverage advanced data manipulation techniques to uncover hidden patterns in complex datasets.

Author(s)
The authors, Gururajan Govindan, Shubhangi Hora, and Konstantin Palagachev, are experts in data science and analytics with years of experience in industry and academia. Their background includes performing business-critical analysis for companies and teaching students how to approach data-driven decision-making. They bring their depth of knowledge and engaging teaching styles together in this approachable guide.

Who is it for?
This book is intended for programmers with proficiency in Python who want to apply their skills to the field of data analysis. Readers who have a foundational understanding of coding and are eager to implement hands-on data science techniques will gain the most value. The content is also suitable for anyone pursuing a data-driven problem-solving mindset. This is an excellent resource to help transition from basic coding proficiency to applying Python in real-world data science.
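
As a flavor of the techniques listed above, here is a small sketch of a group comparison via a t-test plus a correlation check. The data is synthetic and exists only for illustration; the column names are made up.

```python
# Compare two groups with a t-test and inspect correlations on synthetic data.
import numpy as np
import pandas as pd
from scipy import stats

rng = np.random.default_rng(42)
df = pd.DataFrame({
    "group": np.repeat(["A", "B"], 200),
    "spend": np.concatenate([rng.normal(50, 10, 200), rng.normal(54, 10, 200)]),
    "visits": rng.poisson(5, 400),
})

# Hypothesis test: do groups A and B differ in average spend?
a = df.loc[df.group == "A", "spend"]
b = df.loc[df.group == "B", "spend"]
t_stat, p_value = stats.ttest_ind(a, b)
print(f"t = {t_stat:.2f}, p = {p_value:.4f}")

# Correlation between the numeric variables (Pearson by default).
print(df[["spend", "visits"]].corr())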

The Data Wrangling Workshop - Second Edition

The Data Wrangling Workshop is your beginner's guide to the essential techniques and practices of data manipulation using Python. Throughout the book, you will progressively build your skills, learning key concepts such as extracting, cleaning, and transforming data into actionable insights. By the end, you'll be confident in handling various data wrangling tasks efficiently.

What this Book will help me do
- Understand and apply the fundamentals of data wrangling using Python.
- Combine and aggregate data from diverse sources like web data, SQL databases, and spreadsheets.
- Use descriptive statistics and plotting to examine dataset properties.
- Handle missing or incorrect data effectively to maintain data quality.
- Gain hands-on experience with Python's powerful data science libraries like pandas, NumPy, and Matplotlib.

Author(s)
Brian Lipp, Shubhadeep Roychowdhury, and Dr. Tirthajyoti Sarkar are experienced educators and professionals in the fields of data science and engineering. Their collective expertise spans years of teaching and working with data technologies. They aim to make data wrangling accessible and comprehensible, focusing on practical examples to equip learners with real-world skills.

Who is it for?
The Data Wrangling Workshop is ideal for developers, data analysts, and business analysts aiming to become data scientists or analytics experts. If you're just getting started with Python, you will find this book guiding you step by step. A basic understanding of Python programming, as well as relational databases and SQL, is recommended for smooth learning.
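
A short sketch of the kind of wrangling steps described above: coerce a dirty column, handle missing values, then combine and aggregate two sources. The tables here are tiny stand-ins for the web, SQL, and spreadsheet sources the book covers.

```python
# Clean, combine, and aggregate two small stand-in tables with pandas.
import pandas as pd

orders = pd.DataFrame({
    "order_id": [1, 2, 3, 4],
    "customer": ["alice", "bob", "alice", None],
    "amount": ["10.5", "n/a", "7.25", "3.00"],   # dirty numeric column
})
customers = pd.DataFrame({
    "customer": ["alice", "bob"],
    "region": ["EU", "US"],
})

# Clean: coerce bad values to NaN, fill them, and drop rows with no customer.
orders["amount"] = pd.to_numeric(orders["amount"], errors="coerce")
orders["amount"] = orders["amount"].fillna(orders["amount"].median())
orders = orders.dropna(subset=["customer"])

# Combine and aggregate.
merged = orders.merge(customers, on="customer", how="left")
print(merged.groupby("region")["amount"].agg(["count", "sum", "mean"]))
```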

The Data Visualization Workshop

In "The Data Visualization Workshop," you will explore the fascinating world of data visualization and learn how to turn raw data into compelling visualizations that clearly communicate your insights. This book provides practical guidance and hands-on exercises to familiarize you with essential topics such as plotting techniques and interactive visualizations using Python. What this Book will help me do Prepare and clean raw data for visualization using NumPy and pandas. Create effective and visually appealing charts using libraries like Matplotlib and Seaborn. Generate geospatial visualizations utilizing tools like geoplotlib. Develop interactive visualizations for web integration with the Bokeh library. Apply visualization techniques to real-world data analysis scenarios, including stock data and Airbnb datasets. Author(s) Mario Döbler and Tim Großmann are experienced authors and professionals in the field of Python programming and data science. They bring a wealth of knowledge and practical insights to data visualization. Through their collaborative efforts, they aim to empower readers with the skills to create compelling data visualizations and uncover meaningful data narratives. Who is it for? This book is ideal for beginners new to data visualization, as well as developers and data scientists seeking to enhance their practical skills. It is approachable for readers without prior visualization experience but assumes familiarity with Python programming and basic mathematics. If you're eager to bring your data to life in insightful and engaging ways, this book is for you.

Learning ArcGIS Pro 2 - Second Edition

Learning ArcGIS Pro 2 is your comprehensive guide to mastering the capabilities of ArcGIS Pro for geospatial analysis and cartography. You'll learn to create both 2D and 3D maps, edit and visualize geospatial data, and automate workflows using Python and ModelBuilder. This book provides the foundational skills you need to effectively work with GIS data and projects.

What this Book will help me do
- Navigate the ArcGIS Pro interface to create, analyze, and share GIS projects efficiently.
- Visualize and interpret geographic data using 2D and 3D mapping techniques.
- Use the Arcade language to customize labels and symbology for better map clarity.
- Automate GIS workflows through Python scripts and ModelBuilder for increased efficiency.
- Create and share professional-quality map layouts and series with ease.

Author(s)
Tripp Corbin, GISP, is a GIS Professional with extensive experience in geographic data analysis and ArcGIS software. As a seasoned instructor and author, Tripp aims to make GIS accessible by breaking down complex topics into manageable concepts. His hands-on teaching approach is reflected throughout this book, providing clear guidance and practical knowledge.

Who is it for?
This book is ideal for beginner GIS enthusiasts or professionals looking to transition to ArcGIS Pro. It is well-suited for those with minimal exposure to GIS or no prior experience with ArcGIS software. Whether you aim to explore geospatial concepts or acquire skills for professional applications, this book provides a solid foundation.
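
For the automation point above, here is a very small, hedged sketch using arcpy, the Python site package that ships with ArcGIS Pro (it is not pip-installable). The geodatabase path is a placeholder, and this is only a generic inventory script, not an example from the book.

```python
# Inventory the feature classes in a (hypothetical) file geodatabase with arcpy.
import arcpy

arcpy.env.workspace = r"C:\GISData\project.gdb"  # placeholder workspace path

for fc in arcpy.ListFeatureClasses():
    count = int(arcpy.management.GetCount(fc).getOutput(0))
    desc = arcpy.Describe(fc)
    print(f"{fc}: {count} features, geometry type {desc.shapeType}")
```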

The Applied Data Science Workshop - Second Edition

Embark on an interactive journey into the world of data science with 'The Applied Data Science Workshop'. By following real-world scenarios and hands-on exercises, you will explore the fundamentals of data analysis and machine learning modeling within Jupyter Notebooks, leveraging Python libraries like pandas and scikit-learn to draw meaningful insights from data.

What this Book will help me do
- Master the process of setting up and using Jupyter Notebooks effectively for data science tasks.
- Learn to preprocess, analyze, and visualize data using Python libraries such as pandas, Matplotlib, and Seaborn.
- Discover methods to train and evaluate machine learning models using real-world data scenarios.
- Apply techniques to assess model performance and optimize them with advanced validation.
- Gain the skills to communicate insights through well-documented analyses and stakeholder-ready reports.

Author(s)
Alex Galea, an accomplished author in the data science domain, focuses on making technical concepts understandable and relatable. With this book, Galea leverages years of experience to introduce readers to practical applications of data science using Python. The author's approach ensures that readers not only learn the concepts but also apply them hands-on.

Who is it for?
This book caters to aspiring data scientists and developers interested in data analysis and practical applications of data science techniques. Beginners will find the step-by-step methodology approachable, while those with a basic understanding of Python programming or machine learning can quickly extend their skills. It suits anyone eager to apply data science in their professional toolbox.
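
The train-and-evaluate loop mentioned above looks roughly like the sketch below in a Jupyter Notebook, using scikit-learn's bundled breast cancer dataset as a stand-in for the book's scenarios.

```python
# Train, cross-validate, and evaluate a model with scikit-learn.
from sklearn.datasets import load_breast_cancer
from sklearn.ensemble import RandomForestClassifier
from sklearn.model_selection import cross_val_score, train_test_split
from sklearn.metrics import classification_report

X, y = load_breast_cancer(return_X_y=True, as_frame=True)
X_train, X_test, y_train, y_test = train_test_split(X, y, test_size=0.2, random_state=0)

model = RandomForestClassifier(n_estimators=200, random_state=0)

# Cross-validation gives a more robust estimate than a single split.
scores = cross_val_score(model, X_train, y_train, cv=5)
print("CV accuracy: %.3f +/- %.3f" % (scores.mean(), scores.std()))

model.fit(X_train, y_train)
print(classification_report(y_test, model.predict(X_test)))
```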

Learning Spark, 2nd Edition

Data is bigger, arrives faster, and comes in a variety of formats, and it all needs to be processed at scale for analytics or machine learning. But how can you process such varied workloads efficiently? Enter Apache Spark. Updated to include Spark 3.0, this second edition shows data engineers and data scientists why structure and unification in Spark matters. Specifically, this book explains how to perform simple and complex data analytics and employ machine learning algorithms. Through step-by-step walk-throughs, code snippets, and notebooks, you'll be able to:
- Learn Python, SQL, Scala, or Java high-level Structured APIs
- Understand Spark operations and the SQL engine
- Inspect, tune, and debug Spark operations with Spark configurations and the Spark UI
- Connect to data sources: JSON, Parquet, CSV, Avro, ORC, Hive, S3, or Kafka
- Perform analytics on batch and streaming data using Structured Streaming
- Build reliable data pipelines with open source Delta Lake and Spark
- Develop machine learning pipelines with MLlib and productionize models using MLflow
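
A brief PySpark sketch in the spirit of the Structured API and Spark SQL items above. The JSON path and column names are placeholders, not examples from the book.

```python
# The same aggregation expressed with the DataFrame API and with Spark SQL.
from pyspark.sql import SparkSession
from pyspark.sql import functions as F

spark = SparkSession.builder.appName("learning-spark-sketch").getOrCreate()

events = spark.read.json("events.json")  # hypothetical input file

summary = (
    events
    .filter(F.col("status") == "completed")
    .groupBy("country")
    .agg(F.count("*").alias("orders"), F.avg("amount").alias("avg_amount"))
    .orderBy(F.desc("orders"))
)
summary.show()

events.createOrReplaceTempView("events")
spark.sql("""
    SELECT country, COUNT(*) AS orders, AVG(amount) AS avg_amount
    FROM events
    WHERE status = 'completed'
    GROUP BY country
    ORDER BY orders DESC
""").show()
```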

Want to be featured as a guest on Making Data Simple? Reach out to us at [email protected] and tell us why you should be next.

Abstract

Hosted by Al Martin, VP, Data and AI Expert Services and Learning at IBM, Making Data Simple provides the latest thinking on big data, A.I., and the implications for the enterprise from a range of experts.

This week on Making Data Simple, we have Hadley Wickham, Chief Scientist at RStudio and an Adjunct Professor of Statistics at the University of Auckland, Stanford University, and Rice University. He builds tools that make data science easier and faster, including the famous tidyverse packages for the R programming language. He was named a Fellow by the American Statistical Association for "pivotal contributions to statistical practice through innovative and pioneering research in statistical graphics and computing".

Show Notes

2:39 – Hadley talks about his journey
5:22 – Hadley talks about his American Statistical Association Fellowship for "pivotal contributions to statistical practice"
8:00 – Tidy data concept
9:02 – How Hadley became interested in big data and R
10:12 – Python and R
12:30 – What Hadley is doing now
13:47 – Top 3 packages that help data scientists
17:47 – Hadley discusses his book
22:48 – Writing a book vs. code
29:40 – What language is going to take over
31:01 – What’s next for data
31:54 – What’s cool for Hadley
36:26 – Hadley’s role model

Hadley Wickham’s books: ggplot2, R for Data Science, Advanced R, R Packages.

Want to be featured as a guest on Making Data Simple? Reach out to us at [email protected] and tell us why you should be next. The Making Data Simple Podcast is hosted by Al Martin, WW VP Technical Sales, IBM, where we explore trending technologies, business innovation, and leadership ... while keeping it simple & fun.

Airflow does not currently have an explicit way to declare messages passed between tasks in a DAG. XComs are available but are hidden in execution functions inside the operator. AIP-31 proposes a way to make this message passing explicit in the DAG file and make it easier to reason about your DAG behaviour. In this talk, we will explore what other DSLs are doing for message passing and how that has influenced AIP-31. We will explore the motivations behind explicit message passing as well as further proposals that can be built on top of it. In addition, we will explore a new way to define custom Python transformations using the proposed task decorator, and how this change may improve the extensibility of Airflow for more experimental ETL use cases.
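
For context, here is a sketch of what decorator-based explicit message passing looks like in the form AIP-31 eventually shipped as (Airflow 2.x's TaskFlow API); at the time of the talk this was still a proposal, and the task names below are made up.

```python
# Return values flow between tasks as XComs, and the DAG file shows the message flow.
from datetime import datetime
from airflow.decorators import dag, task

@dag(schedule_interval=None, start_date=datetime(2021, 1, 1), catchup=False)
def explicit_messages():

    @task
    def extract() -> dict:
        return {"order_id": 42, "amount": 19.99}

    @task
    def transform(order: dict) -> float:
        return order["amount"] * 1.2  # e.g. add tax

    @task
    def load(total: float) -> None:
        print(f"total charged: {total:.2f}")

    load(transform(extract()))  # dependencies and messages are explicit here

explicit_messages()
```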

Having been a pioneer for the past 25 years, Sony PlayStation has played a vital role in the interactive gaming industry. With over 100 million monthly active users, more than 100 million PS4 console sales, and thousands of game development partners across the globe, a big-data problem is inevitable. This presentation talks about how we scaled Airflow horizontally, which has helped us build a stable, scalable, and optimal data processing infrastructure powered by Apache Spark, AWS ECS, EC2, and Docker. Driven by the demand for processing large volumes of data and the organization's growing data analytics and usage needs, the data team at PlayStation took the initiative to build an open source big data processing infrastructure with Apache Spark in Python as the core ETL engine. Apache Airflow is the core workflow management tool for the entire ecosystem. We started with an Airflow application running on a single AWS EC2 instance supporting a parallelism of 16 with 1 scheduler and 1 worker, and eventually scaled it to a bigger scheduler along with 4 workers to support a parallelism of 96, a DAG concurrency of 96, and a worker task concurrency of 24. Containerizing all the services on AWS ECS gave us the ability to scale Airflow horizontally.

Financial Times is increasing its digital revenue by allowing business people to make data-driven decisions. Providing an Airflow-based platform where data engineers, data scientists, BI experts, and others can run language-agnostic jobs was a huge swing. One of the most successful steps in the platform's development was building our own execution environment, allowing stakeholders to self-deploy jobs without cross-team dependencies, on top of the unlimited scale of Kubernetes. In this talk we share how we have integrated and extended Airflow at the Financial Times. The main topics we will cover include:
- Providing team-level security isolation
- Removing cross-team dependencies
- Creating an execution environment for independently creating and deploying R, Python, Java, Spark, and other jobs
- Reducing latency when sharing data between task instances
- Integrating all these features on top of Kubernetes
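
The talk does not spell out the implementation, but one common way to run language-agnostic, self-deployed jobs on Kubernetes from Airflow is one container per task via the KubernetesPodOperator. The images, namespace, and commands below are placeholders, and the import path varies with the provider package version.

```python
# Each team ships its own container image; Airflow only schedules and observes the pods.
from datetime import datetime
from airflow import DAG
# In newer cncf.kubernetes provider versions the module is ...operators.pod instead.
from airflow.providers.cncf.kubernetes.operators.kubernetes_pod import KubernetesPodOperator

with DAG("team_jobs", start_date=datetime(2021, 1, 1), schedule_interval="@daily", catchup=False) as dag:
    r_job = KubernetesPodOperator(
        task_id="r_forecast",
        name="r-forecast",
        namespace="team-analytics",            # team-level isolation via namespaces
        image="registry.example.com/team/r-forecast:latest",
        cmds=["Rscript", "forecast.R"],
        get_logs=True,
    )

    python_job = KubernetesPodOperator(
        task_id="python_aggregation",
        name="python-aggregation",
        namespace="team-analytics",
        image="registry.example.com/team/aggregation:latest",
        cmds=["python", "aggregate.py"],
        get_logs=True,
    )

    r_job >> python_job
```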

In this talk I will introduce a DAG authoring and editing tool for Airflow that we have built. Installed as a plugin, this tool allows users to author DAGs by composing existing operators and hooks with virtually no Python experience. We walk through a demo of DAG authorship and deployment, and spend time reviewing the underlying open-source standards used and the general approach that was taken to develop the code. In addition to allowing DAGs to be created in a visual editor, the underlying tech enables Airflow DAGs to be described declaratively in YAML or JSON. DAGs described this way can be saved in backing databases instead of Python files.
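
This is not the plugin's actual code, but the underlying idea can be illustrated with a small loader: the DAG is described as data (JSON here) and a generic script turns it into operators. The spec format, task ids, and commands are invented for illustration.

```python
# Build an Airflow DAG from a declarative JSON description.
import json
from datetime import datetime
from airflow import DAG
from airflow.operators.bash import BashOperator

spec = json.loads("""
{
  "dag_id": "declared_in_json",
  "tasks": [
    {"id": "extract", "bash_command": "echo extract"},
    {"id": "load", "bash_command": "echo load", "upstream": ["extract"]}
  ]
}
""")

with DAG(spec["dag_id"], start_date=datetime(2021, 1, 1), schedule_interval=None, catchup=False) as dag:
    tasks = {
        t["id"]: BashOperator(task_id=t["id"], bash_command=t["bash_command"])
        for t in spec["tasks"]
    }
    for t in spec["tasks"]:
        for upstream_id in t.get("upstream", []):
            tasks[upstream_id] >> tasks[t["id"]]
```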

At Bluevine we use Airflow to drive our ML platform. In this talk, Noam presents the challenges and gains we experienced transitioning from a single server running Python scripts with cron to a full-blown Airflow setup. This includes supporting multiple Python versions, event-driven DAGs, performance issues, and more! Some of the points covered are:
- Supporting multiple Python versions
- Event-driven DAGs
- Airflow performance issues and how we circumvented them
- Building Airflow plugins to enhance observability
- Monitoring Airflow using Grafana
- CI for Airflow DAGs (super useful!)
- Patching the Airflow scheduler

Slides
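
The talk lists "supporting multiple Python versions" as a challenge without prescribing a fix; one built-in option is the PythonVirtualenvOperator, which runs a callable inside a dedicated virtualenv so each task can pin its own interpreter and packages. The DAG id, version, and requirement below are examples, not Bluevine's setup.

```python
# Run a task in an isolated virtualenv with its own Python version and dependencies.
from datetime import datetime
from airflow import DAG
from airflow.operators.python import PythonVirtualenvOperator

def score_model():
    # Runs inside the virtualenv, so it may import packages pinned for this task only.
    import sklearn
    print("scoring with scikit-learn", sklearn.__version__)

with DAG("ml_scoring", start_date=datetime(2021, 1, 1), schedule_interval="@daily", catchup=False) as dag:
    score = PythonVirtualenvOperator(
        task_id="score_model",
        python_callable=score_model,
        requirements=["scikit-learn==1.1.3"],
        python_version="3.8",            # isolate the interpreter version per task
        system_site_packages=False,
    )
```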

Want to be featured as a guest on Making Data Simple? Reach out to us at [email protected] and tell us why you should be next.

Abstract

Hosted by Al Martin, VP, Data and AI Expert Services and Learning at IBM, Making Data Simple provides the latest thinking on big data, A.I., and the implications for the enterprise from a range of experts.

This week on Making Data Simple, we have Peter Wang, Co-Founder and CEO of Anaconda, and Shadi Copty, VP of Offering Management. Al, Peter, and Shadi discuss data science and IBM's partnership with Anaconda.

Show Notes

6:11 - Corporate mission
8:00 - Use case
9:20 - IBM and Anaconda partnership
14:04 - Cloud Pak for Data: what is it?
15:43 – Python vs R
17:15 – Anaconda's future
23:25 – Shadi takes over from Al
25:05 – Data science community
33:40 – Center for Humane Technology

Anaconda - https://www.linkedin.com/company/anacondainc/

Connect with the Team

Producer Kate Brown - LinkedIn.
Producer Meighann Helene - LinkedIn.
Producer Michael Sestak - LinkedIn.
Producer Steve Templeton - LinkedIn.
Host Al Martin - LinkedIn and Twitter.

Want to be featured as a guest on Making Data Simple? Reach out to us at [email protected] and tell us why you should be next. The Making Data Simple Podcast is hosted by Al Martin, WW VP Technical Sales, IBM, where we explore trending technologies, business innovation, and leadership ... while keeping it simple & fun.

Beginning Apache Spark Using Azure Databricks: Unleashing Large Cluster Analytics in the Cloud

Analyze vast amounts of data in record time using Apache Spark with Databricks in the cloud. Learn the fundamentals, and more, of running analytics on large clusters in Azure and AWS, using Apache Spark with Databricks on top. Discover how to squeeze the most value out of your data at a mere fraction of what classical analytics solutions cost, while at the same time getting the results you need, incrementally faster.

This book explains how the confluence of these pivotal technologies gives you enormous power, and cheaply, when it comes to huge datasets. You will begin by learning how cloud infrastructure makes it possible to scale your code to large numbers of processing units, without having to pay for the machinery in advance. From there you will learn how Apache Spark, an open source framework, can enable all those CPUs for data analytics use. Finally, you will see how services such as Databricks provide the power of Apache Spark, without you having to know anything about configuring hardware or software. By removing the need for expensive experts and hardware, your resources can instead be allocated to actually finding business value in the data.

This book guides you through some advanced topics such as analytics in the cloud, data lakes, data ingestion, architecture, machine learning, and tools, including Apache Spark, Apache Hadoop, Apache Hive, Python, and SQL. Valuable exercises help reinforce what you have learned.

What You Will Learn
- Discover the value of big data analytics that leverage the power of the cloud
- Get started with Databricks using SQL and Python in either Microsoft Azure or AWS
- Understand the underlying technology, and how the cloud and Apache Spark fit into the bigger picture
- See how these tools are used in the real world
- Run basic analytics, including machine learning, on billions of rows at a fraction of the cost, or for free

Who This Book Is For
Data engineers, data scientists, and cloud architects who want or need to run advanced analytics in the cloud. It is assumed that the reader has data experience, but perhaps minimal exposure to Apache Spark and Azure Databricks. The book is also recommended for people who want to get started in the analytics field, as it provides a strong foundation.
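
A small sketch of the Databricks-style workflow described above: ingest a file into a DataFrame, aggregate it, and query the same data with SQL. In a Databricks notebook the SparkSession already exists as `spark`; the storage path and column names are placeholders.

```python
# PySpark analytics that work the same on a laptop sample or a large cluster.
from pyspark.sql import SparkSession, functions as F

# Databricks notebooks provide `spark`; this line also makes the sketch runnable locally.
spark = SparkSession.builder.appName("databricks-sketch").getOrCreate()

sales = (
    spark.read
    .option("header", "true")
    .option("inferSchema", "true")
    .csv("/mnt/raw/sales.csv")   # placeholder cloud-storage path
)

by_region = (
    sales.groupBy("region")
    .agg(F.sum("amount").alias("revenue"), F.countDistinct("customer_id").alias("customers"))
)
by_region.show()

# Or register the data as a view and use plain SQL.
sales.createOrReplaceTempView("sales")
spark.sql("SELECT region, SUM(amount) AS revenue FROM sales GROUP BY region").show()
```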

Pro Power BI Desktop: Self-Service Analytics and Data Visualization for the Power User

Deliver eye-catching and insightful business intelligence with Microsoft Power BI Desktop. This new edition has been updated to cover all the latest features of Microsoft's continually evolving visualization product. New in this edition is help with storytelling, adapted to PCs, tablets, and smartphones, and the building of a data narrative. You will find coverage of templates and JSON style sheets, data model annotations, and the use of composite data sources. Also provided is an introduction to incorporating Python visuals and the much-awaited Decomposition Tree visual.

Pro Power BI Desktop shows you how to use source data to produce stunning dashboards and compelling reports that you mold into a data narrative to seize your audience's attention. Slice and dice the data with remarkable ease and then add metrics and KPIs to project the insights that create your competitive advantage. Convert raw data into clear, accurate, and interactive information with Microsoft's free self-service BI tool. This book shows you how to choose from a wide range of built-in and third-party visualization types so that your message is always enhanced. You will be able to deliver those results on PCs, tablets, and smartphones, as well as share results via the cloud. The book helps you save time by preparing the underlying data correctly without needing an IT department to prepare it for you.

What You Will Learn
- Deliver attention-grabbing information, turning data into insight
- Find new insights as you chop and tweak your data as never before
- Build a data narrative through interactive reports with drill-through and cross-page slicing
- Mash up data from multiple sources into a cleansed and coherent data model
- Build interdependent charts, maps, and tables to deliver visually stunning information
- Create dashboards that help in monitoring key performance indicators of your business
- Adapt delivery to mobile devices such as phones and tablets

Who This Book Is For
Power users who are ready to step up to the big leagues by going beyond what Microsoft Excel by itself can offer. The book also is for line-of-business managers who are starved for the actionable data needed to make decisions about their business. And the book is for BI analysts looking for an easy-to-use tool to analyze data and share results with the C-suite colleagues they support.
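
For the Python visuals mentioned above, a Power BI Python visual passes the fields you select to the script as a pandas DataFrame named `dataset` and renders whatever Matplotlib draws. This is a generic sketch, not from the book; the column names are assumptions, and the fallback block exists only so it also runs outside Power BI.

```python
# Script body for a Power BI Python visual: read `dataset`, draw a chart.
import pandas as pd
import matplotlib.pyplot as plt

if "dataset" not in dir():  # fallback for running the sketch outside Power BI
    dataset = pd.DataFrame({"Month": ["Jan", "Feb", "Mar", "Apr"],
                            "Sales": [120, 135, 128, 150]})

fig, ax = plt.subplots()
ax.bar(dataset["Month"], dataset["Sales"])
ax.set_title("Sales by month (Python visual)")
plt.show()
```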

Summary The landscape of data management and processing is rapidly changing and evolving. There are certain foundational elements that have remained steady, but as the industry matures new trends emerge and gain prominence. In this episode Astasia Myers of Redpoint Ventures shares her perspective as an investor on which categories she is paying particular attention to for the near to medium term. She discusses the work being done to address challenges in the areas of data quality, observability, discovery, and streaming. This is a useful conversation to gain a macro perspective on where businesses are looking to improve their capabilities to work with data.

Announcements

Hello and welcome to the Data Engineering Podcast, the show about modern data management.

What are the pieces of advice that you wish you had received early in your career of data engineering? If you hand a book to a new data engineer, what wisdom would you add to it? I’m working with O’Reilly on a project to collect the 97 things that every data engineer should know, and I need your help. Go to dataengineeringpodcast.com/97things to add your voice and share your hard-earned expertise.

When you’re ready to build your next pipeline, or want to test out the projects you hear about on the show, you’ll need somewhere to deploy it, so check out our friends at Linode. With their managed Kubernetes platform it’s now even easier to deploy and scale your workflows, or try out the latest Helm charts from tools like Pulsar to get you up and running in no time. With simple pricing, fast networking, S3 compatible object storage, and worldwide data centers, you’ve got everything you need to run a bulletproof data platform. Go to dataengineeringpodcast.com/linode today and get a $60 credit to try out a Kubernetes cluster of your own. And don’t forget to thank them for their continued support of this show!

You listen to this show because you love working with data and want to keep your skills up to date. Machine learning is finding its way into every aspect of the data landscape. Springboard has partnered with us to help you take the next step in your career by offering a scholarship to their Machine Learning Engineering career track program. In this online, project-based course every student is paired with a Machine Learning expert who provides unlimited 1:1 mentorship support throughout the program via video conferences. You’ll build up your portfolio of machine learning projects and gain hands-on experience in writing machine learning algorithms, deploying models into production, and managing the lifecycle of a deep learning prototype. Springboard offers a job guarantee, meaning that you don’t have to pay for the program until you get a job in the space. The Data Engineering Podcast is exclusively offering listeners 20 scholarships of $500 to eligible applicants. It only takes 10 minutes and there’s no obligation. Go to dataengineeringpodcast.com/springboard and apply today! Make sure to use the code AISPRINGBOARD when you enroll.

Your host is Tobias Macey and today I’m interviewing Astasia Myers about the trends in the data industry that she sees as an investor at Redpoint Ventures.

Interview

Introduction
How did you get involved in the area of data management?
Can you start by giving an overview of Redpoint Ventures and your role there?
From an investor perspective, what is most appealing about the category of data-oriented businesses?
What are the main sources of information that you rely on to keep up to date with what is happening in the data industry?

What is your personal heuristic for determining the relevance of any given piece of information to decide whether it is worthy of further investigation?

As someone who works closely with a variety of companies across different industry verticals and different areas of focus, what are some of the common trends that you have identified in the data ecosystem?
In your article covering the trends you are keeping an eye on for 2020 you call out four in particular: data quality, data catalogs, observability of what influences critical business indicators, and streaming data. Taking those in turn:

What are the driving factors that influence data quality, and what elements of that problem space are being addressed by the companies you are watching?

What are the unsolved areas that you see as being viable for newcomers?

What are the challenges faced by businesses in establishing and maintaining data catalogs?

What approaches are being taken by the companies who are trying to solve this problem?

What shortcomings do you see in the available products?

For gaining visibility into the forces that impact the key performance indicators (KPI) of businesses, what is lacking in the current approaches?

What additional information needs to be tracked to provide the needed context for making informed decisions about what actions to take to improve KPIs?
What challenges do businesses in this observability space face in providing useful access and analysis of this collected data?

Streaming is an area that has been growing rapidly over the past few years, with many open source and commercial options. What are the major business opportunities that you see to make streaming more accessible and effective?

What are the main factors that you see as driving this growth in the need for access to streaming data?

With your focus on these trends, how does that influence your investment decisions and where you spend your time?
What are the unaddressed markets or product categories that you see which would be lucrative for new businesses?
In most areas of technology now there is a mix of open source and commercial solutions to any given problem, with varying levels of maturity and polish between them. What are your views on the balance of this relationship in the data ecosystem?

For data in particular, there is a strong potential for vendor lock-in which can cause potential customers to avoid adoption of commercial solutions. What has been your experience in that regard with the companies that you work with?

Contact Info

@AstasiaMyers on Twitter
@astasia on Medium
LinkedIn

Parting Question

From your perspective, what is the biggest gap in the tooling or technology for data management today?

Closing Announcements

Thank you for listening! Don’t forget to check out our other show, Podcast.__init__, to learn about the Python language, its community, and the innovative ways it is being used.
Visit the site to subscribe to the show, sign up for the mailing list, and read the show notes.
If you’ve learned something or tried out a project from the show then tell us about it! Email [email protected] with your story.
To help other people find the show please leave a review on iTunes and tell your friends and co-workers.
Join the community in the new Zulip chat workspace at dataengineeringpodcast.com/chat

Links

Redpoint Ventures 4 Data Trends To Watch in 2020 Seagate Western Digital Pure Storage Cisco Cohesity Looker

Podcast Episode

DGraph

Podcast Episode

Dremio

Podcast Episode

SnowflakeDB

Podcast Episode

ThoughtSpot TIBCO Elastic Splunk Informatica Data Council DataCoral Mattermost Bitwarden Snowplow

Podcast Interview Interview About Snowplow Infrastructure

CHAOSSEARCH

Podcast Episode

Kafka Streams Pulsar

Podcast Interview Followup Podcast Interview

Soda Toro Great Expectations Alation Collibra Amundsen DataHub Netflix Metacat Marquez

Podcast Episode

LDAP == Lightweight Directory Access Protocol Anodot Databricks Flink

a…

Modern Data Mining Algorithms in C++ and CUDA C: Recent Developments in Feature Extraction and Selection Algorithms for Data Science

Discover a variety of data-mining algorithms that are useful for selecting small sets of important features from among unwieldy masses of candidates, or extracting useful features from measured variables. As a serious data miner you will often be faced with thousands of candidate features for your prediction or classification application, with most of the features being of little or no value. You'll know that many of these features may be useful only in combination with certain other features while being practically worthless alone or in combination with most others. Some features may have enormous predictive power, but only within a small, specialized area of the feature space. The problems that plague modern data miners are endless. This book helps you solve this problem by presenting modern feature selection techniques and the code to implement them. Some of these techniques are:
- Forward selection component analysis
- Local feature selection
- Linking features and a target with a hidden Markov model
- Improvements on traditional stepwise selection
- Nominal-to-ordinal conversion

All algorithms are intuitively justified and supported by the relevant equations and explanatory material. The author also presents and explains complete, highly commented source code. The example code is in C++ and CUDA C, but Python or other code can be substituted; the algorithm is important, not the code that's used to write it.

What You Will Learn
- Combine principal component analysis with forward and backward stepwise selection to identify a compact subset of a large collection of variables that captures the maximum possible variation within the entire set.
- Identify features that may have predictive power over only a small subset of the feature domain. Such features can be profitably used by modern predictive models but may be missed by other feature selection methods.
- Find an underlying hidden Markov model that controls the distributions of feature variables and the target simultaneously. The memory inherent in this method is especially valuable in high-noise applications such as prediction of financial markets.
- Improve traditional stepwise selection in three ways: examine a collection of 'best-so-far' feature sets; test candidate features for inclusion with cross validation to automatically and effectively limit model complexity; and at each step estimate the probability that our results so far could be just the product of random good luck. We also estimate the probability that the improvement obtained by adding a new variable could have been just good luck.
- Take a potentially valuable nominal variable (a category or class membership) that is unsuitable for input to a prediction model, and assign to each category a sensible numeric value that can be used as a model input.

Who This Book Is For
Intermediate to advanced data science programmers and analysts.
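
Since the blurb notes that Python can substitute for the book's C++/CUDA code, here is a plain forward stepwise selection with cross-validation using scikit-learn. This is the standard technique only, not the author's enhanced variants (best-so-far sets, luck-probability estimates), and the dataset is scikit-learn's bundled one.

```python
# Forward stepwise feature selection with cross-validation.
from sklearn.datasets import load_breast_cancer
from sklearn.feature_selection import SequentialFeatureSelector
from sklearn.linear_model import LogisticRegression
from sklearn.pipeline import make_pipeline
from sklearn.preprocessing import StandardScaler

X, y = load_breast_cancer(return_X_y=True, as_frame=True)

estimator = make_pipeline(StandardScaler(), LogisticRegression(max_iter=5000))
selector = SequentialFeatureSelector(
    estimator,
    n_features_to_select=5,      # keep a compact subset of the candidate features
    direction="forward",
    cv=5,                        # cross-validation guards against overfitting the selection
)
selector.fit(X, y)
print("selected features:", list(X.columns[selector.get_support()]))
```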

Spark in Action, Second Edition

The Spark distributed data processing platform provides an easy-to-implement tool for ingesting, streaming, and processing data from any source. In Spark in Action, Second Edition, you’ll learn to take advantage of Spark’s core features and incredible processing speed, with applications including real-time computation, delayed evaluation, and machine learning. Spark skills are a hot commodity in enterprises worldwide, and with Spark’s powerful and flexible Java APIs, you can reap all the benefits without first learning Scala or Hadoop.

About the Technology
Analyzing enterprise data starts by reading, filtering, and merging files and streams from many sources. The Spark data processing engine handles this varied volume like a champ, delivering speeds 100 times faster than Hadoop systems. Thanks to SQL support, an intuitive interface, and a straightforward multilanguage API, you can use Spark without learning a complex new ecosystem.

About the Book
Spark in Action, Second Edition, teaches you to create end-to-end analytics applications. In this entirely new book, you’ll learn from interesting Java-based examples, including a complete data pipeline for processing NASA satellite data. And you’ll discover Java, Python, and Scala code samples hosted on GitHub that you can explore and adapt, plus appendixes that give you a cheat sheet for installing tools and understanding Spark-specific terms.

What's Inside
- Writing Spark applications in Java
- Spark application architecture
- Ingestion through files, databases, streaming, and Elasticsearch
- Querying distributed datasets with Spark SQL

About the Reader
This book does not assume previous experience with Spark, Scala, or Hadoop.

About the Author
Jean-Georges Perrin is an experienced data and software architect. He is France’s first IBM Champion and has been honored for 12 consecutive years.

Quotes
"This book reveals the tools and secrets you need to drive innovation in your company or community." - Rob Thomas, IBM
"An indispensable, well-paced, and in-depth guide. A must-have for anyone into big data and real-time stream processing." - Anupam Sengupta, GuardHat Inc.
"This book will help spark a love affair with distributed processing." - Conor Redmond, InComm Product Control
"Currently the best book on the subject!" - Markus Breuer, Materna IPS