talk-data.com talk-data.com

X

Speaker

Xiao-Yi Li

2

talks

author

Frequent Collaborators

Filtering by: O'Reilly Data Science Books ×

Filter by Event / Source

Talks & appearances

Showing 2 of 2 activities

Search activities →
Data Mashups in R

How do you use R to import, manage, visualize, and analyze real-world data? With this short, hands-on tutorial, you learn how to collect online data, massage it into a reasonable form, and work with it using R facilities to interact with web servers, parse HTML and XML, and more. Rather than use canned sample data, you'll plot and analyze current home foreclosure auctions in Philadelphia. This practical mashup exercise shows you how to access spatial data in several formats locally and over the Web to produce a map of home foreclosures. It's an excellent way to explore how the R environment works with R packages and performs statistical analysis. Parse messy data from public foreclosure auction postings Plot the data using R's PBSmapping package Import US Census data to add context to foreclosure data Use R's lattice and latticeExtra packages for data visualization Create multidimensional correlation graphs with the pairs() scatterplot matrix package

Data Mashups in R

This article demonstrates how the realworld data is imported, managed, visualized, and analyzed within the R statistical framework. Presented as a spatial mashup, this tutorial introduces the user to R packages, R syntax, and data structures. The user will learn how the R environment works with R packages as well as its own capabilities in statistical analysis. We will be accessing spatial data in several formats-html, xml, shapefiles, and text-locally and over the web to produce a map of home foreclosure auctions and perform statistical analysis on these events.