Solved – Data APIs/feeds available as packages in R

datasetrreferences

EDIT: The Web Technologies and Services CRAN task view contains a much more comprehensive list of data sources and APIs available in R. You can submit a pull request on github if you wish to add a package to the task view.


I'm making a list of the various data feeds that are already hooked into R or that are easy to setup. Here's my initial list of packages, and I was wondering what else I'm missing.

I'm trying to limit this list to "real time" or "close to real time" data feeds/APIs, where the underlying data might change between downloads. There's plenty of lists out there for static datasets, which only require one download.

This list is currently biased towards financial/time series data, and I could use some help expanding it into other domains.

Free Data:
Data Source – Package
Google Finance historical dataquantmod
Google Finance balance sheets – quantmod
Yahoo Finance historical data – quantmod
Yahoo Finance historical data – tseries
Yahoo Finance current options chain – quantmod
Yahoo Finance historical analyst estimatesfImport
Yahoo Finance current key stats – fImport – seems to be broken
OANDA historic exchange rates/metal prices – quantmod
FRED historic macroeconomic indicators – quantmod
World Bank historic macroeconomic indicatorsWDI
Google Trends historic search volume dataRGoogleTrends
Google Docs – RGoogleDocs
Google Storage – RGoogleStorage
TwittertwitteR
ZillowZillow
New York TimesRNYTimes
US Census 2000UScensus2000
infochimpsinfochimps
datamarketrdatamarket – requires free account
Factual.comfactualR
Geocode addresses – RDSTK
Map coordinates to political boundaries – RDSTK
Weather UndergroundRoll your own
Google NewsRoll your own
Earth Sciences netCDF DataRoll your own
Climate DataRoll your own
Public health data – Roll your own
OAI Harvester – Open Archives Initiative harvester
RAmazonS3 – S3 Amazon storage server
Rflikr – Flikr api

Requires a subscription:
BloombergRBloomberg
LIMLIM
Trades and Quotes from NYSERTAQ
Interactive BrokersIBrokers

Best Answer

  1. Instructions for using R to download netCDF data can be found here, a common format used for storing Earth science data, e.g. as in marine geospatial data from OpenEarth or climate model driver and forecasts from UCAR

  2. rnpn (under development) enables you to get data from the National Phenology Network - a citizen science project to track the timing of plant green-up, flowering, and senescence. See the developer's blog post.

  3. -obsolete- RClimate provides tools to download and manipulate flat-file climate data (with tutorials, including here-

  4. Download historical finance data with tseries::get.hist.quote

  5. Michael Samuel's documents downloading public health data

  6. raster::getData provides access to climate variables via worldclim