CRAN package download logs

These log files contain all hits to http://cran.rstudio.com/ related to packages. The raw log files have been parsed into CSV and anonymised as described at the bottom of the page.

Daily package downloads

Each file contains the following variables:

  • date
  • time (in UTC)
  • size (in bytes)
  • r_version, version of R used to download package
  • r_arch (i386 = 32 bit, x86_64 = 64 bit)
  • r_os (darwin9.8.0 = mac, mingw32 = windows)
  • package
  • country, two-letter ISO country code. Geocoded from IP using MaxMind's free database
  • ip_id, a daily unique id assigned to each IP address
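As a rough illustration of working with these variables, the sketch below counts downloads per package for one day. The helper name count_downloads and the sample rows are made up for this example. It assumes each row of a log file is one download, and that a real file would be read with something like read.csv on the downloaded .csv.gz.

```r
# Hypothetical helper (not part of the logs or any package): count rows,
# i.e. downloads, per package in one day's log.
# `log` is a data frame with at least a `package` column.
count_downloads <- function(log) {
  counts <- sort(table(log$package), decreasing = TRUE)
  data.frame(
    package = names(counts),
    downloads = as.integer(counts),
    stringsAsFactors = FALSE
  )
}

# Made-up sample standing in for a parsed log file. With a real file you
# would probably start from something like:
#   log <- read.csv("2014-12-20.csv.gz", stringsAsFactors = FALSE)
sample_log <- data.frame(
  date = "2014-12-20",
  package = c("ggplot2", "plyr", "ggplot2", "Rcpp", "ggplot2", "plyr"),
  ip_id = c(1, 1, 2, 3, 3, 4),
  stringsAsFactors = FALSE
)

count_downloads(sample_log)
```

Because ip_id is only unique within a day, counts of distinct users should be computed per file rather than across files.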

Downloading from R

  # Here's an easy way to get all the URLs in R
  start <- as.Date('2012-10-01')
  today <- as.Date('2014-12-20')

  all_days <- seq(start, today, by = 'day')

  year <- as.POSIXlt(all_days)$year + 1900
  urls <- paste0('http://cran-logs.rstudio.com/', year, '/', all_days, '.csv.gz')
  # You can then use download.file to download into a directory.

  # If you only want to download the files you don't have, try:
  # Dates are converted to character so they can be compared with file names.
  missing_days <- setdiff(as.character(all_days), tools::file_path_sans_ext(dir(), TRUE))
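
The download step itself could look something like the sketch below. The function names log_urls, find_missing_days and download_missing are hypothetical, and the code assumes files are saved under their original names (for example 2014-12-20.csv.gz) in a single directory.

```r
# Build the log URL for each day, following the URL pattern used above.
log_urls <- function(days) {
  days <- as.Date(days)
  year <- format(days, "%Y")
  paste0("http://cran-logs.rstudio.com/", year, "/", days, ".csv.gz")
}

# Hypothetical helper: days with no matching file in `dest_dir`.
# Dates are compared as character strings against the file names
# with ".csv.gz" stripped.
find_missing_days <- function(days, dest_dir = ".") {
  have <- tools::file_path_sans_ext(dir(dest_dir), TRUE)
  setdiff(as.character(days), have)
}

# Hypothetical helper: download only the missing days into `dest_dir`.
download_missing <- function(days, dest_dir = ".") {
  todo <- find_missing_days(days, dest_dir)
  for (day in todo) {
    dest <- file.path(dest_dir, paste0(day, ".csv.gz"))
    # try() keeps one failed download from stopping the loop
    try(download.file(log_urls(day), dest, mode = "wb"))
  }
  invisible(todo)
}

# Example (not run here, needs network access):
#   download_missing(seq(as.Date("2014-12-01"), as.Date("2014-12-20"), by = "day"))
```

mode = "wb" is used so that the compressed files are written unchanged on Windows.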
  

Daily R downloads