OpenML.org is an online machine learning platform where researchers can access open data, download and upload data sets, share their machine learning tasks and experiments and organize them online to work and collaborate with other researchers. The R interface allows to query for data sets with specific properties, and allows the downloading and uploading of data sets, tasks, flows and runs.

For more information, have a look at our

How to cite

To cite the OpenML R package in publications, please use our paper entitled OpenML: An R Package to Connect to the Machine Learning Platform OpenML [BibTex]

See also here how to cite the OpenML project itself.

Installation of the package

  • Install the stable version from CRAN
install.packages("OpenML")

or

  • Install the development version from GitHub (using devtools)
devtools::install_github("openml/openml-r")

Furthermore, you need farff installed to process ARFF files:

Alternatively you can make use of the RWeka R package to process ARFF files. However, in particular for larger ARFF files, farff is considerably faster than RWeka.

Contact

Found some nasty bugs? Please use the issue tracker to report on bugs or missing features. Pay attention to explain the problem as good as possible (in the best case with a traceback() result and a sessionInfo()). Moreover, a reproducible example is desirable.