OpenML builds a seamless, open ecosytem of machine learning data, models, and benchmarks - advancing AI openly for the benefit all of humanity.

1000s of data sets, uniformly formatted, easy to load, organized online

Models and pipelines automatically uploaded from machine learning libraries

Extensive APIs to integrate OpenML into your tools and scripts

Easily reproducible results (e.g. models, evaluations) for comparison and reuse

Stand on the shoulders of giants, and collaborate in real time

Make your work more visible and reusable

Built for automation: streamline your experiments and model building

How to use OpenML¶

OpenML is accessible to a wide range of people:

Explore the OpenML website to discover, download and upload ML resources.

Install an OpenML library to access and share resources programmatically through our APIs. Select one of the detailed guides in the top menu.

Get started by learning more about the structure and concepts behind OpenML, such as Datasets, Tasks, Flows, Runs, Benchmarking and much more. This will help you leverage OpenML even better in your work.

Contributing to OpenML¶

OpenML is an open source project, hosted on GitHub. We welcome everybody to help improve OpenML, and make it more useful for everyone. For more information on how to contribute, see the Contributing page.

We want to make machine learning and data analysis simple, accessible, collaborative and open with an optimal division of labour between computers and humans.

Want to get involved?¶

Awesome, we're happy to have you!

OpenML is dependent on the community. If you want to help, please email us (openmlHQ@googlegroups.com). If you feel already comfortable you can help by opening issues or make a pull request on GitHub. We also have regular workshops you can join (they are announced on openml.org).