study
openml.study
#
OpenMLBenchmarkSuite
#
OpenMLBenchmarkSuite(suite_id: int | None, alias: str | None, name: str, description: str, status: str | None, creation_date: str | None, creator: int | None, tags: list[dict] | None, data: list[int] | None, tasks: list[int] | None)
Bases: BaseStudy
An OpenMLBenchmarkSuite represents the OpenML concept of a suite (a collection of tasks).
It contains the following information: name, id, description, creation date, creator id and the task ids.
According to this list of task ids, the suite object receives a list of OpenML object ids (datasets).
Parameters#
suite_id : int the study id alias : str (optional) a string ID, unique on server (url-friendly) main_entity_type : str the entity type (e.g., task, run) that is core in this study. only entities of this type can be added explicitly name : str the name of the study (meta-info) description : str brief description (meta-info) status : str Whether the study is in preparation, active or deactivated creation_date : str date of creation (meta-info) creator : int openml user id of the owner / creator tags : list(dict) The list of tags shows which tags are associated with the study. Each tag is a dict of (tag) name, window_start and write_access. data : list a list of data ids associated with this study tasks : list a list of task ids associated with this study
Source code in openml/study/study.py
openml_url
property
#
The URL of the object on the server, if it was uploaded, else None.
open_in_browser
#
Opens the OpenML web page corresponding to this object in your default browser.
Source code in openml/base.py
publish
#
publish() -> OpenMLBase
Publish the object on the OpenML server.
Source code in openml/base.py
push_tag
#
remove_tag
#
url_for_id
classmethod
#
Return the OpenML URL for the object of the class entity with the given id.
OpenMLStudy
#
OpenMLStudy(study_id: int | None, alias: str | None, benchmark_suite: int | None, name: str, description: str, status: str | None, creation_date: str | None, creator: int | None, tags: list[dict] | None, data: list[int] | None, tasks: list[int] | None, flows: list[int] | None, runs: list[int] | None, setups: list[int] | None)
Bases: BaseStudy
An OpenMLStudy represents the OpenML concept of a study (a collection of runs).
It contains the following information: name, id, description, creation date, creator id and a list of run ids.
According to this list of run ids, the study object receives a list of OpenML object ids (datasets, flows, tasks and setups).
Parameters#
study_id : int the study id alias : str (optional) a string ID, unique on server (url-friendly) benchmark_suite : int (optional) the benchmark suite (another study) upon which this study is ran. can only be active if main entity type is runs. name : str the name of the study (meta-info) description : str brief description (meta-info) status : str Whether the study is in preparation, active or deactivated creation_date : str date of creation (meta-info) creator : int openml user id of the owner / creator tags : list(dict) The list of tags shows which tags are associated with the study. Each tag is a dict of (tag) name, window_start and write_access. data : list a list of data ids associated with this study tasks : list a list of task ids associated with this study flows : list a list of flow ids associated with this study runs : list a list of run ids associated with this study setups : list a list of setup ids associated with this study
Source code in openml/study/study.py
openml_url
property
#
The URL of the object on the server, if it was uploaded, else None.
open_in_browser
#
Opens the OpenML web page corresponding to this object in your default browser.
Source code in openml/base.py
publish
#
publish() -> OpenMLBase
Publish the object on the OpenML server.
Source code in openml/base.py
push_tag
#
remove_tag
#
url_for_id
classmethod
#
Return the OpenML URL for the object of the class entity with the given id.
attach_to_study
#
Attaches a set of runs to a study.
Parameters#
study_id : int OpenML id of the study
list (int)
List of entities to link to the collection
Returns#
int new size of the study (in terms of explicitly linked entities)
Source code in openml/study/functions.py
attach_to_suite
#
Attaches a set of tasks to a benchmarking suite.
Parameters#
suite_id : int OpenML id of the study
list (int)
List of entities to link to the collection
Returns#
int new size of the suite (in terms of explicitly linked entities)
Source code in openml/study/functions.py
create_benchmark_suite
#
create_benchmark_suite(name: str, description: str, task_ids: list[int], alias: str | None = None) -> OpenMLBenchmarkSuite
Creates an OpenML benchmark suite (collection of entity types, where the tasks are the linked entity)
Parameters#
name : str
the name of the study (meta-info)
description : str
brief description (meta-info)
task_ids : list
a list of task ids associated with this study
more can be added later with attach_to_suite
.
alias : str (optional)
a string ID, unique on server (url-friendly)
Returns#
OpenMLStudy A local OpenML study object (call publish method to upload to server)
Source code in openml/study/functions.py
create_study
#
create_study(name: str, description: str, run_ids: list[int] | None = None, alias: str | None = None, benchmark_suite: int | None = None) -> OpenMLStudy
Creates an OpenML study (collection of data, tasks, flows, setups and run), where the runs are the main entity (collection consists of runs and all entities (flows, tasks, etc) that are related to these runs)
Parameters#
benchmark_suite : int (optional)
the benchmark suite (another study) upon which this study is ran.
name : str
the name of the study (meta-info)
description : str
brief description (meta-info)
run_ids : list, optional
a list of run ids associated with this study,
these can also be added later with attach_to_study
.
alias : str (optional)
a string ID, unique on server (url-friendly)
benchmark_suite: int (optional)
the ID of the suite for which this study contains run results
Returns#
OpenMLStudy A local OpenML study object (call publish method to upload to server)
Source code in openml/study/functions.py
delete_study
#
delete_suite
#
detach_from_study
#
Detaches a set of run ids from a study.
Parameters#
study_id : int OpenML id of the study
list (int)
List of entities to unlink from the collection
Returns#
int new size of the study (in terms of explicitly linked entities)
Source code in openml/study/functions.py
detach_from_suite
#
Detaches a set of task ids from a suite.
Parameters#
suite_id : int OpenML id of the study
list (int)
List of entities to unlink from the collection
Returns#
int new size of the study (in terms of explicitly linked entities)
Source code in openml/study/functions.py
get_study
#
get_study(study_id: int | str, arg_for_backwards_compat: str | None = None) -> OpenMLStudy
Retrieves all relevant information of an OpenML study from the server.
Parameters#
study id : int, str study id (numeric or alias)
str, optional
The example given in arxiv.org/pdf/1708.03731.pdf uses an older version of the API which required specifying the type of study, i.e. tasks. We changed the implementation of studies since then and split them up into suites (collections of tasks) and studies (collections of runs) so this argument is no longer needed.
Returns#
OpenMLStudy The OpenML study object
Source code in openml/study/functions.py
get_suite
#
get_suite(suite_id: int | str) -> OpenMLBenchmarkSuite
Retrieves all relevant information of an OpenML benchmarking suite from the server.
Parameters#
study id : int, str study id (numeric or alias)
Returns#
OpenMLSuite The OpenML suite object
Source code in openml/study/functions.py
list_studies
#
list_studies(offset: int | None = None, size: int | None = None, status: str | None = None, uploader: list[str] | None = None, benchmark_suite: int | None = None) -> DataFrame
Return a list of all studies which are on OpenML.
Parameters#
offset : int, optional The number of studies to skip, starting from the first. size : int, optional The maximum number of studies to show. status : str, optional Should be {active, in_preparation, deactivated, all}. By default active studies are returned. uploader : list (int), optional Result filter. Will only return studies created by these users. benchmark_suite : int, optional
Returns#
datasets : dataframe Every dataset is represented by a dictionary containing the following information: - id - alias (optional) - name - benchmark_suite (optional) - status - creator - creation_date If qualities are calculated for the dataset, some of these are also returned.
Source code in openml/study/functions.py
list_suites
#
list_suites(offset: int | None = None, size: int | None = None, status: str | None = None, uploader: list[int] | None = None) -> DataFrame
Return a list of all suites which are on OpenML.
Parameters#
offset : int, optional The number of suites to skip, starting from the first. size : int, optional The maximum number of suites to show. status : str, optional Should be {active, in_preparation, deactivated, all}. By default active suites are returned. uploader : list (int), optional Result filter. Will only return suites created by these users.
Returns#
datasets : dataframe Every row is represented by a dictionary containing the following information: - id - alias (optional) - name - main_entity_type - status - creator - creation_date
Source code in openml/study/functions.py
update_study_status
#
Updates the status of a study to either 'active' or 'deactivated'.
Parameters#
study_id : int The data id of the dataset status : str, 'active' or 'deactivated'