Skip to content

Helper functions

Mostly written in Java, these functions build search indexes, compute dataset characteristics, generate tasks and evaluate the results of certain tasks.

Code

The Java code is available in the 'OpenML' repository: https://github.com/openml/OpenML/tree/master/Java

Components

General: - OpenML: Building Lucene search index and smaller tools, e.g. extracting documentation from WEKA source files and ARFF files - generateApiDocs: Generates API HTML Documentation - http_post_file: Example how to post files to the api using Java.

Support for tasks: - foldgeneration: Java code for generating cross-validation folds. Can be used from command line. - splitgeneration: Split generator for cross validation and holdout. Unsure what's the difference with the previous? - generate_predictions: Helper class to build prediction files based on WEKA output. Move to WEKA repository? - evaluate_predictions: The evaluation engine computing evaluation scores based on submitted predictions