Mainly the training datasets will the data from CROHME 2014 and earlier competitions available online from IAPR TC-11: tc11.cvc.uab.es/datasets/CROHME-2014_2 and the test datasets will be new samples for the full expressions tasks and CROHME 2014 test sets for sub tasks. Indeed, a part of the ground-truth will be necessary available to do the tasks CROHME2016-Symbols and CROHME2016-Structure, thus we can not provide this ground-truth for the main task.
More details are given in the table below:
|CROHME2016-Formulas||CROHME 2014 Train set||CROHME 2014 Test set||New CROHME 2016 Test set|
|CROHME2016-Symbols||CROHME 2014 Train set||CROHME 2013 Test set||CROHME 2014 Test set|
|CROHME2016-Structure||CROHME 2014 Train set||CROHME 2013 Test set||CROHME 2014 Test set|
|CROHME2016-Matrices||CROHME 2014 Matrices Train set||CROHME 2014 Matrices Test set||New CROHME 2016 Matrices Test set|
Note that all previous CROHME datasets are available in the IAPR TC-11 package. The validation sets for tasks Symbols and Structure have been generated using the tools from CROHMELib.
For the first time, the CROHME Competition make available to participants a corpus of expressions which can allow to train Language Models. The source of this corpus is the Math Information Retrieval competition NTCIR-12 MathIR . We provide three corpus, from the more general to the more specific to CROHME tasks:
Data from CROHME 2014 and earlier competitions is available online from IAPR TC-11: tc11.cvc.uab.es/datasets/CROHME-2014_2.
For CROHME 2016 training data will be provided in the same InkML (XML) format used in previous competitions. These InkML files may be visualized using an online tool: saskatoon.cs.rit.edu/inkml_viewer/.
Recognition outputs will be in a Comma-Separated Variable (.csv) format representing a labeled graph over handwritten strokes (.lg).
for conversion between InkML and LG formats (CROHMELib) along with tools for evaluation and visualization (LgEval) will be provided on the competition web page.
Earlier versions of these tools are available online: www.cs.rit.edu/~dprl/Software.html.
The updated evaluation tools LgEval can be obtained online using:
git clone http://saskatoon.cs.rit.edu:10001/root/lgeval.git
The file format is exactly the same as during CROHME2014. Here are more precisions (which are consistent with CROHME2014):