Éva Mújdricza-Maydt
Gutenberg corpus
- training set: 111* file pairs (80.6 MB)
- test set: 3 file pairs, with golden annotation (420 KB)
- training and test set: 114* file pairs (81.0 MB)
- readme
(* NOTE that one file pair in our training set cannot be provided due to copyright restrictions. See readme.)