Ruprecht-Karls-Universität Heidelberg

Éva Mújdricza-Maydt


Gutenberg corpus


This slightly differing version of the Bilingual Formal / Informal Address Corpus was used to train and test the CRF-based sentence aligner CRFalign.

(* NOTE that one file pair in our training set cannot be provided due to copyright restrictions. See readme.)

zum Seitenanfang