Welcome to the Statistical Natural Language Processing Group at the Institute for Computational Linguistics at Heidelberg University. Our research is on the intersection of machine learning and natural language processing, with a special focus on interactive statistical learning techniques. For example, we work on interactive neural machine translation and neural question answering systems, where an artificial intelligence agent learns from human reinforcement/bandit feedback.

We organize the weekly Statistical NLP Colloquium.

group photo
StatNLP group @ Botanical Gardens (next to our department)

Latest news

Second edition of "Validity, Reliability, and Significance: Empirical Methods for NLP and Data Science" published

The monograph is published in the Synthesis Lectures on Human Language Technologies series by Springer. More info

New publication at EAMT2024

New research from the StatNLP group about Prompting Large Language Models with Human Error Markings for Self-Correcting Machine Translation will be presented at EAMT 2024. The paper is available here.

New publications at TACL, AAAI 2024 and EACL 2024

New research from the StatNLP group will be published at TACL, AAAI 2024 and EACL 2024.

New publication at ML4H 2023

New research from the StatNLP group on Validity problems in clinical machine learning by indirect data labeling using consensus definitions will be presented at ML4H 2023. The paper is available here.

New publication at IWSLT 2023

New research from the StatNLP group on Improving End-to-End Speech Translation by Imitation-Based Knowledge Distillation with Synthetic Transcripts will be presented at IWSLT 2023. The paper is available here.