Ruprecht-Karls-Universität Heidelberg
Institut für Computerlinguistik

Deriving Information from Texts: Textual Entailment

Course Description

Degree programme      Module code             Credit points
BA-2010               AS-CL, AS-FL            8 LP
BA-2010[100%|75%]     CS-CL                   6 LP
BA-2010[50%]          BS-CL                   6 LP
BA-2010[25%]          BS-AC, BS-FL            4 LP
Master                SS-CL-TAC, SS-SC-FAL    8 LP
Lecturer              Kurt Eberle
Course type           Proseminar/Hauptseminar
Language              German/English
First session         10.02.2025
Last session          14.02.2025
Time and place        09:00 - 14:30, INF 325 / SR24
Commitment deadline   tba

Note: Registration for the seminar is required by 14.01.2025!

By email or in Moodle.

Participants

All advanced Bachelor students and all Master students. Students of Computer Science, Mathematics, or Scientific Computing with Computational Linguistics as their application area (Anwendungsgebiet) are welcome.

Prerequisites for Participation

Introduction to Computational Linguistics or similar introductory courses

Basic knowledge of:

  • Logic (Logische Grundlagen für die Computerlinguistik) and
  • Formal semantics (logical representations of texts, Discourse Representation Theory or similar)

Assessment

  • Paper presentation (4/2 LP)
  • Written Exam (4 LP)

Content

When does one text follow from another? In traditional formal semantics, this is checked with the help of semantic representations: the representations of the two texts are computed, typically within a modeling framework based on Montague grammar, and it is then checked whether the representation of the second text can be inferred from that of the first using the deduction rules of the system.
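The representation-based check described above can be sketched as follows. This is only a toy illustration under strong assumptions: it uses propositional logic rather than a Montague-style system, the tuple encoding of formulas is invented for this example, and entailment is decided by enumerating truth assignments (a semantic check) instead of applying deduction rules.

```python
from itertools import product

# Hypothetical toy encoding of formulas (not a Montague-style representation):
#   "p"                 atomic proposition
#   ("not", f)          negation
#   ("and", f, g)       conjunction
#   ("or", f, g)        disjunction
#   ("implies", f, g)   material implication


def atoms(formula):
    """Collect the names of all atomic propositions in a formula."""
    if isinstance(formula, str):
        return {formula}
    return set().union(*(atoms(sub) for sub in formula[1:]))


def evaluate(formula, model):
    """Evaluate a formula under a model (dict from atom name to bool)."""
    if isinstance(formula, str):
        return model[formula]
    op, *args = formula
    if op == "not":
        return not evaluate(args[0], model)
    if op == "and":
        return evaluate(args[0], model) and evaluate(args[1], model)
    if op == "or":
        return evaluate(args[0], model) or evaluate(args[1], model)
    if op == "implies":
        return (not evaluate(args[0], model)) or evaluate(args[1], model)
    raise ValueError(f"unknown operator: {op}")


def entails(text, hypothesis):
    """True if every model satisfying `text` also satisfies `hypothesis`."""
    names = sorted(atoms(text) | atoms(hypothesis))
    for values in product([False, True], repeat=len(names)):
        model = dict(zip(names, values))
        if evaluate(text, model) and not evaluate(hypothesis, model):
            return False  # counter-model: text holds, hypothesis does not
    return True


# Text: "If it rains, the street is wet. It rains."
text = ("and", ("implies", "rain", "wet"), "rain")
print(entails(text, "wet"))    # True: the hypothesis follows
print(entails("wet", "rain"))  # False: a wet street does not imply rain
```

Enumeration is exponential in the number of atoms and does not carry over to first-order representations, where a theorem prover or model builder would be needed instead.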

It has long been known that the statements people typically derive from texts often go beyond what logical inference can deliver. Formal semantics has attempted to model such phenomena via 'abduction', various 'default logics', and related mechanisms.
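To make the idea of defeasible inference concrete, here is a heavily simplified sketch of a single default rule. It is not a full default logic; the predicate names and the tuple encoding of facts are assumptions made purely for illustration.

```python
def default_conclusions(facts):
    """Close a set of (predicate, individual) facts under one strict rule
    and one default rule (illustrative only)."""
    conclusions = set(facts)

    # Strict rule: penguins are birds and do not fly.
    for predicate, individual in facts:
        if predicate == "penguin":
            conclusions.add(("bird", individual))
            conclusions.add(("not_flies", individual))

    # Default rule: a bird flies unless it is known not to fly.
    for predicate, individual in list(conclusions):
        if predicate == "bird" and ("not_flies", individual) not in conclusions:
            conclusions.add(("flies", individual))

    return conclusions


# With only "Tweety is a bird", the default conclusion is drawn ...
print(("flies", "tweety") in default_conclusions({("bird", "tweety")}))
# ... but adding "Tweety is a penguin" withdraws it.
print(("flies", "tweety") in default_conclusions(
    {("bird", "tweety"), ("penguin", "tweety")}))
```

The point of the sketch is non-monotonicity: adding information can remove a conclusion, which classical deduction never does.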

Since more and more phenomena in computational linguistics have been successfully modeled with machine learning (ML) approaches, such methods have also been tried for semantic inference. In this sense, 'textual entailment' means 'learning human inference behavior from data', where the data consist of pairs of (short) texts and even shorter statements (hypotheses), each annotated with a judgement about whether the hypothesis follows from the text.
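The data format described above, and the idea of learning a decision from it, can be sketched as follows. The word-overlap feature and the threshold learner are a deliberately naive baseline chosen for illustration, and the example pairs are invented; none of this is taken from a specific system or corpus discussed in the seminar.

```python
from dataclasses import dataclass


@dataclass
class EntailmentPair:
    text: str
    hypothesis: str
    entailed: bool  # human judgement: does the hypothesis follow?


def overlap_score(text, hypothesis):
    """Fraction of hypothesis tokens that also occur in the text."""
    text_tokens = set(text.lower().split())
    hyp_tokens = hypothesis.lower().split()
    if not hyp_tokens:
        return 0.0
    return sum(tok in text_tokens for tok in hyp_tokens) / len(hyp_tokens)


def fit_threshold(pairs):
    """Pick the overlap cut-off with the highest accuracy on `pairs`."""
    scores = [overlap_score(p.text, p.hypothesis) for p in pairs]

    def accuracy(threshold):
        hits = sum((s >= threshold) == p.entailed for s, p in zip(scores, pairs))
        return hits / len(pairs)

    return max(sorted(set(scores)), key=accuracy)


def predict(text, hypothesis, threshold):
    """Predict entailment if enough hypothesis words appear in the text."""
    return overlap_score(text, hypothesis) >= threshold


# Invented toy training data (not from any corpus).
train = [
    EntailmentPair("a man is playing a guitar on stage", "a man is playing a guitar", True),
    EntailmentPair("two dogs run across a field", "two dogs run", True),
    EntailmentPair("a woman reads a book in the park", "a woman is sleeping", False),
    EntailmentPair("the cat sits on the mat", "the dog barks loudly", False),
]
threshold = fit_threshold(train)

# A purely lexical feature ignores word order, so this pair is wrongly accepted.
print(predict("the dog bit the man", "the man bit the dog", threshold))
```

The last call shows why such shallow features are limited: the hypothesis reuses every word of the text, so the baseline predicts entailment although the meaning is reversed.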

In the seminar we will look at various methods that have been proposed for deriving semantic information from texts and for relating corresponding pieces of information to each other, contrasting traditional rule-based methods with modern data-driven methods.

We will start with a brief review of different types of deep and shallow text representations and an overview of the corresponding representation-specific inference phenomena. We then consider the generation of semantic representations, contrasting formal with ML-based means. The main part of the seminar will be devoted to a range of entailment approaches with and without representations, with an emphasis on recent ML-based approaches, including methods that use large language models such as BERT or ChatGPT.

The goal is to gain a good overview of recent work on the topic, based on a good formal understanding of the phenomena.

Tentative Agenda

Monday
9:15 Introduction: Organisation, Motivation: text entailment, text and hypothesis, representations (do we need representations?), program, Pascal Development Testsuite (PDT) (Dagan, Glickman, Magnini 2006). Slides: Intro
11:00 Types of Inferences: Entailment, conventional and conversational implicature, and the Pascal Development Suite (Zaenen, Karttunen, Crouch 2005): what is in the PDT and what should be there? Slides: Intro
13:00 Semantic Representations: TE with deep vs. shallow representations: predicate logic vs. 'light-weight semantics' (Blackburn, Bos 2003) vs. (Monz, de Rijke 2001). Slides: Intro
Tuesday
Semantic construction, corpora, TE systems
Wednesday
TE with rules vs TE with features and statistics, neural nets
Thursday
Approaches with NNs, different models
Friday
Others?, difficulties, discourse relations, summary

Literature

(Overview; only a small part of this list will be discussed in the seminar.)

List for Download

1) Phenomena and Representations
Blackburn, Bos 2003 Computational Semantics. Theoria 18(1): 27-45 pdf
Condoravdi, Crouch et al. 2003 Entailment, intensionality and text understanding
de Marneffe, Rafferty, Manning 2008 Finding contradictions in text
Sánchez Valencia 1991 Studies on Natural Logic and Categorial Grammar. Ph.D. thesis, Univ. of Amsterdam
Fyodorov, Winter, Francez 2000 A natural logic inference system
Lakoff 1970 Linguistics and Natural Logic
MacCartney Manning 2008a Natural Logic for Textual Inference pdf
MacCartney Manning 2008b Modeling semantic Containment and Exclusion in Natural Language Inference pdf
MacCartney Manning 2009 An extended model of natural logic
Monz, de Rijke 2001 Light-Weight Entailment Checking for Computational Semantics
Pinkal M., 2007 Seminar on Entailment
Zaenen, Karttunen, Crouch 2005 Local Textual Inference: Can it be Defined or Circumscribed? in: ACL 2005 pdf
2) Semantic Parsing
Conneau, A et al. 2017 Supervised learning of universal sentence representations from natural language inference data
Liang Potts 2014 Bringing machine learning and compositional semantics together pdf
Liang 2016 Learning executable semantic parsers for natural language understanding pdf
Zettlemoyer, Collins 2012 Learning to Map Sentences to Logical Form: Structured Classification with Probabilistic Categorial Grammars
3) Corpora, (Parallel) Meaning Banks
https://aclweb.org/aclwiki/Textual_Entailment_Resource_Pool
Abzianidze et al. 2017 The Parallel Meaning Bank: Towards a Multilingual Corpus of Translations Annotated with Compositional Meaning Representations pdf
Banarescu et al. 2013 Abstract Meaning Representation for sembanking
Bentivogli et al. 2010 Building Textual Entailment Specialized Data Sets
Bos et al. 2017 The Groningen Meaning Bank
Bowman, Angeli, Potts, Manning, 2015 A large annotated corpus for learning natural language inference
Cooper et al. 1996 The FraCaS Textual Inference Problem Set
Williams et al. 2018 A Broad-Coverage Challenge Corpus for Sentence Understanding through Inference
4) Textual Entailment
Androutsopoulos, Malakasiotis 2010 A Survey of Paraphrasing and Textual Entailment Methods
Bentivogli et al. 2009 The Fifth PASCAL Recognizing Textual Entailment Challenge
Bos, Markert 2005 Recognising textual entailment with logical inference. In: HLT/EMNLP 2005
Bos, Markert 2005 Recognising textual entailment with robust logical inference. In: MLCW, p. 404
Bos, Markert 2006 Combining Shallow and Deep NLP Methods for Recognizing Textual Entailment
Bowman, Gauthier, Rastogi, Gupta, Manning, Potts 2016 A fast unified model for parsing and sentence understanding
Burchardt, A. Modeling Textual Entailment with Role-Semantic Information
Cabrio, Magnini 2009 Defining specialized entailment engines using natural logic relations. in Vetulani: Human Language Technology
Cabrio, Magnini 2010 Toward Qualitative Evaluation of Textual Entailment Systems
Chambers et al. 2007 Learning Alignments and Leveraging Natural Logic
Chen, Zhu, Ling, Wei, Jiang, Inkpen 2017 Enhanced LSTM for natural language inference pdf
Dagan, Dolan et al. 2009 Recognizing Textual Entailment: Rationale, evaluation and approaches
Dagan, Glickman 2004 Probabilistic Textual Entailment: Generic Applied Modeling of Language Variability
Dagan, Glickman, Magnini 2006 The PASCAL recognising textual entailment challenge. In MLCW pp 177-190 pdf
de Salvo Braz et al. 2005 An Inference Model for Semantic Entailment in Natural Language
Kouylekov Magnini 2005 Tree-Edit Distance for Textual Entailment
Lien, Kouylekov 2015 Semantic Parsing for Textual Entailment pdf
MacCartney 2009 Natural language inference
MacCartney et al. 2006 Learning to recognize features of valid textual entailments pdf
Marelli et al. 2014 SemEval-2014 Task 1: Evaluation of compositional distributional semantic models on full sentences through semantic relatedness and textual entailment
Mirkin, Dagan, Padó 2010 Assessing the Role of Discourse References in Entailment Inference pdf
Monz 1999 Contextual Inference in Computational Semantics
Monz, de Rijke 2001 Light-Weight Entailment Checking for Computational Semantics pdf
Mou, Men, Li, Xu, Zhang, Yan, Jin 2016 Natural language inference by tree-based convolution and heuristic matching pdf
Nangia, Williams, Lazaridou, Bowman 2017 The RepEval 2017 Shared Task: Multi-genre natural language inference with sentence representations. repeval01 pdf
Nairn et al. 2006 Computing Relative Polarity for Textual Entailment
Pazienza, Pennacchiotti, Zanzotto 2005 Learning Textual Entailment on a Distance Feature Space. In: MLCW, pp. 240-260
Parikh, Täckström, Das, Uszkoreit 2016 A decomposable attention model for natural language inference pdf
Pérez, Alfonseca 2005 Using Bleu-like Algorithms for the Automatic Recognition of Entailment in: MLCW, p 191 et seq.
Quiñonero-Candela, Dagan et al. 2005 Machine Learning Challenges: Evaluating Predictive Uncertainty, Visual Object Classification and Recognizing Textual Entailment. First PASCAL Machine Learning Challenges Workshop, MLCW 2005 (MLCW)

Sha et al. 2016 Recent Advances in Natural Language Inference: A Survey of Benchmarks, Resources, and Approaches Link
Storks, Gao, Chai 2020 Recent Advances in Natural Language Inference: A Survey of Benchmarks, Resources, and Approaches Link
Szpektor et al. 2007 Instance-based Evaluation of Entailment Rule Acquisition
TE References 2017 ACLWeb list of references
Vanderwende, Dolan 2005 What Syntax Can Contribute in the Entailment Task. In: MLCW, p. 205 et seq.
Wang, Jiang 2016 Learning natural language inference with LSTM pdf
Williams et al. 2018 A Broad-Coverage Challenge Corpus for Sentence Understanding through Inference
4a) Textual Entailment with LLMs
BERT (and variants) Chronologically ordered papers
Tuteja, H. 2019 Textual Entailment using BERT (implementation on GitHub) Link
Verma, Dh. 2021 Fine-tuning pre-trained transformer models for sentence entailment. A PyTorch and Hugging Face implementation of fine-tuning BERT on the MultiNLI dataset Link
Wehnert et al. 2022 Applying BERT Embeddings to Predict Legal Textual Entailment Link
Alsuhaibani, M. 2023 Deep Learning-based Sentence Embeddings using BERT for Textual Entailment Link
Arakelyan et al. 2024 Semantic Sensitivities and Inconsistent Predictions: Measuring the Fragility of NLI Models
Llama, Mistral
Madaan et al. 2024 Lost in Inference: Rediscovering the Role of Natural Language Inference for Large Language Models Link
(Chat)GPT
Blair-Stanek A. et al. 2023 Can GPT-3 Perform Statutory Reasoning? Link
Katz, D.M et al. 2023 GPT-4 passes the Bar Exam Link
Laskar et al. 2023 A Systematic Study and Comprehensive Evaluation of ChatGPT on Benchmark Datasets Link
Luo et al. 2023 ChatGPT as a Factual Inconsistency Evaluator for Text Summarization Link
Nguyen et al. 2023a Beyond logic programming for legal reasoning Link
Nguyen et al. 2023b Black-Box Analysis: GPTs Across Time in Legal Textual Entailment Task Link
Nguyen et al. 2023c How well do SOTA Legal Reasoning models support Abductive reasoning? Link
OpenAI 2023 GPT-4 Technical Report Link

» Further course materials