Bilder vom Neuenheimer Feld, Heidelberg und der Universität Heidelberg

Lehrveranstaltungen
heiCO
Ressourcen	Fachschaft
Studien-FAQ	Technik-FAQ

Ableitung von Information aus Texten – Textual Entailment

Kursbeschreibung

Studiengang	Modulkürzel	Leistungs- bewertung
BA-2010	AS-CL, AS-FL	8 LP
BA-2010[100%\|75%]	CS-CL	6 LP
BA-2010[50%]	BS-CL	6 LP
BA-2010[25%]	BS-AC, BS-FL	4 LP
Master	SS-CL-TAC, SS-SC-FAL	8 LP

Dozent	Kurt Eberle
Veranstaltungsart	Proseminar/Hauptseminar
Sprache	Deutsch/Englisch
Erster Termin	10.02.2025
Letzter Termin	14.02.2025
Zeit und Ort	09:00 - 14:30, INF 325 / SR24
Commitment-Frist	tba.

**Achtung: Anmeldung zur Seminar-Teilnahme bis 14.01.2025!**

per email oder in: Moodle

Teilnehmerkreis/Participants

All advanced Bachelor students and all Master students. Students from Computer Science, Mathematics or Scientific computing with Anwendungsgebiet Computational Linguistics are welcome.

Teilnahmevoraussetzungen/Prerequisites for Participation

Introduction to Computational Linguistics or similar introductory courses

Basic knowledge in:

Logic (Logische Grundlagen für die Computerlinguistik) and
Formal Semantics (Logic representations of texts, Discourse Representation Theory or similar)

Leistungsnachweis/Assessment

Paper presentation (4/2 LP)
Written Exam (4 LP)

Inhalt/Content

When does a text follow from another? In traditional formal semantics, this is checked with the help of semantic representations: The representations of the two texts are computed, typically within the framework of a modeling system based on Montague grammar, and then it is checked whether the representation of the second text can be inferred from that of the first by using the deduction rules of the system.

It is not a new finding that statements that people typically derive from texts often go beyond what logical inference can deliver. Formal semantic attempts have been made to model such phenomena via 'abduction' and various 'default logics', etc.

Since more and more phenomena in computational linguistics have been successfully modeled with machine learning (ML) approaches, such methods have been tried for semantic inference also. In this sense, 'textual entailment' means 'learning human inference behavior from data'; from data that consist of pairs of (short) texts and even shorter statements (hypotheses) and judgements about whether the hypotheses follow from the texts respectively.

In the seminar we will take a look at various methods that have been suggested for deriving semantic information from texts and for relating corresponding pieces of information to each other; where we will contrast traditional rule-based methods with modern data-driven methods.

We will start with a brief review of different types of deep and shallow text representations and an overview of corresponding representation-specific inference phenomena. Then we consider the generation of semantic representations, where we contrast formal versus ML means. The main part of the seminar will then be devoted to a range of entailment approaches with and without representations, with an emphasis on recent ML-based approaches, including recent methods with large language models such as BERT or in ChatGPT

The goal is to gain a good overview of recent work on the topic, based on a good formal understanding of the phenomena.

Vorraussichtlicher Kursplan/Agenda

Montag			Slides
9.15	Introduction	Organisation, Motivation: Text entailment, text and hypothesis, representations(do we need reps?), Program, Pascal Development Testsuite (PDT) (Dagan,Glickman, Magnini 2006)	Intro
11.00	Types of Inferences	Entailment , Conventional & Conversational Implicature and the Pascal Development Suite (Zaenen, Karttunen, Crouch 2005): What is in PDT und what should be there?'	Intro
13.00	Sem. Representations	TE with deep vs shallow representation: Predicate logic vs 'Light-weight semantics' (Blackburn Bos 2003) vs (Monz de Rijke 2001)	Intro
Tuesday
	Semantic construction, corpora, TE systems
Wednesday
	TE with rules vs TE with features and statistics, neural nets
Thursday
	Approaches with NNs, different models
Friday
	Others?, difficulties, discourse relations, summary

Literatur/Literature

(Overview. Of course, only a rather small part of this will be discussed in the seminar)

List for Download

1)	Phenomena and Representations
Blackburn, Bos 2003	Computational Semantics. Theoria 18(1): 27-45	pdf
Condoravdi, Crouch et al. 2003	Entailment, intensionality and text understanding
de Marneffe, Rafferty, Manning 2008	Finding contradictions in text
Sánchez Valencia 1991	Studies on Natural Logic and Categorial Grammar. Ph.D. thesis, Univ. of Amsterdam
Fyodorov, Winter, Francez 2000	A natural logic inference system
Lakoff 1970	Linguistics and Natural Logic
MacCartney Manning 2008a	Natural Logic for Textual Inference	pdf
MacCartney Manning 2008b	Modeling semantic Containment and Exclusion in Natural Language Inference	pdf
MacCartney Manning 2009	An extended model of natural logic
Mronz C., de Rijke 2001	Light-Weight Entailment Checking for Computational Semantics
Pinkal M., 2007	Seminar on Entailment
Zaenen, Karttunen, Crouch 2005	Local Textual Inference: Can it be Defined or Circumscribed? in: ACL 2005	pdf
2)	Semantic Parsing
Conneau, A et al. 2017	Supervised learning of universal sentence representations from natural language inference data
Liang Potts 2014	Bringing machine learning and compositional semantics together	pdf
Liang 2016	Learning executable semantic parsers for natural language understanding	pdf
Zettlemoyer, Collins 2012	Learning to Map Sentences to Logical Form: Structured Classification with Probabilistic Categorial Grammars
3)	Corpora, (Parallel) Meaning Banks
	https://aclweb.org/aclwiki/Textual_Entailment_Resource_Pool
Abzianidze et al. 2017	The Parallel Meaning Bank: Towards a Multilingual Corpus of Translations Annotated with Compositional Meaning Representations	pdf
Banarescu et al. 2013	Abstract Meaning Representation for sembanking
Bentivogli et al. 2010	Building Textual Entailment Specialized Data Sets
Bos et al. 2017	The Groningen Meaning Bank
Bowman, Angeli, Potts, Manning, 2015	A large annotated corpus for learning natural language inference
Cooper et al. 1996	THE FRACAS TEXTUAL INFERENCE PROBLEM SET
Williams et al. 2018	A Broad-Coverage Challenge Corpus for Sentence Understanding through Inference
4)	Textual Entailment
Androutsopoulos, Malakasiotis 2010	A Survey of Paraphrasing and Textual Entailment Methods
Bentivogli et al. 2009	The fifth Pascal recognizing Textual Entailment Challenge
Bos , Markert. 2005	Recognising textual entailment with logical inference. In: HLT/EMNLP 2005
Bos , Markert. 2005	Recognising textual entailment with robust logical inference. In: MLCW p 404;
Bos, Markert 2006	Combining Shallow and Deep NLP Methods for Recognizing Textual Entailment
Bowman, Gauthier, Rastogi, Gupta, Manning, Potts 2016	A fast unified model for parsing and sentence understanding
Burchardt, A.	Modeling Textual Entailment with Role-Semantic Information
Cabrio, Magnini 2009	Defining specialized entailment engines using natural logic relations. in Vetulani: Human Language Technology
Cabrio Magnini 2010	Towards Qualitative Entailment of Textual Entailment Systems
Chambers et al. 2007	Learning Alignments and Leveraging Natural Logic
Chen, Zhu, Ling, Wei, Jiang, Inkpen 2017	Enhanced LSTM for natural language inference	pdf
Dagan, Dolan et al. 2009	Recognizing Textual Entailment: Rational, evaluation and approaches
Dagan, Glickman 2004	Probabilistic Textual Entailment: Generic Applied Modeling of Language Variability
Dagan, Glickman, Magnini 2006	The PASCAL recognising textual entailment challenge. In MLCW pp 177-190	pdf
de Salvo Braz et al. 2005	An Inference Model for Semantic Entailment in Natural Language
Kouylekov Magnini 2005	Tree-Edit Distance for Textual Entailment
Lien, Kouylekov 2015	Semantic Parsing for Textual Entailment	pdf
MacCartney 2009	Natural language inference
MacCartney et al. 2006	Learning to recognize features of valid textual entailments	pdf
Marelli et al. 2014	SemEval-2014 Task 1: Evaluation of compositional distributional semantic models on full sentences through semantic relatedness and textual entailment
Mirkin, Dagan, Padó 2010	Assessing the Role of Discourse References in Entailment Inference	pdf
Monz 1999	Contextual Inference in Computational Semantics
Monz, de Rijke 2001	Light-Weight Entailment Checking for Computational Semantics	pdf
Mou, Men, Li, Xu, Zhang, Yan,Jin. 2016	Natural language inference by tree-based convolution and heuristic matching	pdf
Nangia, Williams, Lazaridou, Bowman 2017	The RepEval 2017 Shared Task: Multi-genre natural language inference with sentence representations. repeval01	pdf
Nairn et al. 2006	Computing Relative Polarity for Textual Entailment
Pazienza, Pennacchiotti, Zanzotto 2005	Learning Textual Entailment on a Distance Feature Space in : MLWC pp 240-260
Parikh, Täckström, Das Uszkoreit 2016	A decomposable attention model for natural language inference	pdf
Pérez, Alfonseca 2005	Using Bleu-like Algorithms for the Automatic Recognition of Entailment in: MLCW, p 191 et seq.
Quiñonero-Candela, Dagan et al. 2005	Machine Learning Challenges: Evaluating Predictive Uncertainty, Visual Object Classification and Recognizing Textual Entailment First PASCAL Machine Learning Challenges Workshop, MLCW 2005(MLCW)
Sha et al. 2016	Recent Advances in Natural Language Inference: A Survey of Benchmarks, Resources, and Approaches	Link
Storks,Gao, Chai 2020	Recent Advances in Natural Language Inference: A Survey of Benchmarks, Resources, and Approaches	Link
Szpektor et al. 2007	Instance-based Evaluation of Entailment Rule Acquisition
TE References 2017	ACLWeb list of references
Vanderwende, Dolan 2005	What Syntax Can Contribute in the Entailment Task in: MLWC p 205 et seq.
Wang , Jiang 2016	Learning natural language inference with LSTM	pdf
Williams et al. 2018	A Broad-Coverage Challenge Corpus for Sentence Understanding through Inference
4a)	Textual Entailment with LLMs
BERT (and variants)	Chronologically ordered papers
Tuteja, H. 2019	Textual Entailment using Bert (Implementierung auf github)	Link
Verma, Dh. 2021	Fine-tuning pre-trained transformer models for sentence entailment. A PyTorch and Hugging Face implementation of fine-tuning BERT on the MultiNLI dataset	Link
Wehnert et al. 2022	Applying BERT Embeddings to Predict Legal Textual Entailment	Link
Alsuhaibani, M. 2023	Deep Learning-based Sentence Embeddings using BERT for Textual Entailment	Link
Arakelyan et al. 2024	Semantic Sensitivities and Inconsistent Predictions: Measuring the Fragility of NLI Models
Llama, Mistral
Madaan et al. 2024	Lost in Inference: Rediscovering the Role of Natural Language Inference for Large Language Models	Link
(Chat)GPT
Blair-Stanek A. et al. 2023	Can GPT-3 Perform Statutory Reasoning?	Link
Katz, D.M et al. 2023	GPT-4 passes the Bar Exam	Link
Laskar et al. 2023	A Systematic Study and Comprehensive Evaluation of ChatGPT on Benchmark Datasets	Link
Luo et al. 2023	ChatGPT as a Factual Inconsistency Evaluator for Text Summarization	Link
Nguyen et al. 2023a	Beyond logic programming for legal reasoning	Link
Nguyen et al. 2023 b	Black-Box Analysis: GPTs Across Time in Legal Textual Entailment Task	Link
Nguyen et al. 2023 c	How well do Sota Legal Reasoning models support Abductive reasoning?	Link
OpenAI 2023	GPT-4 Technical Report	Link