Integrating Vision and Language: Achievements and Challenges in Multimodal Machine Learning
Module Description
Course | Module Abbreviation | Credit Points |
---|---|---|
BA-2010 | AS-FL | 8 LP |
BA-2010 | AS-CL | 8 LP |
BA-2010[100%|75%] | CS-CL | 6 LP |
BA-2010[50%] | BS-CL | 6 LP |
BA-2010[25%] | BS-AC, BS-FL | 4 LP |
Master | SS-CL, SS-TAC, SS-FAL | 8 LP |
Lecturer | Letitia Parcalabescu |
Module Type |
|
Language | English |
First Session | 23.10.2019 |
Time and Place | Wednesday, 16:15-17:45, INF 326 / SR 27 2. OG |
End of Commitment Period | 21.01.2020 |
Prerequisite for Participation
Assessment
Content
Progress in artificial intelligence requires more than separate understanding of text and unrelated processing
of other signals, e.g. image, sound. Multi-modal machine learning aims to handle a combination of different
signal types and relate information from different modalities. In the seminar, we will study the latest machine
learning techniques tackling the multimodal applications and datasets emerged in the last years. We will
discuss the performance of state-of-the-art models and assess the shortcomings and challenges of current
research. Topics include:
Module Overview
Agenda
For the agenda and the respective materials, please check the protected Materials Webpage.
Literature
Literature will be provided by the beginning of the term. A survey: