Title: DSRP Seminar - Can we connect vision and language using graphs?
Date: Wednesday, 16 December 2020
Time: 12:00 (noon)
Venue: Online (Zoom); Register online
Speaker: Mr Brandon Birmingham
A long-standing goal of Artificial Intelligence is to have agents capable of understanding and interpreting the visual world using natural language. Advances in computing power and the sheer amount of visual and linguistic data available today are bringing this goal closer. Research at the intersection of Computer Vision and Natural Language Processing is currently booming, and the automatic generation of image captions has recently gained a lot of popularity. Several ideas and architectures have been proposed to automatically generate human-like sentences that describe images, but all fall short of human-level quality. This talk explores how the graph data structure can be used to connect the vision and language modalities in the context of image caption generation, and how such graph-based models compare with the current state-of-the-art deep-learning-based models.
Attendees are kindly asked to register online.
The Data Science Research Platform (DSRP) at the University of Malta conducts research in the interdisciplinary field of data science. The group aims to use signal processing, machine learning and statistics to develop innovative techniques and to extract useful knowledge from various data sources effectively, for the benefit of the wider public.
For more information about the DSRP, please visit their website.
To receive notifications about future events organised by the DSRP, please subscribe to their mailing list.