TY - GEN
T1 - Do you see what I mean? Visual resolution of linguistic ambiguities
AU - Berzak, Yevgeni
AU - Barbu, Andrei
AU - Harari, Daniel
AU - Katz, Boris
AU - Ullman, Shimon
N1 - Publisher Copyright: © 2015 Association for Computational Linguistics.
PY - 2015/9
Y1 - 2015/9
N2 - Understanding language goes hand in hand with the ability to integrate complex contextual information obtained via perception. In this work, we present a novel task for grounded language understanding: disambiguating a sentence given a visual scene which depicts one of the possible interpretations of that sentence. To this end, we introduce a new multimodal corpus containing ambiguous sentences, representing a wide range of syntactic, semantic and discourse ambiguities, coupled with videos that visualize the different interpretations for each sentence. We address this task by extending a vision model which determines if a sentence is depicted by a video. We demonstrate how such a model can be adjusted to recognize different interpretations of the same underlying sentence, allowing sentences to be disambiguated in a unified fashion across the different ambiguity types.
AB - Understanding language goes hand in hand with the ability to integrate complex contextual information obtained via perception. In this work, we present a novel task for grounded language understanding: disambiguating a sentence given a visual scene which depicts one of the possible interpretations of that sentence. To this end, we introduce a new multimodal corpus containing ambiguous sentences, representing a wide range of syntactic, semantic and discourse ambiguities, coupled with videos that visualize the different interpretations for each sentence. We address this task by extending a vision model which determines if a sentence is depicted by a video. We demonstrate how such a model can be adjusted to recognize different interpretations of the same underlying sentence, allowing sentences to be disambiguated in a unified fashion across the different ambiguity types.
UR - http://www.scopus.com/inward/record.url?scp=84959867661&partnerID=8YFLogxK
U2 - 10.18653/v1/d15-1172
DO - 10.18653/v1/d15-1172
M3 - Conference contribution
T3 - Conference Proceedings - EMNLP 2015: Conference on Empirical Methods in Natural Language Processing
SP - 1477
EP - 1487
BT - Conference Proceedings - EMNLP 2015
T2 - Conference on Empirical Methods in Natural Language Processing, EMNLP 2015
Y2 - 17 September 2015 through 21 September 2015
ER -