Now showing items 1-1 of 1
Towards Understandıng Intuıtıve Physıcs Wıth Language And Vısıon
(Fen Bilimleri Enstitüsü, 2021)
Visual question answering (VQA) is one of the difficult tasks in multimodal machine reasoning. VQA requires machines to provide correct answers to questions about an image or a video. Here, the machine should perceive the ...