Visual Question Answering From Theory to Application

Visual Question Answering (VQA) usually combines visual inputs like image and video with a natural language question concerning the input and generates a natural language answer as the output. This is by nature a multi-disciplinary research problem, involving computer vision (CV), natural language p...

Full description

Bibliographic Details
Main Authors: Wu, Qi, Wang, Peng (Author), Wang, Xin (Author), He, Xiaodong (Author)
Format: eBook
Language:English
Published: Singapore Springer Nature Singapore 2022, 2022
Edition:1st ed. 2022
Series:Advances in Computer Vision and Pattern Recognition
Subjects:
Online Access:
Collection: Springer eBooks 2005- - Collection details see MPG.ReNa
Table of Contents:
  • 1. Introduction
  • 2. Deep Learning Basics
  • 3. Question Answering (QA) Basics
  • 4. The Classical Visual Question Answering
  • 5. Knowledge-based VQA.