Survey of visual question answering for intelligent interaction

Home > Archive>Volume 33, Issue 2, 2019 >117-124

Survey of visual question answering for intelligent interaction
DOI:
                        
CSTR:
                        
Author:
                        
Affiliation:
Clc Number:TP391；TN919.9
Fund Project:

Article

Figures

Metrics

Reference

Cited by

Materials

Comments

Abstract:

With the application of deep learning method in the field of image processing, the image related intelligent interaction technology has also been rapidly developed. Visual question answering (VQA) collects the image information by asking questions related to the image and ultimately achieves the purpose for enriching the image understanding. Through comprehensive analysis and comparison of related methods of VQA in recent years, the method can be constructively divided into four types according to the model structure: basic model, attention mechanism related model, modular model and external knowledge base model. At the same time, it also points out directions for visual and semantic information processing and future research on visual reasoning in VQA from three aspects.

Reference

Cited by

Get Citation

Copy

Article Metrics

Abstract:
PDF:
HTML:
Cited by:

History

Received:
Revised:
Adopted:
Online: January 04,2024
Published:

Home

Introduction

Editorial Committee

Current Issue

Policy

Contact Us

Chinese

Get Citation

Related Videos

Share

Article Metrics

History

Article QR Code