Visual navigation combining split attention mechanism and next expected observation

doi:10.13382/j.issn.1000-7105.2023.01.011

Volume 37, Issue 1, 2023, pp. 96-105

CLC Number: TP242; TP391.41

Abstract:

A visual navigation model that combines a split-attention mechanism with next expected observation (NEO) is proposed to address the degradation in navigation accuracy, real-time performance, and image-matching reliability that deep reinforcement learning visual navigation algorithms suffer when the navigation scene changes. First, the features of the current and target states are extracted using a ResNeSt50 backbone network, which reduces network redundancy. Shallow target feature information is then captured intensively using cross-stage partial connections (CSP) to enhance the learning ability of the model. Next, an improved loss function is proposed to bring the inference network closer to the true posterior, so that the agent can make the best decision in the current environment, further improving navigation accuracy across different scenarios. Training and testing are conducted on the AVD dataset and in AI2-THOR scenes. The experimental results show that the proposed algorithm achieves a navigation accuracy of up to 96.8%, with an average improvement of about 3% in success rate (SR) and about 6% in success weighted by path length (SPL), meeting the requirements for navigation accuracy and real-time matching.
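
The abstract reports SR and SPL without defining them. The following is a minimal sketch, assuming the definitions commonly used in the embodied navigation literature: SR is the fraction of successful episodes, and SPL weights each success by the ratio of the shortest path length to the path length actually taken. The function names and the episode values are hypothetical illustrations, not taken from the paper, and the paper's own evaluation code may differ.

```python
from typing import Sequence


def success_rate(successes: Sequence[bool]) -> float:
    """Fraction of episodes in which the agent reached the target."""
    return sum(successes) / len(successes)


def spl(
    successes: Sequence[bool],
    shortest_lengths: Sequence[float],
    path_lengths: Sequence[float],
) -> float:
    """Success weighted by path length, averaged over all episodes.

    Each successful episode contributes l / max(p, l), where l is the
    shortest path length and p is the length of the path the agent took.
    Failed episodes contribute 0.
    """
    total = 0.0
    for ok, l, p in zip(successes, shortest_lengths, path_lengths):
        if not ok:
            continue
        denom = max(p, l)
        # Assumption: an episode that starts at the target (l == p == 0)
        # counts as a perfectly efficient success.
        total += l / denom if denom > 0 else 1.0
    return total / len(successes)


# Hypothetical episodes: (success, shortest path length, actual path length)
episodes = [
    (True, 2.0, 4.0),
    (True, 3.0, 3.0),
    (False, 5.0, 10.0),
    (True, 1.0, 2.0),
]
ok, shortest, taken = zip(*episodes)
sr_value = success_rate(ok)
spl_value = spl(ok, shortest, taken)
```

Under these definitions SPL can never exceed SR, so a gain in SPL reflects shorter paths as well as more frequent successes.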

Online: June 15, 2023