电子测量与仪器学报

孙伟,徐凡,张小瑞,胡亚华,赵宇煌.多视图融合和全局特征增强的车辆重识别网络[J].电子测量与仪器学报,2023,37(1):78-86

多视图融合和全局特征增强的车辆重识别网络

Vehicle re-identification network with multi-viewfusion and global feature enhancement

DOI：10.13382/j.issn.1000-7105.2023.01.009

英文关键词:vehicle re-identification view segmentation view stitching attention mechanism feature enhancement

基金项目:国家自然科学基金(61304205, 62272236)、江苏省自然科学基金(BK20191401, BK20201136)、江苏省研究生科研与实践创新计划项目(SJCX21_0363)、大学生创新创业训练项目(XJDC202110300601, 202010300290, 202010300211, 202010300116E)资助

作者	单位
孙伟	1. 南京信息工程大学自动化学院，2. 南京信息工程大学江苏省大气环境与装备技术协同创新中心
徐凡	1. 南京信息工程大学自动化学院
张小瑞	2. 南京信息工程大学江苏省大气环境与装备技术协同创新中心，3. 南京信息工程大学数字取证教育部工程研究中心，4. 南京信息工程大学无锡研究院无锡，5. 南京信息工程大学计算机与软件学院
胡亚华	1. 南京信息工程大学自动化学院
赵宇煌	1. 南京信息工程大学自动化学院

Author	Institution
Sun Wei	1. College of Automation, Nanjing University of Information Science & Technology，2. Jiangsu Collaborative Innovation Center on Atmospheric Environment and Equipment Technology, Nanjing University of Information Science & Technology
Xu Fan	1. College of Automation, Nanjing University of Information Science & Technology
Zhang Xiaorui	2. Jiangsu Collaborative Innovation Center on Atmospheric Environment and Equipment Technology, Nanjing University of Information Science & Technology,3. Engineering Research Center of Digital Forensics, Ministry of Education, Nanjing University of Information Science & Technology,4. Wuxi Research Institute, Nanjing University of Information Science & Technology,5. College of Computer and Software, Nanjing University of Information Science & Technology
Hu Yahua	1. College of Automation, Nanjing University of Information Science & Technology
Zhao Yuhuang	1. College of Automation, Nanjing University of Information Science & Technology

摘要点击次数: 1335

全文下载次数: 2760

中文摘要:

车辆重识别是智能交通领域重要应用之一,现有的车辆识别方法大多集中于预定义的局部区域特征或全局外观特征。然而,在复杂的交通环境下,传统的方法难以获取预定义的局部区域,同时很难捕捉有价值的车辆全局特征信息。因此,本文提出一种具有多视图融合的混合注意力机制和全局特征增强的端到端双分支网络。该网络旨在通过增强车辆的特征表达能力和特征质量来获得更完整、更多样的车辆特征。本文通过视图解析网络对车辆图片 4 个视角的视图进行分割,并通过视图拼接方法缓解分割不准确导致的信息丢失问题。为了更好地突出拼接视图中的显著性局部区域,本文提出一种由通道注意力机制和自注意力机制组成的混合注意力模块。通过该模块从车辆拼接视图中分别获取关键局部信息和局部信息之间的相关性,更好地凸显拼接视图中车辆局部的细节信息。除此之外,还提出了一个全局特征增强模块,通过池化和卷积获得全局特征的空间和通道关系。该模块不仅能提取到语义增强的车辆特征,而且还使车辆特征中包含完好的细节信息,解决获取的车辆图像受视角变化、光照条件变化等因素的影响。在 Veri-776 和 VehicleID 数据集上的大量实验表明,mAP、CMC@ 1 和 CMC@ 5 分别达到了 82. 41%、98. 63%和 99. 23%,优于现有的方法。

英文摘要:

Vehicle re-identification is one of the important applications in the field of intelligent transportation. Most of the existing vehicle Re-ID methods focus on pre-defined local area features or global appearance features. However, in the complex traffic environment, it is difficult for traditional methods to acquire pre-defined local regions, and it is difficult to capture valuable vehicle global feature information. Therefore, an end-to-end dual-branch network with multi-view fusion hybrid attention mechanism and global feature enhancement is proposed. The network aims to obtain more complete and diverse vehicle features by enhancing the feature representation ability and feature quality of vehicles. This paper uses the view parsing network to segment the four views of the vehicle image, and uses the view stitching method to alleviate the problem of information loss caused by inaccurate segmentation. To better highlight salient local regions in stitched views, this paper proposes a hybrid attention module consisting of a channel attention mechanism and a self-attention mechanism. Through this module, the correlation between the key local information and the local information is obtained from the stitched view respectively, so as to better highlight the detailed information of the vehicle part in the stitched view. Besides, this paper also proposes a global feature enhancement module to obtain the spatial and channel relationship of global features through pooling and convolution. This module not only extracts the semantically enhanced vehicle features, but also makes the vehicle features contain complete detailed information, and solve the influence of the acquired vehicle images by factors such as changes in viewing angle and lighting conditions. Extensive experiments in this paper on the Veri-776 and VehicleID datasets show that mAP, CMC@ 1, and CMC@ 5 reach 82. 41%, 98. 63%, and 99. 23%, respectively, which is better than the existing methods.

查看全文查看/发表评论下载PDF阅读器