Abstract: RGB-infrared person re-identification (Re-ID) is a challenging task that aims to match person images between the visible and infrared modalities, and it plays a crucial role in criminal investigation and intelligent video surveillance. To address the weak extraction of fine-grained features in current cross-modal person Re-ID methods, this paper proposes a person re-identification model based on fused attention and feature enhancement. First, automatic data augmentation is employed to mitigate the differences in viewpoint and scale among cameras, and a cross-attention multi-scale Vision Transformer is proposed to generate more discriminative feature representations by processing multi-scale features. Then, channel attention and spatial attention mechanisms are introduced to learn the information most important for distinguishing identities when fusing visible and infrared image features. Finally, a loss function based on an adaptive-weight hard triplet loss is designed to strengthen the correlation between samples and improve the model's ability to identify persons across visible and infrared images. Extensive experiments on the SYSU-MM01 and RegDB datasets show that the proposed approach achieves mAP of 68.05% and 85.19%, respectively, outperforming many state-of-the-art approaches. Moreover, ablation experiments and comparative analysis validate the superiority and effectiveness of the proposed model.
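The adaptive-weight hard triplet loss mentioned above can be illustrated with a minimal sketch. The abstract does not give the exact formulation, so the following assumes one common variant: for each anchor, positive and negative distances are aggregated with softmax weights so that harder positives (farther) and harder negatives (closer) contribute more; the function name and margin value are illustrative, not from the paper.

```python
import numpy as np

def adaptive_hard_triplet_loss(features, labels, margin=0.3):
    """Sketch of an adaptive-weight hard triplet loss.

    Assumption: weights are a softmax over pairwise Euclidean
    distances, emphasizing far positives and near negatives.
    """
    n = features.shape[0]
    # Pairwise Euclidean distance matrix (n x n).
    d = np.linalg.norm(features[:, None, :] - features[None, :, :], axis=2)
    losses = []
    for i in range(n):
        pos = (labels == labels[i]) & (np.arange(n) != i)
        neg = labels != labels[i]
        if not pos.any() or not neg.any():
            continue
        # Adaptive weights: softmax over +distance for positives
        # (harder = farther) and over -distance for negatives
        # (harder = closer).
        wp = np.exp(d[i, pos]) / np.exp(d[i, pos]).sum()
        wn = np.exp(-d[i, neg]) / np.exp(-d[i, neg]).sum()
        dp = (wp * d[i, pos]).sum()   # weighted positive distance
        dn = (wn * d[i, neg]).sum()   # weighted negative distance
        losses.append(max(0.0, margin + dp - dn))
    return float(np.mean(losses)) if losses else 0.0
```

Compared with plain batch-hard mining, the soft weighting lets every pair influence the gradient, which is one way such a loss can "enhance the correlation between each sample" as the abstract describes.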