Tech

Researchers advance cross-modality smart security with transformer model

Published

3 months ago

October 31, 2025

cineplex360

Illustration of the VX-ReID task and main idea. Credit: Wang Hongqiang

A research team led by Professor Wang Hongqiang from the Hefei Institutes of Physical Science of the Chinese Academy of Sciences proposed a Global-Local Alignment Attention (GLAA) model based on an Asymmetric Siamese Transformer (AST), which markedly enhances the performance of Visible-X-ray cross-modality package re-identification tasks.

This study was published in IEEE Transactions on Information Forensics and Security.

Visible X-ray cross-modality package re-identification is a core technology in security inspection. The challenge lies in the significant pixel-level differences between the two modal images, making it difficult for traditional methods to extract robust cross-modality invariant features.

In this study, the researchers incorporated an asymmetric design concept into the Siamese Transformer architecture by proposing a Cross-Modality Asymmetric Siamese Transformer (CAST) structure. Embedding LayerNorm layers and modality-aware encoding in one branch effectively enhances the model‘s ability to extract cross-modality invariant features.

They also designed a Global-Local Cross-modality Alignment Attention module. By modeling the interaction between global and local features, it enhances fine-grained feature representation while addressing the spatial misalignment issues in cross-modality images.

Experimental results show that the key metrics of this model on a dedicated cross-modality package re-identification dataset show significant improvement over the current state-of-the-art methods, providing reliable technical support for the intelligentization of security inspection.

This work is the first to introduce the Transformer architecture into the cross-modality package re-identification task, breaking through the limitations of existing methods that rely on symmetric convolutional networks, according to the researchers.

More information:
Yonggan Wu et al, An Asymmetric Siamese Transformer With Global-Local Alignment Attention for Visible-X-Ray Cross-Modality Package Re-Identification, IEEE Transactions on Information Forensics and Security (2025). DOI: 10.1109/tifs.2025.3592540

Provided by
Chinese Academy of Sciences

Citation:
Researchers advance cross-modality smart security with transformer model (2025, October 30)
retrieved 30 October 2025
from https://techxplore.com/news/2025-10-advance-modality-smart.html

This document is subject to copyright. Apart from any fair dealing for the purpose of private study or research, no
part may be reproduced without the written permission. The content is provided for information purposes only.

Source link

Related Topics:computer news hi-tech news hitech information technology innovation inventions

CinePlex360

Tech

Researchers advance cross-modality smart security with transformer model

Leave a Reply

Leave a Reply

Trending

Leave a Reply Cancel reply

Leave a Reply

Trending

Leave a Reply