Tech
Researchers advance cross-modality smart security with transformer model
A research team led by Professor Wang Hongqiang from the Hefei Institutes of Physical Science of the Chinese Academy of Sciences proposed a Global-Local Alignment Attention (GLAA) model based on an Asymmetric Siamese Transformer (AST), which markedly enhances the performance of Visible-X-ray cross-modality package re-identification tasks.
This study was published in IEEE Transactions on Information Forensics and Security.
Visible X-ray cross-modality package re-identification is a core technology in security inspection. The challenge lies in the significant pixel-level differences between the two modal images, making it difficult for traditional methods to extract robust cross-modality invariant features.
In this study, the researchers incorporated an asymmetric design concept into the Siamese Transformer architecture by proposing a Cross-Modality Asymmetric Siamese Transformer (CAST) structure. Embedding LayerNorm layers and modality-aware encoding in one branch effectively enhances the model‘s ability to extract cross-modality invariant features.
They also designed a Global-Local Cross-modality Alignment Attention module. By modeling the interaction between global and local features, it enhances fine-grained feature representation while addressing the spatial misalignment issues in cross-modality images.
Experimental results show that the key metrics of this model on a dedicated cross-modality package re-identification dataset show significant improvement over the current state-of-the-art methods, providing reliable technical support for the intelligentization of security inspection.
This work is the first to introduce the Transformer architecture into the cross-modality package re-identification task, breaking through the limitations of existing methods that rely on symmetric convolutional networks, according to the researchers.
More information:
												Yonggan Wu et al, An Asymmetric Siamese Transformer With Global-Local Alignment Attention for Visible-X-Ray Cross-Modality Package Re-Identification, IEEE Transactions on Information Forensics and Security (2025). DOI: 10.1109/tifs.2025.3592540
                                                Citation:
                                                Researchers advance cross-modality smart security with transformer model (2025, October 30)
                                                retrieved 30 October 2025
                                                from https://techxplore.com/news/2025-10-advance-modality-smart.html
                                            
                                            This document is subject to copyright. Apart from any fair dealing for the purpose of private study or research, no
                                            part may be reproduced without the written permission. The content is provided for information purposes only.