Submitted by Tingman Yan.

Submission data

Full nameMatchAttention for High-Resolution Cross-View Matching
DescriptionCross-view matching is fundamentally achieved through cross-attention mechanisms. However, matching of high-resolution images remains challenging due to the quadratic complexity and lack of explicit matching constraints in the existing cross-attention. This paper proposes an attention mechanism, MatchAttention, that dynamically matches relative positions. The relative position determines the attention sampling center of the key-value pairs given a query. Continuous and differentiable sliding-window attention sampling is achieved by the proposed BilinearSoftmax. The relative positions are iteratively updated through residual connections across layers by embedding them into the feature channels. Since the relative position is exactly the learning target for cross-view matching, an efficient hierarchical cross-view decoder, MatchDecoder, is designed with MatchAttention as its core component. To handle cross-view occlusions, gated cross-MatchAttention and a consistency-constrained loss are
ParametersMatchStereo-B, 76M params
Publication titleMatchAttention: Matching the Relative Positions for High-Resolution Cross-View Matching
Publication authorsTingman Yan, Tao Liu, Xilian Yang, Qunfei Zhao, Zeyang Xia
Publication venueArxiv, 2025
Publication URLhttps://arxiv.org/abs/2510.14260
Programming language(s)Pytorch, CUDA
HardwareRTX 4090
Source code or download URLhttps://github.com/TingmanYan/MatchAttention
Submission creation date15 Aug, 2025
Last edited17 Oct, 2025

High-res multi-view results



Infoallhigh-res
multi-view
indooroutdoorbotani.boulde.bridgedoorexhibi.lectur.living.loungeobserv.old co.statueterrac.
No results yet.

Low-res many-view results



Infoalllow-res
many-view
indooroutdoorlakesidesand boxstorage roomstorage room 2tunnel
No results yet.

Low-res two-view results



Infoalllakes. 1llakes. 1ssand box 1lsand box 1sstora. room 1lstora. room 1sstora. room 2lstora. room 2sstora. room 2 1lstora. room 2 1sstora. room 2 2lstora. room 2 2sstora. room 3lstora. room 3stunnel 1ltunnel 1stunnel 2ltunnel 2stunnel 3ltunnel 3s
copylefttwo views0.790.432.510.110.020.930.024.833.550.221.210.190.261.020.580.000.010.000.000.010.01

SLAM results



allboxesboxes darkbuddhacables 4cables 5desk 1desk 2desk changing 2desk dark 1desk dark 2desk global light changesdesk ir lightdinodroneforeground occlusionhelmetkidnap 2lamplarge loop 2large loop 3large non loopmotion 2motion 3motion 4planar 1reflective 2scale changetable 1table 2table 5table 6table global light changestable local light changestable scenetrashbin
MethodInfo
No results yet.