Submitted by Tingman Yan.

Submission data

Full name: MatchAttention for High-Resolution Cross-View Matching
Description: Cross-view matching is fundamentally achieved through cross-attention mechanisms. However, matching of high-resolution images remains challenging due to the quadratic complexity and lack of explicit matching constraints in the existing cross-attention. This paper proposes an attention mechanism, MatchAttention, that dynamically matches relative positions. The relative position determines the attention sampling center of the key-value pairs given a query. Continuous and differentiable sliding-window attention sampling is achieved by the proposed BilinearSoftmax. The relative positions are iteratively updated through residual connections across layers by embedding them into the feature channels. Since the relative position is exactly the learning target for cross-view matching, an efficient hierarchical cross-view decoder, MatchDecoder, is designed with MatchAttention as its core component. To handle cross-view occlusions, gated cross-MatchAttention and a consistency-constrained loss are
Parameters: MatchStereo-B, 76M params
Publication title: MatchAttention: Matching the Relative Positions for High-Resolution Cross-View Matching
Publication authors: Tingman Yan, Tao Liu, Xilian Yang, Qunfei Zhao, Zeyang Xia
Publication venue: arXiv, 2025
Publication URL: https://arxiv.org/abs/2510.14260
Programming language(s): PyTorch, CUDA
Hardware: RTX 4090
Source code or download URL: https://github.com/TingmanYan/MatchAttention
Submission creation date: 15 Aug, 2025
Last edited: 17 Oct, 2025
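
The Description says that a relative position sets the sampling center of the key-value pairs for each query, and that sampling stays continuous and differentiable. The sketch below illustrates only that general idea with plain bilinear interpolation in standard-library Python. It is not the paper's BilinearSoftmax or MatchAttention; the helper names (sampling_center, bilinear_weights, sample) and the toy feature map are hypothetical and chosen for illustration.

```python
import math

def sampling_center(qx, qy, rel):
    """Assumed convention: center = query position + relative position."""
    return qx + rel[0], qy + rel[1]

def bilinear_weights(cx, cy):
    """Four integer neighbours of (cx, cy) with their bilinear weights.

    The weights vary continuously with (cx, cy), which is why a
    fractional sampling center can receive gradients.
    """
    x0, y0 = math.floor(cx), math.floor(cy)
    fx, fy = cx - x0, cy - y0
    return [
        ((x0, y0), (1 - fx) * (1 - fy)),
        ((x0 + 1, y0), fx * (1 - fy)),
        ((x0, y0 + 1), (1 - fx) * fy),
        ((x0 + 1, y0 + 1), fx * fy),
    ]

def sample(feature, cx, cy):
    """Bilinearly sample feature[y][x] at (cx, cy), clamping at the border."""
    h, w = len(feature), len(feature[0])
    total = 0.0
    for (x, y), weight in bilinear_weights(cx, cy):
        xc = min(max(x, 0), w - 1)
        yc = min(max(y, 0), h - 1)
        total += weight * feature[yc][xc]
    return total

# Toy 2x4 feature map with value x + 10*y (illustrative data only).
feature = [[0.0, 1.0, 2.0, 3.0],
           [10.0, 11.0, 12.0, 13.0]]
cx, cy = sampling_center(1, 0, (0.5, 0.25))  # query (1, 0), offset (0.5, 0.25)
print(sample(feature, cx, cy))  # 4.0
```

Because the four weights are smooth functions of the center, a small change in the relative position produces a small change in the sampled value. That property is what allows a relative position to be refined layer by layer, as the Description outlines.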

High-res multi-view results



Columns: Info, all, high-res multi-view, indoor, outdoor, botani., boulde., bridge, door, exhibi., lectur., living., lounge, observ., old co., statue, terrac.
No results yet.

Low-res many-view results



Columns: Info, all, low-res many-view, indoor, outdoor, lakeside, sand box, storage room, storage room 2, tunnel
No results yet.

Low-res two-view results



Info: two views
all: 0.70
lakes. 1l: 0.35
lakes. 1s: 2.06
sand box 1l: 0.05
sand box 1s: 0.01
stora. room 1l: 0.93
stora. room 1s: 0.01
stora. room 2l: 4.29
stora. room 2s: 3.44
stora. room 2 1l: 0.10
stora. room 2 1s: 1.13
stora. room 2 2l: 0.18
stora. room 2 2s: 0.25
stora. room 3l: 0.76
stora. room 3s: 0.39
tunnel 1l: 0.00
tunnel 1s: 0.01
tunnel 2l: 0.00
tunnel 2s: 0.00
tunnel 3l: 0.00
tunnel 3s: 0.00

SLAM results



Columns: Method, Info, all, cables 1, cables 2, cables 3, camera shake 1, camera shake 2, camera shake 3, ceiling 1, ceiling 2, desk 3, desk changing 1, einstein 1, einstein 2, einstein dark, einstein flashlight, einstein global light changes 1, einstein global light changes 2, einstein global light changes 3, kidnap 1, kidnap dark, large loop 1, mannequin 1, mannequin 3, mannequin 4, mannequin 5, mannequin 7, mannequin face 1, mannequin face 2, mannequin face 3, mannequin head, motion 1, planar 2, planar 3, plant 1, plant 2, plant 3, plant 4, plant 5, plant dark, plant scene 1, plant scene 2, plant scene 3, reflective 1, repetitive, sfm bench, sfm garden, sfm house loop, sfm lab room 1, sfm lab room 2, sofa 1, sofa 2, sofa 3, sofa 4, sofa dark 1, sofa dark 2, sofa dark 3, sofa shake, table 3, table 4, table 7, vicon light 1, vicon light 2
No results yet.