Li et al., 2025 - Google Patents
CoMatch: Dynamic Covisibility-Aware Transformer for Bilateral Subpixel-Level Semi-Dense Image MatchingLi et al., 2025
View PDF- Document ID
- 8906071201830987320
- Author
- Li Z
- Lu Y
- Tang L
- Zhang S
- Ma J
- Publication year
- Publication venue
- arXiv preprint arXiv:2503.23925
External Links
Snippet
This prospective study proposes CoMatch, a novel semi-dense image matcher with dynamic covisibility awareness and bilateral subpixel accuracy. Firstly, observing that modeling context interaction over the entire coarse feature map elicits highly redundant computation …
- 230000002146 bilateral effect 0 title abstract description 18
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06K—RECOGNITION OF DATA; PRESENTATION OF DATA; RECORD CARRIERS; HANDLING RECORD CARRIERS
- G06K9/00—Methods or arrangements for reading or recognising printed or written characters or for recognising patterns, e.g. fingerprints
- G06K9/62—Methods or arrangements for recognition using electronic means
- G06K9/6201—Matching; Proximity measures
- G06K9/6202—Comparing pixel values or logical combinations thereof, or feature values having positional relevance, e.g. template matching
- G06K9/6203—Shifting or otherwise transforming the patterns to accommodate for positional errors
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06K—RECOGNITION OF DATA; PRESENTATION OF DATA; RECORD CARRIERS; HANDLING RECORD CARRIERS
- G06K9/00—Methods or arrangements for reading or recognising printed or written characters or for recognising patterns, e.g. fingerprints
- G06K9/36—Image preprocessing, i.e. processing the image information without deciding about the identity of the image
- G06K9/46—Extraction of features or characteristics of the image
- G06K9/4604—Detecting partial patterns, e.g. edges or contours, or configurations, e.g. loops, corners, strokes, intersections
- G06K9/4609—Detecting partial patterns, e.g. edges or contours, or configurations, e.g. loops, corners, strokes, intersections by matching or filtering
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06K—RECOGNITION OF DATA; PRESENTATION OF DATA; RECORD CARRIERS; HANDLING RECORD CARRIERS
- G06K9/00—Methods or arrangements for reading or recognising printed or written characters or for recognising patterns, e.g. fingerprints
- G06K9/62—Methods or arrangements for recognition using electronic means
- G06K9/6217—Design or setup of recognition systems and techniques; Extraction of features in feature space; Clustering techniques; Blind source separation
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06K—RECOGNITION OF DATA; PRESENTATION OF DATA; RECORD CARRIERS; HANDLING RECORD CARRIERS
- G06K9/00—Methods or arrangements for reading or recognising printed or written characters or for recognising patterns, e.g. fingerprints
- G06K9/62—Methods or arrangements for recognition using electronic means
- G06K9/6267—Classification techniques
- G06K9/6268—Classification techniques relating to the classification paradigm, e.g. parametric or non-parametric approaches
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06K—RECOGNITION OF DATA; PRESENTATION OF DATA; RECORD CARRIERS; HANDLING RECORD CARRIERS
- G06K9/00—Methods or arrangements for reading or recognising printed or written characters or for recognising patterns, e.g. fingerprints
- G06K9/00624—Recognising scenes, i.e. recognition of a whole field of perception; recognising scene-specific objects
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06K—RECOGNITION OF DATA; PRESENTATION OF DATA; RECORD CARRIERS; HANDLING RECORD CARRIERS
- G06K9/00—Methods or arrangements for reading or recognising printed or written characters or for recognising patterns, e.g. fingerprints
- G06K9/00362—Recognising human body or animal bodies, e.g. vehicle occupant, pedestrian; Recognising body parts, e.g. hand
- G06K9/00369—Recognition of whole body, e.g. static pedestrian or occupant recognition
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T2207/00—Indexing scheme for image analysis or image enhancement
- G06T2207/30—Subject of image; Context of image processing
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T2207/00—Indexing scheme for image analysis or image enhancement
- G06T2207/20—Special algorithmic details
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T7/00—Image analysis
- G06T7/20—Analysis of motion
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T2207/00—Indexing scheme for image analysis or image enhancement
- G06T2207/10—Image acquisition modality
- G06T2207/10016—Video; Image sequence
Similar Documents
| Publication | Publication Date | Title |
|---|---|---|
| Jiang et al. | Cotr: Correspondence transformer for matching across images | |
| Potje et al. | Xfeat: Accelerated features for lightweight image matching | |
| Ding et al. | Transmvsnet: Global context-aware multi-view stereo network with transformers | |
| Yang et al. | Sanet: Scene agnostic network for camera localization | |
| Lee et al. | KNN local attention for image restoration | |
| Cai et al. | Objectfusion: Multi-modal 3d object detection with object-centric fusion | |
| Jin et al. | Eigenlanes: Data-driven lane descriptors for structurally diverse lanes | |
| Yu et al. | Adaptive spot-guided transformer for consistent local feature matching | |
| Liu et al. | Beyond human-level license plate super-resolution with progressive vehicle search and domain priori GAN | |
| Zhou et al. | Bev@ dc: Bird's-eye view assisted training for depth completion | |
| Zou et al. | Enhanced 3D convolutional networks for crowd counting | |
| Zhang et al. | Elsd: Efficient line segment detector and descriptor | |
| Wu et al. | Gomvs: Geometrically consistent cost aggregation for multi-view stereo | |
| CN101950426A (en) | Vehicle relay tracking method in multi-camera scene | |
| CN103996201A (en) | Stereo matching method based on improved gradient and adaptive window | |
| Kuang et al. | DenseGAP: Graph-structured dense correspondence learning with anchor points | |
| Li et al. | Dynamic feature-memory transformer network for RGBT tracking | |
| Wang et al. | Lidar2map: In defense of lidar-based semantic map construction using online camera distillation | |
| Schumacher et al. | Matching cost computation algorithm and high speed fpga architecture for high quality real-time semi global matching stereo vision for road scenes | |
| Cao et al. | Embracing events and frames with hierarchical feature refinement network for object detection | |
| Li et al. | CoMatch: Dynamic Covisibility-Aware Transformer for Bilateral Subpixel-Level Semi-Dense Image Matching | |
| Zhu et al. | Robust LiDAR-camera alignment with modality adapted local-to-global representation | |
| Nai et al. | Learning a novel ensemble tracker for robust visual tracking | |
| Dai et al. | Exploring and exploiting high-order spatial–temporal dynamics for long-term frame prediction | |
| Li et al. | MC-Net: Integrating multi-level geometric context for two-view correspondence learning |