如视 真实如你所视,数字空间与空间智能综合解决方案引领者 - 如视
下载APP联系我们
  • VR空间
  • 商城
下载客户端联系我们
如视 真实如你所视,数字空间与空间智能综合解决方案引领者 - 如视
产品
  • 平台与应用
    • 一站式 VR 全景制作平台
      一站式 VR 全景制作平台
      热门
      从创作到管理,你的一站式 VR 管家
    • VR 浏览
      VR 浏览
      沉浸式 VR 漫游,让空间细节尽在掌握
  • 自研采集产品
    • 伽罗华 P4
      伽罗华 P4
      热门
      升级款 3D 激光扫描仪,极精秒速采集
    • 伽罗华 M4
      伽罗华 M4
      NEW
      3D 激光扫描仪,空间数字化一步到位
    • 伽罗华 M2
      伽罗华 M2
      3D 激光扫描仪,适配多种空间需求
    • 庞加莱 R1
      庞加莱 R1
      NEW
      小巧轻便,手持实景扫描仪新标杆
    • 庞加莱 S1
      庞加莱 S1
      手持实景扫描仪,超大空间三维重建
    • Realsee G2
      Realsee G2
      热门
      激光 VR 采集云台,室内空间采集优选
  • 兼容采集方式
    • 全景图生成 VR
      全景图生成 VR
      上传全景图,全流程自动化生成 VR
    • 手机拍 VR
      手机拍 VR
      拿起手机拍 VR,随时随地记录空间
    • 全景相机拍 VR
      全景相机拍 VR
      主流相机一键连,2D 全景自动转 VR
  • 能力展示
    • 如视空间数据
      海量空间数据,丰富产物,立即下载了解
    • Argus 大模型
      图片一键还原立体世界,业内首创大模型
  • 下载 如视VR App
    下载 如视VR App
    超越想象的简单,手机也能轻松拍 VR ,随时随地体验空间采集
  • 上门采集服务
    上门采集服务
    专属摄影师上门,覆盖全国 100+ 主要城市,支持灵活调整方案
解决方案
  • 场景类
    • 营销宣传
      营销宣传
      让每一处场景都成获客入口
    • 导航导览
      导航导览
      3D 精准定位,沉浸式导览新体验
    • 孪生底座
      孪生底座
      空间智能 AI,高效全场景数字孪生
    • 多维式培训
      多维式培训
      1:1 实景还原,多维赋能实战培训
    • 远程协作
      远程协作
      提升效率,大幅降低沟通成本
    • 现场踏勘
      现场踏勘
      全维度记录,让踏勘作业更便捷
  • 行业类
    • 房产租售
      房产租售
      热门
      沉浸 VR 带看,真信任,促成交
    • 特展临展
      特展临展
      热门
      省心落地,高效数字化临展
    • 餐饮酒旅
      餐饮酒旅
      餐厅 VR 上线美团,预定转化快人一步
    • 商业零售
      商业零售
      降本提效,打造场景化购物新体验
    • 家装
      家装
      VR 记录全流程,AI 设计一键赋能营销
    • 家居家电
      家居家电
      场景闭环,打通线上销售最后 1 公里
    • 智慧量房
      智慧量房
      NEW
      4 分钟测量,15 分钟直出 CAD
    • 工厂园区
      工厂园区
      数字孪生工厂大幅提高作业效率
    • 展厅展馆
      展厅展馆
      构建线上观展体验,打破时空局限
    • 公共事务
      公共事务
      三维现勘,为社会公正提供证据支撑
    • 道路事故快勘
      道路事故快勘
      精准固证数字化留痕,现场快勘快处快撤
  • 联系我们
    联系我们
    专业顾问N对1全程服务,为您答疑解惑
  • 精彩案例
    精彩案例
    行业落地项目集锦,直观展现应用价值
VR空间
合作与支持
  • 服务与支持
    • 帮助中心
      常见问题与使用指南,快速获取答案
    • 学习中心
      系列视频课程,轻松学会 VR 制作
    • 开发者手册
      开发者必读,实现快速接入
  • 合作生态
    • 开放平台
      共建三维生态,互联大千世界
    • 渠道与生态
      携手如视,成为如视生态合作伙伴
  • 了解我们
    • 新闻中心
      权威发布企业动态,实时呈现行业热点
    • 案例研究
      沉淀客户实践,赋能业务增长
    • 产品功能与更新
      掌握产品更新,高效落地数字化需求
  • 如视介绍
    如视介绍
    数字空间综合解决方案引领者
  • 联系我们
    联系我们
    专业顾问N对1全程服务,为您答疑解惑
商城
  • 去制作 VR
  • 登录
  • 联系我们
学术成果与能力解读

EDM: Efficient Deep Feature Matching

更新于 2025年08月08日

Abstract

Recent feature matching methods have achieved remarkable performance but lack efficiency consideration. In this paper, we revisit the mainstream detector-free matching pipeline and improve all its stages considering both accuracy and efficiency. We propose an Efficient Deep feature Matching network, EDM. We first adopt a deeper CNN with fewer dimensions to extract multi-level features. Then we present a Correlation Injection Module that conducts feature transformation on high-level deep features, and progressively injects feature correlations from global to local for efficient multi-scale feature aggregation, improving both speed and performance. In the refinement stage, a novel lightweight bidirectional axis-based regression head is designed to directly predict subpixel-level correspondences from latent features, avoiding the significant computational cost of explicitly locating keypoints on high-resolution local feature heatmaps. Moreover, effective selection strategies are introduced to enhance matching accuracy. Extensive experiments show that our EDM achieves competitive matching accuracy on various benchmarks and exhibits excellent efficiency, offering valuable best practices for real-world applications.


图片1.png

Figure 1. Comparison of Matching Accuracy and Latency. Our method achieves competitive accuracy with lower latency. Models are evaluated on the ScanNet dataset to get AUC@5◦ accuracy, while the latency for an image pair with 640×480 resolution is measured on a single NVIDIA 3090 GPU.



https://vr-public.realsee-cdn.cn/release/pagepress/HrMFRAlh/uploads/cd3e2aa3-27f6-48bf-abec-81b861c75932.png

Figure 2. Pipeline Overview. (a) A deeper CNN backbone is adopted to extract multi-level feature maps. (b) In the Correlation Injection Module, we alternately apply self-attention and cross-attention a total of L times to capture and transform the correlations between deep feature FA_d and FB_d. Subsequently, two Injection Layers are employed to progressively inject feature correlations from deep to local levels.(c) After the CIM, the coarse features FA_c and FB_c are flattened and then correlated to produce the similarity matrix. To establish coarse matches, we determine the row-wise maxima in the probability matrix and select the top K values among them. (d) For fine-level matching, the corresponding fine features are extracted by the indices obtained from the coarse matching process. We treat the fine features FA_q and FB_q as queries, while considering the same features but in reversed order, FB_r and FA_r, as eferences. The query and reference features are encoded separately and then merged together. Then, a lightweight regression head is designed to estimate the reference offsets on the X and Y axes, respectively. The final matches are obtained by adding the coarse matches to their corresponding offsets.

 


https://vr-public.realsee-cdn.cn/release/pagepress/nGtpmIuU/uploads/667c7506-c27b-436e-9946-0e64cd6b87cc.png

Figure 3. Bidirectional Refinement. For a coarse matching pair, the center point of one grid serves as query for fine matching, and its corresponding reference point is offset from the center point in another grid, exhibiting duality.



https://vr-public.realsee-cdn.cn/release/pagepress/uHjkwxLl/uploads/5d3e1786-9c67-4b01-aacd-fc4821459c31.png

Table 1. Results of Relative Pose Estimation on the MegaDepth Dataset and ScanNet Dataset. The models are trained on the MegaDepth dataset to evaluate all methods on both datasets. The AUC of relative pose error at multiple thresholds, and the average inference time on the ScanNet dataset for pairwise image of 640×480 resolution is provided


 

https://vr-public.realsee-cdn.cn/release/pagepress/MBPsrSDP/uploads/4c5f5a7c-69c7-44f6-84ce-5950dfa942a6.png

.Table 2. Homography estimation on HPatches.



https://vr-public.realsee-cdn.cn/release/pagepress/kRGCVuyh/uploads/f4b498a6-c824-43c6-92f0-0c557c916ca8.png

 


Figure 4. Attention Visualization. (a) Deep correlations. The green dots represent the query points. (b) Injection weights. Significant response values usually located in detail-rich regions.


https://vr-public.realsee-cdn.cn/release/pagepress/rgIoSJyM/uploads/58fe388a-45be-4b5e-aec6-02636c739b11.png

Table 3. Comparison of image size on the MegaDepth dataset.



https://vr-public.realsee-cdn.cn/release/pagepress/bMcdtsFB/uploads/596af833-83b1-47da-9cee-d15cce6508ca.png

Table 4. Runtime comparisons for each stage on ScanNet dataset.



https://vr-public.realsee-cdn.cn/release/pagepress/xAdZztAU/uploads/1f52f638-cc72-4cae-be9e-915dfadb0d26.png

Figure 5. Qualitative Comparisons. Compared with LoFTR and EfficientLoFTR, our method is more robust in scenarios with large viewpoint changes and repetitive semantics. The red color indicates epipolar error beyond 5e-4 in the normalized image coordinates.


 

https://vr-public.realsee-cdn.cn/release/pagepress/OufTAJeO/uploads/d2eed0e7-14c6-49a4-9919-a9c93dd855c6.png

Table 5. Ablation studies on the MegaDepth dataset at all steps, with average running times measured at 1152×1152 resolution.

下一篇

Multi-view Inverse Rendering for Large-scale Real-world Indoor Scenes
  • 产品
    • 伽罗华 Galois P4
    • Realsee G2
    • 庞加莱 Poincare
    • 手机拍 VR
  • 解决方案
    • 展厅展馆
    • 商业零售
    • 工厂园区
    • 房产租售
  • 关于我们
    • 如视介绍
    • 新闻媒体
    • 联系我们
  • 快速链接
    • 管理后台
    • 开放平台
    • 法律协议
    • 维修售后
  • 联系我们
    • 电话:400-897-9658
    • 时间:工作日 9:00-19:00(北京时间)
    • 邮箱:mkt@realsee.com
    • 地址:北京市海淀区上地六街弘源首著大厦
  • 关注我们
    扫描下方二维码关注我们公众号关注我们微博
    扫码关注我们的微信公众号
    微信公众号
如视Realsee
如你之视(北京)科技有限公司|地址:北京市海淀区信息路7号弘源首著大厦1号楼8层|电话:400-897-9658
©Copyright2026 realsee.com版权所有|营业执照|ICP|京ICP备2022009190号-3
违法和不良信息举报电话:010-8644 0676|违法和不良信息举报邮箱:complaint@realsee.com|京公网安备 11010802039437号,logo京公网安备 11010802039437号

更多「学术成果与能力解读」

Multi-view Inverse Rendering for Large-scale Real-world Indoor Scenes

Multi-view Inverse Rendering for Large-scale Real-world Indoor Scenes

2024年04月25日
 PhyIR: Physics-Based Inverse Rendering for Panoramic Indoor Images

PhyIR: Physics-Based Inverse Rendering for Panoramic Indoor Images

2024年04月25日
Floorplan Generation from 3D Point Clouds: A Space Partitioning Approach

Floorplan Generation from 3D Point Clouds: A Space Partitioning Approach

2024年04月25日
查看更多