即插即用型

OCM3D: Object-Centric Monocular 3D Object Detection

🏛️ 会议/期刊：arxiv 📅 发表年份：2021 💻 开源代码：OBMO_GUPNet 📄 论文题目：OCM3D: Object-Centric Monocular 3D Object Detection 1. 文献背景、研究目的与核心问题研究背景：单目 3D 目标检测（Monocular 3D Object Detection）是一个高度病态（ill-posed）的问题。主流方法通常依赖纯图像或将其转化为伪激光雷达（Pseudo-LiDAR）点云。然而，前者难以捕捉像素间的 3D 空间几何关系，后者则受困于单目深度估计带来的巨大点云噪声。 ...

LR3D: Improving Distant 3D Object Detection Using 2D Box Supervision

🏛️ 会议/期刊：CVPR 📅 发表年份：2024 💻 开源代码：无 📄 论文题目：Improving Distant 3D Object Detection Using 2D Box Supervision 这篇由 NVIDIA 等机构的研究人员发表在 CVPR 2024 的重磅论文《Improving Distant 3D Object Detection Using 2D Box Supervision》(简称 LR3D)，切入了一个目前高阶自动驾驶极其头疼的落地难题：远距离感知（Long-Range Detection）。它展示了如何用最廉价的标注，榨取单目视觉在远距离上的极限潜力。 ...

OBMO: One Bounding Box Multiple Objects for Monocular 3D Object Detection

🏛️ 会议/期刊：IEEE TIP 📅 发表年份：2023 💻 开源代码：mrsempress/OBMO_patchnet 📄 论文题目：OBMO: One Bounding Box Multiple Objects for Monocular 3D Object Detection 这篇发表于 IEEE TIP (2023) 的经典论文《OBMO: One Bounding Box Multiple Objects for Monocular 3D Object Detection》切入点非常犀利。它没有在复杂的网络主干上做文章，而是直击单目 3D 目标检测在“底层数学物理逻辑”上的痛点，提出了一种极其优雅的“即插即用（Plug-and-play）”训练策略。 ...

MonoXiver： Monocular 3D Object Detection with Bounding Box Denoising in 3D by Perceiver

🏛️ 会议/期刊：ICCV 📅 发表年份：2023 💻 开源代码：Xianpeng919/monoxiver (ICCV'23) 📄 论文题目：Monocular 3D Object Detection with Bounding Box Denoising in 3D by Perceiver 这篇发表于 ICCV 2023 的论文《Monocular 3D Object Detection with Bounding Box Denoising in 3D by Perceiver》(简称 MonoXiver)，提供了一个非常经典且极具工程价值的“自上而下（Top-down）”纠错思路。它并没有试图发明一种全新的主干网络，而是设计了一个强大的“插件”，专门用来拯救那些定位不准的预测框。 ...

PLOT: Pseudo-Labeling via Object Tracking for Monocular 3D Object Detection

🏛️ 会议/期刊：ICLR 📅 发表年份：2026 💻 开源代码：无 📄 论文题目：PLOT: Pseudo-Labeling via Object Tracking for Monocular 3D Object Detection 一、背景、研究目的与核心问题研究背景：单目 3D 目标检测模型极度“吃数据”。然而，人工标注 3D 边界框极其昂贵且耗时，导致目前带 3D 标签的数据集规模很小，严重限制了模型的泛化能力。 ...

Object-Scene-Camera Decomposition and Recomposition for Data-Efficient Monocular 3D Object Detection

🏛️ 会议/期刊：IJCV 📅 发表年份：2026 💻 开源代码：GitHub 链接 📄 论文题目：Object-Scene-Camera Decomposition and Recomposition for Data-Efficient Monocular 3D Object Detection 一、背景、研究目的与核心问题研究背景：单目 3D 目标检测本身是一个病态（ill-posed）问题，由于缺乏深度信息，模型极度依赖海量、多样化的训练数据来学习鲁棒的特征。 ...