daily-papers

Daily Papers

Monocular 3d Object Detection
3D Visual Grounding

Updated on 2026.04.01

Monocular 3d Object Detection

Date	Title	Authors	PDF	Code	Comments
2026-3-28	Unleashing the Power of Chain-of-Prediction for Monocular 3D Object Detection	Zhihao Zhang et.al	paper	-	<summary>detail</summary>Journal ref:CVPR 2026
2026-3-27	Towards Intrinsic-Aware Monocular 3D Object Detection	Zhihao Zhang et.al	paper	-	<summary>detail</summary>This paper is accepted by CVPR 2026
2026-3-10	SPAN: Spatial-Projection Alignment for Monocular 3D Object Detection	Yifan Wang et.al	paper	-	<summary>detail</summary>Accepted by CVPR 2026
2026-3-10	SpikeSMOKE: Spiking Neural Networks for Monocular 3D Object Detection with Cross-Scale Gated Coding	Xuemei Chen et.al	paper	-	-
2026-3-8	Selective Transfer Learning of Cross-Modality Distillation for Monocular 3D Object Detection	Rui Ding et.al	paper	-	-
2026-2-24	Object-Scene-Camera Decomposition and Recomposition for Data-Efficient Monocular 3D Object Detection	Zhaonian Kuang et.al	paper	-	<summary>detail</summary>IJCV
2026-1-2	Mono3DV: Monocular 3D Object Detection with 3D-Aware Bipartite Matching and Variational Query DeNoising	Kiet Dang Vu et.al	paper	-	-
2025-11-25	Open Vocabulary Monocular 3D Object Detection	Jin Yao et.al	paper	code	<summary>detail</summary>3DV 2026
2025-11-17	Difficulty-Aware Label-Guided Denoising for Monocular 3D Object Detection	Soyul Lee et.al	paper	-	<summary>detail</summary>AAAI 2026 accepted
2025-11-14	Efficient Feature Aggregation and Scale-Aware Regression for Monocular 3D Object Detection	Yifan Wang et.al	paper	-	-
2025-11-11	MonoCLUE : Object-Aware Clustering Enhances Monocular 3D Object Detection	Sunghun Yang et.al	paper	-	<summary>detail</summary>AAAI 2026
2025-11-8	RaGS: Unleashing 3D Gaussian Splatting from 4D Radar and Monocular Cues for 3D Object Detection	Xiaokai Bai et.al	paper	-	-
2025-9-7	S-LAM3D: Segmentation-Guided Monocular 3D Object Detection via Feature Space Fusion	Diana-Alexandra Sas et.al	paper	-	-
2025-9-5	3D-MOOD: Lifting 2D to 3D for Monocular Open-Set Object Detection	Yung-Hsu Yang et.al	paper	-	<summary>detail</summary>ICCV 2025
2025-8-28	Adaptive Dual Uncertainty Optimization: Boosting Monocular 3D Object Detection under Test-Time Shifts	Zixuan Hu et.al	paper	-	<summary>detail</summary>Accepted by ICCV 2025 (Highlight)
2025-8-27	Generalizing Monocular 3D Object Detection	Abhinav Kumar et.al	paper	-	<summary>detail</summary>PhD Thesis submitted to MSU
2025-7-3	PLOT: Pseudo-Labeling via Video Object Tracking for Scalable Monocular 3D Object Detection	Seokyeong Lee et.al	paper	-	-
2025-6-14	MonoVQD: Monocular 3D Object Detection with Variational Query Denoising and Self-Distillation	Kiet Dang Vu et.al	paper	-	-
2025-4-25	LiDAR-Guided Monocular 3D Object Detection for Long-Range Railway Monitoring	Raul David Dominguez Sanchez et.al	paper	-	<summary>detail</summary>Accepted for the Data-Driven Learning for Intelligent Vehicle Applications Workshop at the 36th IEEE Intelligent Vehicles Symposium (IV) 2025
2025-4-10	MonoPlace3D: Learning 3D-Aware Object Placement for 3D Monocular Detection	Rishubh Parihar et.al	paper	code	<summary>detail</summary>CVPR 2025 Camera Ready

3D Visual Grounding

Date	Title	Authors	PDF	Code	Comments
2026-3-31	MVGGT: Multimodal Visual Geometry Grounded Transformer for Multiview 3D Referring Expression Segmentation	Changli Wu et.al	paper	code	<summary>detail</summary>CVPR 2026
2026-3-18	OmniVLN: Omnidirectional 3D Perception and Token-Efficient LLM Reasoning for Visual-Language Navigation across Air and Ground Platforms	Zhongyuang Liu et.al	paper	-	-
2026-3-9	UniGround: Universal 3D Visual Grounding via Training-Free Scene Parsing	Jiaxi Zhang et.al	paper	-	-
2026-2-19	JAEGER: Joint 3D Audio-Visual Grounding and Reasoning in Simulated Physical Environments	Zhan Liu et.al	paper	-	-
2026-2-3	Z3D: Zero-Shot 3D Visual Grounding from Images	Nikita Drozdov et.al	paper	code	-
2026-1-30	Learning Geometrically-Grounded 3D Visual Representations for View-Generalizable Robotic Manipulation	Di Zhang et.al	paper	-	-
2026-1-13	Reasoning Matters for 3D Visual Grounding	Hsiang-Wei Huang et.al	paper	-	<summary>detail</summary>2025 CVPR Workshop on 3D-LLM/VLA: Bridging Language
2025-12-31	OpenGround: Active Cognition-based Reasoning for Open-World 3D Visual Grounding	Wenyuan Huang et.al	paper	code	-
2025-12-30	MoniRefer: A Real-world Large-scale Multi-modal Dataset based on Roadside Infrastructure for 3D Visual Grounding	Panquan Yang et.al	paper	-	-
2025-12-28	UniPR-3D: Towards Universal Visual Place Recognition with Visual Geometry Grounded Transformer	Tianchen Deng et.al	paper	code	-
2025-12-23	PanoGrounder: Bridging 2D and 3D with Panoramic Scene Representations for VLM-based 3D Visual Grounding	Seongmin Jung et.al	paper	-	-
2025-12-9	View-on-Graph: Zero-shot 3D Visual Grounding via Vision-Language Reasoning on Scene Graphs	Yuanyuan Liu et.al	paper	-	-
2025-11-30	S$^2$-MLLM: Boosting Spatial Reasoning Capability of MLLMs for 3D Visual Grounding with Structural Guidance	Beining Xu et.al	paper	-	-
2025-11-10	Mono3DVG-EnSD: Enhanced Spatial-aware and Dimension-decoupled Text Encoding for Monocular 3D Visual Grounding	Yuzhen Li et.al	paper	-	-
2025-10-27	From Objects to Anywhere: A Holistic Benchmark for Multi-level Visual Grounding in 3D Scenes	Tianxu Wang et.al	paper	code	<summary>detail</summary>Update v3 of the NeurIPS 2025 Datasets and Benchmarks paper (v2)
2025-10-16	ChangingGrounding: 3D Visual Grounding in Changing Scenes	Miao Hu et.al	paper	code	-
2025-10-13	DSM: Constructing a Diverse Semantic Map for 3D Visual Grounding	Qinghongbing Xie et.al	paper	code	-
2025-9-19	Zero-Shot Visual Grounding in 3D Gaussians via View Retrieval	Liwei Liao et.al	paper	code	-
2025-9-12	Talk2PC: Enhancing 3D Visual Grounding through LiDAR and Radar Point Clouds Fusion for Autonomous Driving	Runwei Guan et.al	paper	code	-
2025-9-4	TriCLIP-3D: A Unified Parameter-Efficient Framework for Tri-Modal 3D Visual Grounding based on CLIP	Fan Li et.al	paper	-	-