
CVPR 2024 Visual Programming for Zero-shot Open-Vocabulary 3D Visual Grounding - Detailed Analysis & Overview

[CVPR-2023] Advancing Visual Grounding with Scene Knowledge: Benchmark and Method
Disentangle-then-Align: Non-Iterative Hybrid Multimodal Image Registration via Cross-Scale Feature Disentanglement
Adaptive Spatial-Temporal Window: Unlocking the Potential of Event Cameras in Heterogeneous Velocity Scenarios (Zhipeng Sui, ...)
[CVPR 2026] VGent: Visual Grounding via Modular Design for Disentangling Reasoning and Prediction
[CVPR 2024] 3D Geometry-aware Deformable Gaussian Splatting for Dynamic View Synthesis
[CVPR 2026] PV-Ground: Text-Guided Point-Voxel Interaction for 3D Visual Grounding

Photo Gallery

[CVPR 2024] Visual Programming for Zero-shot Open-Vocabulary 3D Visual Grounding
Open3DSG [CVPR 2024]
[CVPR 2024] Test-Time Zero-Shot Temporal Action Localization
[CVPR 2024] Open3DIS: Open-Vocabulary 3D Instance Segmentation with 2D Mask Guidance
[CVPR-2023] Advancing Visual Grounding with Scene Knowledge: Benchmark and Method
Improved Zero-Shot Classification by Adapting VLMs with Text Descriptions [CVPR 2024]
[CVPR'23] OpenScene: 3D Scene Understanding with Open Vocabularies
[CVPR 2026] Adaptive Spatial-Temporal Window
[CVPR 2026] VGent: Visual Grounding via Modular Design for Disentangling Reasoning and Prediction
[CVPR 2024 Highlight] Feature 3DGS (5 min talk)
(CVPR 2023) PLA: Language-Driven Open-Vocabulary 3D Scene Understanding