Media Summary: Disentangle-then-Align: Non-Iterative Hybrid Multimodal Image Registration via Cross-Scale Feature Disentanglement. Title:MU-GeNeRF: Multi-view Uncertainty-guided Generalizable Neural Radiance Fields for Distractor-aware Scene ... VIMCAN: Visual-Inertial 3D Human Pose Estimation with Hybrid Mamba-Cross-Attention Network.
Cvpr 2026 Locateanything3d - Detailed Analysis & Overview
Disentangle-then-Align: Non-Iterative Hybrid Multimodal Image Registration via Cross-Scale Feature Disentanglement. Title:MU-GeNeRF: Multi-view Uncertainty-guided Generalizable Neural Radiance Fields for Distractor-aware Scene ... VIMCAN: Visual-Inertial 3D Human Pose Estimation with Hybrid Mamba-Cross-Attention Network. MoCapAnything: Unified 3D Motion Capture for Arbitrary Skeletons from Monocular Videos View-Aware Semantic Alignment for Aerial-Ground Person Re-Identification. Authors: Matteo Ballegeer, Dries F. Benoit Paper: Google Scholar: ...
OccAny is the first generalized model for metric 3D occupancy prediction in unconstrained urban scenes — no in-domain ... Toward Real-world Infrared Image Super-Resolution: A Unified Autoregressive Framework and Benchmark Dataset Official ... Current millimeter-wave (mmWave) datasets for human pose estimation (HPE) are scarce and lack diversity in both point cloud ... Code: github.com/brucee1323/Exact-GS This is our Explanation video for TraceGen: World Modeling in 3D Trace-Space Enables Learning from Cross-Embodiment Videos ( Demo for UniPart: Part-Level 3D Generation with Unified 3D Geom-Seg Latents - Project page: ...