Handvqa Cvpr 2026

HandVQA | CVPR 2026

HandVQA

Adaptive Spatial-Temporal Window: Unlocking the Potential of Event Cameras in Heterogeneous Velocity Scenarios Zhipeng Sui, ...

Title: Enhancing Hands in 3D Whole-Body Pose Estimation with Conditional Hands ModulatorWebsite: ...

Title:MU-GeNeRF: Multi-view Uncertainty-guided Generalizable Neural Radiance Fields for Distractor-aware Scene ...

Large-Scale Codec Avatars (LCA): The Unreasonable Effectiveness of Large-Scale Avatar Pretraining

Disentangle-then-Align: Non-Iterative Hybrid Multimodal Image Registration via Cross-Scale Feature Disentanglement.

This is the video presentation for the paper titled "Intra-class Distribution-guided Generative Hashing with Neighbor Refinement ...

Joonki Min, Chaeyun Kim, Hyungwook Choi, Yejin Kim, Kihyun Kim, Yohan Jo, Joonseok Lee. Fine-Grained Multi-Image Object ...

Video for CVPR26 paper "HumanBA: Human-Aware Bundle Adjustmentvia Global Human-Camera Decoupling"

MixerCSeg: An Efficient Mixer Architecture for Crack Segmentation via Decoupled Mamba Attention.

VIMCAN: Visual-Inertial 3D Human Pose Estimation with Hybrid Mamba-Cross-Attention Network.

This video presents GHPT, a novel framework for real-time relightable Gaussian Splatting using hybrid path tracing. Project Page: ...

UniPR: Unified Object-level Real-to-Sim Perception and Reconstruction from a Single Stereo Pair Project Page: ...

In this video, we introduce a novel video object detection framework called D2FANet. D2FANet is the first framework to jointly ...

Video2Robo: 3DGS-based Synthetic Data from One Video Enables Scalable Robot Learning Project page: ...

PA-Attack: Guiding Gray-Box Attacks on LVLM Vision Encoders with Prototypes and Attention.

CVPR 2026

TAPE: Task-Adaptive Prototype Evolution in Audio-Language Models for Fully Few-shot Class-incremental Audio Classification.