Media Summary: MERL researcher Pedro Miraldo presents the paper “Revisiting Monocular SLAM with Disentangle-then-Align: Non-Iterative Hybrid Multimodal Image Registration via Cross-Scale Feature Disentanglement.” Chengxing Lin, Jinhong Deng, Yinjie Lei, Wen Li. "Deformation-based In-Context Learning for Point Cloud Understanding."
CVPR 2026 Adaptive Spatial Temporal Window - Detailed Analysis & Overview
VIMCAN: Visual-Inertial 3D Human Pose Estimation with Hybrid Mamba-Cross-Attention Network. In this video, we introduce a novel video object detection framework called D2FANet. D2FANet is the first framework to jointly ... PixARMesh is a mesh-native autoregressive framework for single-view 3D scene reconstruction. Instead of reconstructing via ...
Title: Scene-Centric Unsupervised Video Panoptic Segmentation. Authors: Christoph Reich*, Oliver Hahn*, Nikita Araslanov, ... Although significant progress has been made in image deraining, we note that most existing methods are often developed for only ... Toward Real-world Infrared Image Super-Resolution: A Unified Autoregressive Framework and Benchmark Dataset Official ... HandVQA: Diagnosing and Improving Fine-Grained