Patrick Rim

I am a Ph.D. student (2024 – [Expected] 2028) at the Yale Vision Lab researching 3D vision and multimodal AI systems. I completed my B.S. in Computer Science & Information/Data Sciences at Caltech. Previously, I was a Research Scientist Intern at Meta Reality Labs, building SHOW3D for in-the-wild 3D hand-object pose estimation.

Currently, I am a Research Intern at NVIDIA Research in Santa Clara, exploring world models and egocentric perception! 🚀

What I Research and Why

My work is centered on building embodied AI agents with adaptive, efficient, and robust perception, as well as multimodal capabilities spanning vision, language, and range sensing. To that end, I develop methods for recognition (e.g., multi-sensor 3D object detection in autonomous driving and AR/VR settings), reconstruction (e.g., forecasting dynamic scenes by learning latent ODEs that temporally extrapolate deformable 3D Gaussian splats), and generation (e.g., text-conditional depth map generation with diffusion models).

Recent Publications

SHOW3D: Capturing Scenes of 3D Hands and Objects in the Wild
Patrick Rim, Kevin Harris, Braden Copple, Shangchen Han, Xu Xie, Ivan Shugurov, Sizhe An, He Wen, Alex Wong, Tomas Hodan, Kun He
CVPR 2026

Radar-Guided Polynomial Fitting for Metric Depth Estimation
Patrick Rim, Hyoungseob Park, Vadim Ezhov, Jeffrey Moon, Alex Wong
CVPR 2026

Iris: Integrating Language into Diffusion-based Monocular Depth Estimation
Ziyao Zeng, Jingcheng Ni, Daniel Wang, Patrick Rim, Younjoon Chung, Fengyu Yang, Byung-Woo Hong, Alex Wong
CVPR 2026

ODE-GS: Latent ODEs for Dynamic Scene Extrapolation with 3D Gaussian Splatting
Daniel Wang, Patrick Rim, Tian Tian, Alex Wong, Ganesh Sundaramoorthi
ICLR 2026

Unsupervised Depth Completion via Occluded Region Completion as Supervision
Hyoungseob Park, Runjian Chen, Patrick Rim, Dong Lao, Alex Wong
ICLR 2026

ProtoDepth: Unsupervised Continual Depth Completion with Prototypes
Patrick Rim, Hyoungseob Park, S. Gangopadhyay, Ziyao Zeng, Younjoon Chung, Alex Wong
CVPR 2025

ETA: Energy-based Test-time Adaptation for Depth Completion
Younjoon Chung*, Hyoungseob Park*, Patrick Rim*, Xiaoran Zhang, Jihe He, Ziyao Zeng, Safa Cicek, Byung-Woo Hong, James S. Duncan, Alex Wong
ICCV 2025

Extending Foundational Monocular Depth Estimators to Fisheye Cameras with Calibration Tokens
S. Gangopadhyay*, Jung-Hee Kim*, Xien Chen*, Patrick Rim, Hyoungseob Park, Alex Wong
ICCV 2025

SparseFusion: Fusing Multi-Modal Sparse Representations for Multi-Sensor 3D Object Detection
Yichen Xie, Chenfeng Xu, MJ Rakotosaona, Patrick Rim, Federico Tombari, Kurt Keutzer, Masayoshi Tomizuka, Wei Zhan
ICCV 2023

Quadric Representations for LiDAR Odometry, Mapping and Localization
Chao Xia*, Chenfeng Xu*, Patrick Rim, Mingyu Ding, Nanning Zheng, Kurt Keutzer, Masayoshi Tomizuka, Wei Zhan
RA-L 2023

* denotes Equal Contribution