Patrick Rim

I am a Ph.D. student (2024 – [Expected] 2028) at the Yale Vision Lab researching 3D vision and multimodal AI systems, advised by Prof. Alex Wong. Previously, I completed my B.S. in Computer Science and Information/Data Sciences at the California Institute of Technology (Caltech).

I am also currently a Research Scientist Intern at Meta Reality Labs, working on the Extended Reality (XR) team with Kun He to build something new and ambitious! 🚀

What I Research and Why

My work is centered on building embodied AI agents with adaptive, efficient, and robust perception, as well as multimodal capabilities spanning vision, language, and range sensing. To that end, I develop methods for recognition (e.g., multi-sensor 3D object detection in autonomous driving and AR/VR settings), reconstruction (e.g., forecasting dynamic scenes by learning latent ODEs that temporally extrapolate deformable 3D Gaussian splats), and generation (e.g., text-conditional depth map generation with diffusion models). Currently, I am exploring in-the-wild egocentric hand-object tracking to enable contextualized input in unconstrained real-world environments.

Recent Publications

ProtoDepth: Unsupervised Continual Depth Completion with Prototypes
Patrick Rim, Hyoungseob Park, S. Gangopadhyay, Ziyao Zeng, Younjoon Chung, Alex Wong
CVPR 2025

Radar-Guided Polynomial Fitting for Metric Depth Estimation
Patrick Rim, Hyoungseob Park, Vadim Ezhov, Jeffrey Moon, Alex Wong
Under Review

ETA: Energy-based Test-time Adaptation for Depth Completion
Younjoon Chung*, Hyoungseob Park*, Patrick Rim*, Xiaoran Zhang, Jihe He, Ziyao Zeng, Safa Cicek, Byung-Woo Hong, James S. Duncan, Alex Wong
ICCV 2025

Extending Foundational Monocular Depth Estimators to Fisheye Cameras with Calibration Tokens
S. Gangopadhyay*, Jung-Hee Kim*, Xien Chen*, Patrick Rim, Hyoungseob Park, Alex Wong
ICCV 2025

ODE-GS: Latent ODEs for Dynamic Scene Extrapolation with 3D Gaussian Splatting
Daniel Wang, Patrick Rim, Tian Tian, Alex Wong, Ganesh Sundaramoorthi
Under Review

SparseFusion: Fusing Multi-Modal Sparse Representations for Multi-Sensor 3D Object Detection
Yichen Xie, Chenfeng Xu, MJ Rakotosaona, Patrick Rim, Federico Tombari, Kurt Keutzer, Masayoshi Tomizuka, Wei Zhan
ICCV 2023

Quadric Representations for LiDAR Odometry, Mapping and Localization
Chao Xia*, Chenfeng Xu*, Patrick Rim, Mingyu Ding, Nanning Zheng, Kurt Keutzer, Masayoshi Tomizuka, Wei Zhan
RA-L 2023

* denotes Equal Contribution