Basic Information
I am a Ph.D. student in the Computer Vision and Learning Group (VLG) at ETH Zürich, where I am supervised by Prof. Siyu Tang. Prior to this, I received my bachelor’s degree and master’s degree from Tsinghua University.
Social
Publications
Authors:Yutong Chen, Yiming Wang, Xucong Zhang, Sergey Prokudin, Siyu Tang
GGPT can use reliable geometric guidance to augment various feed-forward method for 3D reconstruction.SplatFormer: Point Transformer for Robust 3D Gaussian Splatting
Conference: The Thirteenth International Conference on Learning Representations (ICLR 2025) spotlight presentation
Authors:Yutong Chen, Marko Mihajlovic, Xiyi Chen, Yiming Wang, Sergey Prokudin, Siyu Tang
We analyze the performance of novel view synthesis methods in challenging out-of-distribution (OOD) camera views and introduce SplatFormer, a data-driven 3D transformer designed to refine 3D Gaussian splatting primitives for improved quality in extreme camera scenarios.Authors:Gen Li, Yutong Chen*, Yiqian Wu*, Kaifeng Zhao*, Marc Pollefeys, Siyu Tang (*equal contribution)
EgoM2P: A large-scale egocentric multimodal and multitask model, pretrained on eight extensive egocentric datasets. It incorporates four modalities—RGB and depth video, gaze dynamics, and camera trajectories—to handle challenging tasks like monocular egocentric depth estimation, camera tracking, gaze estimation, and conditional egocentric video synthesis