Cheng-You Lu

I am a PhD student at the University of Technology Sydney's Human-centric Artificial Intelligence Centre in Australia. My research focuses on developing a drone-based 3D reconstruction system. I am supervised by Prof. Chin-Teng Lin from UTS and co-advised by Prof. Yu-Lun Liu from NYCU, and I also collaborate informally with Prof. Srinath Sridhar from Brown University.

I received my Master's degree in Computer Science from Brown University under the supervision of Prof. Srinath Sridhar. I earned my Bachelor's degree in Computer Science from NYCU under the supervision of Prof. Wen-Hsiao Peng.

Email  /  CV  /  Scholar  /  Github  /  LinkedIn


News

  • 06/2026: One paper accepted by ECCV 2026
  • 12/2025: One paper accepted by IEEE Transactions on Artificial Intelligence 2025
  • 11/2025: One paper accepted by WACV 2026
  • 11/2025: One paper accepted by AAAI 2026
  • 10/2025: Served as a reviewer for TPAMI
  • 03/2024: Awarded the Taiwan Government Scholarship to study abroad 2024-2026
  • 02/2024: One paper accepted by CVPR 2024 as a Highlight

DF3DV-1K: A Large-Scale Dataset and Benchmark for Distractor-Free Novel View Synthesis
Cheng-You Lu, Yi-Shan Hung², Wei-Ling Chi², Hao-Ping Wang², Charlie Li-Ting Tsai², Yu-Cheng Chang, Yu-Lun Liu, Thomas Do, Chin-Teng Lin
ECCV, 2026
project page / arXiv

DF3DV-1K, a large-scale real-world dataset for distractor-free novel view synthesis, comprising 1,048 scenes with clean and cluttered images per scene, together with DI2FIX, a diffusion-based enhancement module that improves radiance field renderings.

Hestia: Voxel-Face-Aware Hierarchical Next-Best-View Acquisition for Efficient 3D Reconstruction
Cheng-You Lu, Zhuoli Zhuang, Nguyen Thanh Trung Le, Da Xiao, Yu-Cheng Chang, Thomas Do, Srinath Sridhar, Chin-Teng Lin
WACV, 2026
project page / arXiv

Hestia, a generalizable RL-based next-best-view planner that actively predicts viewpoints for data capture in 3D reconstruction tasks.

Multi-View Clustering with Granularity-Aware Pseudo Supervision
Jie Yang, Cheng-You Lu, Zhongli Wang, Hsiang-Ting Chen, Guang-Kui Xu, Chenglong Zhang, Shuting Dong, Xinyan Liang, Bingbing Jiang
AAAI, 2026
project page / arXiv

GAPS, a Granularity-Aware Pseudo Supervision framework for multi-view clustering, adaptively generates hierarchical pseudo-labels and selects reliable views via a Separation-Compactness Index to enhance robust and discriminative representation learning.

HFGaussian: Learning Generalizable Gaussian Human with Integrated Human Features
Arnab Dey, Cheng-You Lu, Andrew I. Comport, Srinath Sridhar, Chin-Teng Lin, Jean Martinet
IEEE Transactions on Artificial Intelligence, 2025
project page / arXiv

HFGaussian, a generalizable 3D Gaussian Splatting method that renders novel views and estimates human features, including the 3D skeleton, 3D keypoints, and dense pose, from sparse input images in real time.

AEGIS: Human Attention-based Explainable Guidance for Intelligent Vehicle Systems
Zhuoli Zhuang, Cheng-You Lu, Yu-Cheng Fred Chang, Yu-Kai Wang, Thomas Do, Chin-Teng Lin
CHI, 2025
project page / arXiv

AEGIS, a Human Attention-based Explainable Guidance for Intelligent Vehicle Systems, leverages a pretrained human attention model to identify critical regions of interest for decision-making.

DiVa-360: The Dynamic Visual Dataset for Immersive Neural Fields
Cheng-You Lu¹, Peisen Zhou¹, Angela Xing¹, Chandradeep Pokhariya, Arnab Dey, Ishaan N Shah, Rugved Mavidipalli, Dylan Hu, Andrew Comport, Kefan Chen, Srinath Sridhar
CVPR, 2024 (Highlight)
project page / arXiv

DiVa-360, a high-quality and high-frame-rate multi-view dataset for long-duration dynamic radiance fields.

NeuralODF: Learning Omnidirectional Distance Fields for 3D Shape Representation
Trevor Houchens¹, Cheng-You Lu¹, Shivam Duggal, Rao Fu, Srinath Sridhar
Technical Report, 2022
project page / arXiv

Omnidirectional Distance Fields (ODFs), a 3D shape representation that stores distances from any 3D position in any direction, along with efficient algorithms for converting ODFs to and from common 3D formats.

Video Rescaling Networks with Joint Optimization Strategies for Downscaling and Upscaling
Yan-Cheng Huang, Yi-Hsin Chen, Cheng-You Lu, Hui-Po Wang, Wen-Hsiao Peng, Ching-Chun Huang
CVPR, 2021
project page / arXiv

Multi-input Multi-output Video Rescaling Network (MIMO-VRN), a new strategy for downscaling and upscaling a group of video frames simultaneously.

Weakly-Supervised Image Semantic Segmentation Using Graph Convolutional Networks
Shun-Yi Pan¹, Cheng-You Lu¹, Shih-Po Lee, Wen-Hsiao Peng
ICME, 2021
project page / arXiv

A GCN-based framework for weakly-supervised image semantic segmentation, improving pseudo label quality via Laplacian and entropy regularization.


Template from Jon Barron