Hao Phung

CS PhD student @ Cornell University, Cornell Tech, NYC

prof_pic.jpg

Manhattan, NYC

Greetings! I am a PhD student in Computer Science at Cornell Tech, NYC, advised by Hadar Averbuch-Elor. My research interests are in generative models and multimodal computer vision, with a current interest in controllable generation.

Previously, I was an AI Research Resident at VinAI, where I was fortunate to work under the mentorship of Dr. Anh Tran. Before that, I received my Bachelor of Computer Science from Ho Chi Minh City University of Science (HCMUS) in Vietnam.

During my PhD study, I am fortunate to intern at Apple AI/ML.

News

Mar 30, 2026 :sparkles: Raster2Seq and Prox-E have been accepted to SIGGRAPH 2026.
Feb 14, 2026 :sparkles: HyperCT, a paper I co-authored, has been accepted to MIDL 2026.
Sep 18, 2025 :sparkles: E2D2, a paper I co-authored, has been accepted to NeurIPS 2025.
Jun 02, 2025 :sunny: I have joined Apple AIML as a research intern, working with Haoxuan You.
Jan 22, 2025 :sparkles: Discrete Diffusion Guidance has been accepted to ICLR 2025.
Dec 09, 2024 :sparkles: Self-Corrected Flow Distillation has been accepted to AAAI 2025.
Sep 25, 2024 :sparkles: DiMSUM has been accepted to NeurIPS 2024.
Aug 26, 2024 :baby_chick: I started my PhD studies at Cornell University.
Jul 14, 2023 :sparkles: Anti-DreamBooth has been accepted to ICCV 2023.
Feb 28, 2023 :sparkles: WaveDiff has been accepted to CVPR 2023.

Publications

* denotes equal contribution

  1. Prox-E: Fine-Grained 3D Shape Editing via Primitive-Based Abstractions
    Etai Sella*, Hao Phung*, Nitay Amiel, Or Litany, Or Patashnik, and Hadar Averbuch-Elor
    In Special Interest Group on Computer Graphics and Interactive Techniques Conference Conference Papers, 2026
  2. Raster2Seq: Polygon Sequence Generation for Floorplan Reconstruction
    Hao Phung and Hadar Averbuch-Elor
    In Special Interest Group on Computer Graphics and Interactive Techniques Conference Conference Papers, 2026
  3. Simple Guidance Mechanisms for Discrete Diffusion Models
    Yair Schiff*, Subham Sekhar Sahoo*, Hao Phung*, Guanghan Wang*, Sam Boshar, Hugo Dalla-torre, Bernardo P Almeida, Alexander Rush, Thomas Pierrot, and Volodymyr Kuleshov
    In The Thirteenth International Conference on Learning Representations, 2025
  4. Self-Corrected Flow Distillation for Consistent One-Step and Few-Step Image Generation
    In The 39th Annual AAAI Conference on Artificial Intelligence (AAAI), 2025
  5. DiMSUM: Diffusion Mamba - A Scalable and Unified Spatial-Frequency Method for Image Generation
    In The Thirty-eighth Annual Conference on Neural Information Processing Systems (NeurIPS), 2024
  6. Anti-DreamBooth: Protecting users from personalized text-to-image synthesis
    Thanh Van Le*, Hao Phung*, Thuan Hoang Nguyen*, Quan Dao*, Ngoc Tran, and Anh Tran
    In Proceedings of the IEEE/CVF International Conference on Computer Vision (ICCV), 2023
  7. Wavelet Diffusion Models are fast and scalable Image Generators
    Hao Phung*, Quan Dao*, and Anh Tran
    In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 2023
  1. Prox-E: Fine-Grained 3D Shape Editing via Primitive-Based Abstractions
    Etai Sella*, Hao Phung*, Nitay Amiel, Or Litany, Or Patashnik, and Hadar Averbuch-Elor
    In Special Interest Group on Computer Graphics and Interactive Techniques Conference Conference Papers, 2026
  2. Raster2Seq: Polygon Sequence Generation for Floorplan Reconstruction
    Hao Phung and Hadar Averbuch-Elor
    In Special Interest Group on Computer Graphics and Interactive Techniques Conference Conference Papers, 2026
  3. HyperCT: Low-Rank Hypernet for Unified Chest CT Analysis
    Fengbei Liu, Sunwoo Kwak, Hao Phung, Nusrat Binta Nizam, Ilan Richter, Nir Uriel, Hadar Averbuch-Elor, Deborah Estrin, and Mert R. Sabuncu
    In Medical Imaging with Deep Learning, 2026
  4. Encoder-Decoder Diffusion Language Models for Efficient Training and Inference
    Marianne Arriola, Yair Schiff, Hao Phung, Aaron Gokaslan, and Volodymyr Kuleshov
    In The Thirty-ninth Annual Conference on Neural Information Processing Systems, 2025
  5. Simple Guidance Mechanisms for Discrete Diffusion Models
    Yair Schiff*, Subham Sekhar Sahoo*, Hao Phung*, Guanghan Wang*, Sam Boshar, Hugo Dalla-torre, Bernardo P Almeida, Alexander Rush, Thomas Pierrot, and Volodymyr Kuleshov
    In The Thirteenth International Conference on Learning Representations, 2025
  6. Self-Corrected Flow Distillation for Consistent One-Step and Few-Step Image Generation
    In The 39th Annual AAAI Conference on Artificial Intelligence (AAAI), 2025
  7. DiMSUM: Diffusion Mamba - A Scalable and Unified Spatial-Frequency Method for Image Generation
    In The Thirty-eighth Annual Conference on Neural Information Processing Systems (NeurIPS), 2024
  8. lfm_archi.png
    Flow Matching in Latent Space
    Quan Dao*, Hao Phung*, Binh Nguyen, and Anh Tran
    arXiv preprint arXiv:2307.08698, 2023
  9. Anti-DreamBooth: Protecting users from personalized text-to-image synthesis
    Thanh Van Le*, Hao Phung*, Thuan Hoang Nguyen*, Quan Dao*, Ngoc Tran, and Anh Tran
    In Proceedings of the IEEE/CVF International Conference on Computer Vision (ICCV), 2023
  10. Wavelet Diffusion Models are fast and scalable Image Generators
    Hao Phung*, Quan Dao*, and Anh Tran
    In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 2023
  11. NICS
    VQASTO: Visual Question Answering System for Action Surveillance based on Task Ontology
    Huy Quoc Vo*, Tien-Hao Phung*, and Ngoc Quoc Ly
    In 2020 7th NAFOSTED Conference on Information and Computer Science (NICS), 2020