Haohan Guo

Haohan Guo

PhD Student @ CUHK

The Chinese University of Hong Kong

Biography

Hello! I am Haohan Guo (郭浩瀚), a PhD student @ CUHK, supervised by Prof. Helen Mei Ling MENG. Before it, I received my M.S. and B.S. degrees from Northwestern Polytechnical University, supervised by Prof. Lei Xie. Then, I worked as a researcher at Sogou Inc during 2020-2021. My current research topic is deep learning based speech synthesis. If you are interested in my works, welcome to contact me.

Interests
  • Speech & Audio Processing
  • Speech Synthesis
  • Voice Conversion
  • Audio Generation
Education
  • PhD in Computer Science, 2021-

    The Chinese University of Hong Kong

  • MSc in Computer Science, 2017-2020

    Northwestern Polytechnical University

  • BSc in Computer Science, 2013-2017

    Northwestern Polytechnical University

Work Experience

lab, internship, full-time employee

 
 
 
 
 
Amazon
Applied Scientist Intern
Jun 2023 – Nov 2023 Cambridge, UK
Work as an applied scientist intern to develop large-scale TTS system based on large language models (LLM).
 
 
 
 
 
Xiaohongshu
Research Intern
Aug 2020 – May 2022 Beijing, China
Work as a researcher intern to investigate the application of speech representations in TTS.
 
 
 
 
 
Sogou
Researcher
Dec 2020 – Jul 2021 Beijing, China
Work as a researcher on singing voice conversion. We aim to develop a commercial singing conversion system which can convert arbitrary singing voice to the target timbre. High sound quality and accurate melody expression are both required.
 
 
 
 
 
Tencent AI Lab
Research Intern
May 2020 – Dec 2020 Beijing, China
Research topic is multi-singer singing voice conversion. We propose a MelGAN based end-to-end PPG-SVC model. It significantly improves the sound quality and singer similarity over the conventional PPG-SVC framework. The work is summarized to the paper, Phonetic Posteriorgrams based Many-to-Many Singing Voice Conversion via Adversarial Training.
 
 
 
 
 
Microsoft Research Asia & Microsoft STCA
Research Intern
May 2018 – Sep 2019 Beijing, China
Supervised by Frank K. Soong and Lei He. We aim to improve the robustness and naturalness of end-to-end TTS. Two main works are published to INTERSPEECH 2019, A New GAN-based End-to-End TTS Training Algorithm and Exploiting Syntactic Features in a Parsed Tree to Improve End-to-End TTS. We also investigate the conversational TTS using the end-to-end approach. The work is published to SLT 2021, Conversational End-to-End TTS for Voice Agents.
 
 
 
 
 
Chumenwenwen
Research Intern
Sep 2016 – Jun 2016 Beijing, China
Be responsible for the optimization of the front-end modules of TTS system, including G2P and Prosody model.

Education

 
 
 
 
 
The Chinese University of Hong Kong (CUHK)
Ph.D. Student
Aug 2021 – Present Hong Kong SAR, China
Supervised by Prof. Helen Meng.
 
 
 
 
 
M.S. Student
Sep 2017 – May 2020 Xi'an, Shannxi, China
Supervised by Prof. Lei Xie.
 
 
 
 
 
B.S. Student
Sep 2013 – Jul 2017 Xi'an, Shannxi, China
School of Computer Science