The document summarizes Kun Zhou's PhD research on emotional voice conversion with non-parallel data at the National University of Singapore. It introduces emotional voice conversion and its challenges, including the lack of parallel training data. It then summarizes Kun's publications, which propose CycleGAN-based and VAW-GAN approaches to model prosody for speaker-dependent and independent emotional voice conversion. One publication introduces a method for transferring both seen and unseen emotional styles using a pre-trained speech emotion recognizer to describe emotional styles.