Abstract
Facial expression synthesis has been receiving more attention for research and application in computer vision. The state-of-the-art latent diffusion model (LDM) can generate high-quality images from text prompts. However, to edit the facial expression of existing images, the model can over-editing and remove some identity from the original images. In this study, we build a facial expression synthesis pipeline to edit the original image with different expressions: anger, disgust, contempt, fear, happiness, sadness, surprise, and neutral. Our pipeline includes facial segmentation to extract the necessary area for editing, denoising diffusion probabilistic models (DDPM) with text embedding to generate and control the output expression, and image combining to combine generated image back to the original image. In this paper, we experiment and analyze the potential of DDPM with our method.
Original language | English |
---|---|
Title of host publication | IET International Conference on Engineering Technologies and Applications (ICETA 2023) |
Publisher | Institution of Engineering and Technology |
Pages | 107-108 |
Number of pages | 2 |
ISBN (Print) | 9781839539404 |
DOIs | |
Publication status | Published - 6 Mar 2024 |
Event | 2023 IET International Conference on Engineering Technologies and Applications, ICETA 2023 - Yunlin, Taiwan, Province of China Duration: 21 Oct 2023 → 23 Oct 2023 |
Conference
Conference | 2023 IET International Conference on Engineering Technologies and Applications, ICETA 2023 |
---|---|
Country/Territory | Taiwan, Province of China |
City | Yunlin |
Period | 21/10/23 → 23/10/23 |
Keywords
- diffusion mode
- face segmentation
- facial expression synthesis
- generative models
- text embedding