Input Video | pSp + StyleGAN | e4e + StyleGAN | e4e + StyleGAN_va (ours) |
Edited e4e + StyleGAN_va (- Beard) |
Input Video | pSp + StyleGAN | e4e + StyleGAN | e4e + StyleGAN_va (ours) |
Edited e4e + StyleGAN_va (+ Makeup) |
Input Video | pSp + StyleGAN | e4e + StyleGAN | e4e + StyleGAN_va (ours) |
Edited e4e + StyleGAN_va (- Age) |
Input Video | pSp + StyleGAN | e4e + StyleGAN | e4e + StyleGAN_va (ours) |
Edited e4e + StyleGAN_va (+ Age) |
Input Video | pSp + StyleGAN | e4e + StyleGAN | e4e + StyleGAN_va (ours) |
Edited e4e + StyleGAN_va (+ Yaw) |
Input Video | pSp + StyleGAN | e4e + StyleGAN | e4e + StyleGAN_va (ours) |
Edited e4e + StyleGAN_va (+ Chubby) |
Input Video | pSp + StyleGAN | e4e + StyleGAN | e4e + StyleGAN_va (ours) |
Edited e4e + StyleGAN_va (- Nose Size) |
Input Video | pSp + StyleGAN | e4e + StyleGAN | e4e + StyleGAN_va (ours) |
Edited e4e + StyleGAN_va (+ Chubby) |
Input Frame | e4e + StyleGAN | e4e + StyleGAN_va (ours) |
Edited e4e + StyleGAN_va (+ Age) |
Input Frame | e4e + StyleGAN | e4e + StyleGAN_va (ours) |
Edited e4e + StyleGAN_va (+ Makeup) |
Input Frame | e4e + StyleGAN | e4e + StyleGAN_va (ours) |
Edited e4e + StyleGAN_va (+ Beard) |
Input Video | Opt. + StyleGAN | R + Opt. + StyleGAN | Opt. + StyleGAN_va | R + Opt. + StyleGAN_va | e4e + StyleGAN_va (ours) |
Input Video | Opt. + StyleGAN | R + Opt. + StyleGAN | Opt. + StyleGAN_va | R + Opt. + StyleGAN_va | e4e + StyleGAN_va (ours) |
Figure 8. Qualitative results on expression editing.
Videos with Exp. Dyn. Optim. better follow the original input video's lip contact than those without Exp. Dyn. Optim. The eyes in the other hand follow the intented edited results.
Note that for all expression directions, the mouth is opening, and our method tries to preserve the original expression dynamics such as the lip contact to follow the original motion while maintaining the edited expression results.
Show video1,
video2,
video3,
video4,
video5,
video6,
video7,
video8, and
video9.
Input Video | w/o Exp. Dyn. Optim. | w/ Exp. Dyn. Optim. (+ Happiness) |
Input Video | w/o Exp. Dyn. Optim. | w/ Exp. Dyn. Optim. (+ Happiness) |
Input Video | w/o Exp. Dyn. Optim. | w/ Exp. Dyn. Optim. (+ Happiness) |
Input Video | w/o Exp. Dyn. Optim. | w/ Exp. Dyn. Optim. (+ Anger) |
Input Video | w/o Exp. Dyn. Optim. | w/ Exp. Dyn. Optim. (+ Anger) |
Input Video | w/o Exp. Dyn. Optim. | w/ Exp. Dyn. Optim. (+ Anger) |
Input Video | w/o Exp. Dyn. Optim. | w/ Exp. Dyn. Optim. (+ Anger) |
Input Video | w/o Exp. Dyn. Optim. | w/ Exp. Dyn. Optim. (+ Surprise) |
Input Video | w/o Exp. Dyn. Optim. | w/ Exp. Dyn. Optim. (+ Surprise) |
Input Video | w/o Exp. Dyn. Optim. (+ Anger) |
w/o Exp. Dyn. Optim. (+ Anger - Arched eyebrow) |
w Exp. Dyn. Optim. (+ Anger) |
w Exp. Dyn. Optim. (+ Anger - Arched eyebrow) |
Input Video | w/o Exp. Dyn. Optim. (+ Happiness) |
w/o Exp. Dyn. Optim. (+ Happiness - Eye openess) |
w Exp. Dyn. Optim. (+ Happiness) |
w Exp. Dyn. Optim. (+ Happiness - Eye openess) |
Input Video | e4e+StyleGAN2_va | e4e+StyleGAN3_va |
Input Video | e4e+StyleGAN2_va | e4e+StyleGAN3_va |
Input Video | e4e+StyleGAN2_va | e4e+StyleGAN3_va |
Figure 12 (b). Limitations and capacity of StyleGAN3.
Misalignment of the eyes is effectively solved by using StyleGAN3.
Show
video1.
Input Video | e4e+StyleGAN2_va | e4e+StyleGAN3_va |