HiFi-Portrait: Zero-shot Identity-preserved Portrait Generation with High-fidelity Multi-face Fusion

摘要

Recent advancements in diffusion-based technologies have made significant strides, particularly in identity-preserved portrait generation (IPG). However, when using multiple reference images from the same ID, existing methods typically produce lower-fidelity portraits and struggle to customize face attributes precisely. To address these issues, this paper presents HiFi-Portrait, a high-fidelity method for zero-shot portrait generation. Specifically, we first introduce the face refiner and landmark generator to obtain fine-grained multi-face features and 3D-aware face landmarks. The landmarks include the reference ID and the target attributes. Then, we design HiFi-Net to fuse multi-face features and align them with landmarks, which improves ID fidelity and face control. In addition, we devise an automated pipeline to construct an ID-based dataset for training HiFi-Portrait. Extensive experimental results demonstrate that our method surpasses the SOTA approaches in face similarity and controllability. Furthermore, our method is also compatible with previous SDXL-based works.

出版物
In 2025 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)
Yifang Xu(徐一舫)
硕士(2020-2023)

简略介绍

Benxiang Zhai(翟本祥)
Benxiang Zhai(翟本祥)
硕士(2023-)

简略介绍

Ming Li(李明)
硕博连读(2017-2024)

简略介绍

Yang Li(李杨)
Yang Li(李杨)
副教授

简略介绍

Sidan Du(都思丹)
Sidan Du(都思丹)
教授

简略介绍