REFace

This repository gives the official implementation of Realistic and Efficient Face Swapping: A Unified Approach with Diffusion Models (WACV 2025)

Paper

Sanoojan Baliah, Qinliang Lin, Shengcai Liao, Xiodan Liang, and Muhammad Haris Khan

Abstract

Despite promising progress in face swapping task, realistic swapped images remain elusive, often marred by artifacts, particularly in scenarios involving high pose variation, color differences, and occlusion. To address these issues, we propose a novel approach that better harnesses diffusion models for face-swapping by making following core contributions. (a) We propose to re-frame the face-swapping task as a self-supervised, train-time inpainting problem, enhancing the identity transfer while blending with the target image. (b) We introduce a multi-step Denoising Diffusion Implicit Model (DDIM) sampling during training, reinforcing identity and perceptual similarities. (c) Third, we introduce CLIP feature disentanglement to extract pose, expression, and lighting information from the target image, improving fidelity. (d) Further, we introduce a mask shuffling technique during inpainting training, which allows us to create a so-called universal model for swapping, with an additional feature of head swapping. Ours can swap hair and even accessories, beyond traditional face swapping. Unlike prior works reliant on multiple off-the-shelf models, ours is a relatively unified approach and so it is resilient to errors in other off-the-shelf models. Extensive experiments on FFHQ and CelebA datasets validate the efficacy and robustness of our approach, showcasing high-fidelity, realistic face-swapping with minimal inference time. Our code is available here (https://github.com/Sanoojan/REFace)

News

2024-09-10 Release training code
2024-09-10 Release test benchmark.
2024-09-14 Release checkpoints and other dependencies

Requirements

A suitable conda environment named REFace can be created and activated with:

conda env create -f environment.yaml
conda activate REFace

Pretrained model

Download our trained model here.

Other dependencies

Download the following models from the provided links and place them in the corresponding paths to perform face swapping and quantitative evaluation.

Testing

To test our model on a dataset with facial masks (Follow dataset preparation), you can use scripts/inference_test_bench.py. For example,

CUDA_VISIBLE_DEVICES=${device} python scripts/inference_test_bench.py \
    --outdir "${Results_dir}" \
    --config "${CONFIG}" \
    --ckpt "${CKPT}" \
    --scale 3.5 \
    --n_samples 10 \
    --device_ID ${device} \
    --dataset "CelebA" \
    --ddim_steps 50

or simply run:

sh inference_test_bench.sh

For a choosen folder of source and targets do faceswapping run this:

sh inference_selected.sh

Training

Data preparing

Download CelebAHQ dataset

The data structure is like this:

dataset/FaceData
├── CelebAMask-HQ
│  ├── CelebA-HQ-img
│  │  ├── 0.png
│  │  ├── 1.png
│  │  ├── ...
│  ├── CelebA-HQ-mask
│  │  ├── Overall_mask
│  │  │   ├── 0.png
│  │  │   ├── ...

Download the pretrained model of Stable Diffusion

We utilize the pretrained Stable Diffusion v1-4 as initialization, please download the pretrained models from Hugging Face and save the model to directory pretrained_models. Then run the following script to add zero-initialized weights for 5 additional input channels of the UNet (4 for the encoded masked-image and 1 for the mask itself).

python scripts/modify_checkpoints.py

Training REFace

To train a new model on CelebAHQ, you can use main_swap.py. For example,

python -u main_swap.py \
--logdir models/REFace/ \
--pretrained_model pretrained_models/sd-v1-4-modified-9channel.ckpt \
--base configs/train.yaml \
--scale_lr False

or simply run:

sh train.sh

Test Benchmark

We build a test benchmark for quantitative analysis.

Quantitative Results

By default we assume the original dataset images, selected source images and target images and corresponding swapped images are generated. To evaluate the face swapping in terms if FID, ID retrieval, Pose and Expression simply run:

bash inference_test_bench.sh

Citing Us

@article{baliah2024realisticefficientfaceswapping,
  title={Realistic and Efficient Face Swapping: A Unified Approach with Diffusion Models},
  author={Sanoojan Baliah and Qinliang Lin and Shengcai Liao and Xiaodan Liang and Muhammad Haris Khan},
  journal={arXiv preprint arXiv:2409.07269},
  year={2024}
}

Acknowledgements

This code borrows heavily from Paint-By-Example.

Maintenance

Please open a GitHub issue for any help. If you have any questions regarding the technical details, feel free to contact us.

License

(MIT)See License

Name		Name	Last commit message	Last commit date
Latest commit History 15 Commits
Other_dependencies		Other_dependencies
REFace.egg-info		REFace.egg-info
assets		assets
configs		configs
eval_tool		eval_tool
examples/FaceSwap		examples/FaceSwap
ldm		ldm
pretrained/face_parsing		pretrained/face_parsing
scripts		scripts
src		src
thinplatespline		thinplatespline
.gitignore		.gitignore
Crop_and_mask.py		Crop_and_mask.py
LICENSE		LICENSE
README.md		README.md
environment.yml		environment.yml
evaluate_all_FFHQ.sh		evaluate_all_FFHQ.sh
inference_selected.sh		inference_selected.sh
inference_test_bench.sh		inference_test_bench.sh
inference_video_swap.sh		inference_video_swap.sh
main_swap.py		main_swap.py
setup.py		setup.py
train.sh		train.sh

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

REFace

Paper

Abstract

News

Requirements

Pretrained model

Other dependencies

face parsing model (segmentation)

Arcface ID retrieval model

Landmark detection model

Expression model (For quantitative analysis only)

pose model (For quantitative analysis only)

Testing

Training

Data preparing

Download the pretrained model of Stable Diffusion

Training REFace

Test Benchmark

Quantitative Results

Citing Us

Acknowledgements

Maintenance

License

About

Uh oh!

Releases

Packages

Languages

License

CV-Synthesis/REFace

Folders and files

Latest commit

History

Repository files navigation

REFace

Paper

Abstract

News

Requirements

Pretrained model

Other dependencies

face parsing model (segmentation)

Arcface ID retrieval model

Landmark detection model

Expression model (For quantitative analysis only)

pose model (For quantitative analysis only)

Testing

Training

Data preparing

Download the pretrained model of Stable Diffusion

Training REFace

Test Benchmark

Quantitative Results

Citing Us

Acknowledgements

Maintenance

License

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages