DEEPFAKER: A Unified Evaluation Platform for Facial Deepfake and Detection Models

doi:10.1145/3634914

DOI: 10.1145/3634914 ISSN: 2471-2566

DEEPFAKER: A Unified Evaluation Platform for Facial Deepfake and Detection Models

Li Wang, Xiangtao Meng, Dan Li, Xuhong Zhang, Shouling Ji, Shanqing Guo

Safety, Risk, Reliability and Quality
General Computer Science

DeepFake data contains realistically manipulated faces - its abuses pose a huge threat to the security and privacy-critical applications. Intensive research from academia and industry has produced many deepfake/detection models, leading to a constant race of attack and defense. However, due to the lack of a unified evaluation platform, many critical questions on this subject remain largely unexplored. (i) How is the anti-detection ability of the existing deepfake models? (ii) How generalizable are existing detection models against different deepfake samples? (iii) How effective are the detection APIs provided by the cloud-based vendors? (iv) How evasive and transferable are adversarial deepfakes in the lab and real-world environment? (v) How do various factors impact the performance of deepfake and detection models?

To bridge the gap, we design and implement

DEEPFAKER

, a unified and comprehensive deepfake-detection evaluation platform. Specifically,

DEEPFAKER

has integrated 10 state-of-the-art deepfake methods and 9 representative detection methods, while providing a user-friendly interface and modular design that allows for easy integration of new methods. Leveraging

DEEPFAKER

, we conduct a large-scale empirical study of facial deepfake/detection models and draw a set of key findings: (i) the detection methods have poor generalization on samples generated by different deepfake methods; (ii) there is no significant correlation between anti-detection ability and visual quality of deepfake samples; (iii) the current detection APIs have poor detection performance and adversarial deepfakes can achieve about 70% ASR (attack success rate) on all cloud-based vendors, calling for an urgent need to deploy effective and robust detection APIs; (iv) the detection methods in the lab are more robust against transfer attacks than the detection APIs in the real-world environment; (v) deepfake videos may not always be more difficult to detect after video compression. We envision that

DEEPFAKER

will benefit future research on facial deepfake and detection.

Outline

DEEPFAKER: A Unified Evaluation Platform for Facial Deepfake and Detection Models

More from our Archive