DFDS: EfficientNet-B2 Deepfake Detection Engine.

An open-source, production-ready deepfake detection model optimized for FinTech Identity Verification (KYC) pipelines. This model leverages a two-phase transfer learning approach using EfficientNet-B2, achieving a 96.00% accuracy on in-domain verification and demonstrating high resilience against generative attacks.

This repository contains the finalized Inference Weights (pytorch_model.bin) and a plug-and-play Python Execution Pipeline (inference.py) equipped with RetinaFace bounding logic.

Ecosystem Links.


Quickstart: Running Inference.

This repository includes a standalone inference.py script that automatically handles RetinaFace mathematical cropping, Tensor Normalization, and universal hardware routing (CUDA, Apple MPS, or CPU).

1. Install Dependencies

pip install torch torchvision opencv-python retina-face pillow

2. Download and Run Download pytorch_model.bin and inference.py to your local directory.

from inference import predict_deepfake

# Pass any raw image to the pipeline
result = predict_deepfake("sample_id_photo.jpg")

print(result)
# Output: {'prediction': 'FAKE', 'fake_confidence': '98.40%', 'real_confidence': '1.60%'}

Performance Benchmarks (Run 5).

The model was evaluated using a strict identity-isolation protocol to ensure zero data leakage between Training and Testing splits.

1. In-Domain Generalization (Primary Test Set).

Evaluated on 21,324 isolated images from the primary Training distribution.

Metric Score
Accuracy 96.00%
ROC-AUC 99.30%
F1 Score 0.9597
Validation Loss 0.1344

Class-Level Breakdown:

  • Fake (0): Precision: 0.9563 | Recall: 0.9644 | F1: 0.9603
  • Real (1): Precision: 0.9638 | Recall: 0.9555 | F1: 0.9597

2. Cross-Dataset Generalization (Zero-Shot)

To test real-world resilience, the model was evaluated on 6,216 completely unseen images spanning 8 novel manipulation methods from the DF40 Benchmark.

  • Overall Accuracy: 67.95%
  • Overall ROC-AUC: 71.50%

Context on Architectural Generalization: It is critical to note that this is a strict zero-shot evaluation. The data proves the model successfully learns the underlying mechanics of generation families rather than simply memorizing training datasets. When exposed to entirely unseen data that shares a generative family with its training distribution (example: Diffusion-based EFS like MidJourney and CollabDiff), the model maintains a massive 83%+ accuracy.

The performance drop-off at the bottom of the table is expected, as methods like StarGAN-v2 and StyleClip represent entirely foreign generation architectures (GANs and Latent Edits) that were mathematically absent from the training distribution. Ultimately, the model is highly lethal against modern FinTech threat vectors (EFS and modern Face Swaps).

Generation Method Type Accuracy F1 Score Fake Detection % Real Detection %
MidJourney Diffusion / EFS 93.13% 0.9266 99.5% 86.8%
CollabDiff Diffusion / EFS 83.13% 0.8525 68.8% 97.5%
HeyGen Face Swap 80.00% 0.7050 51.2% 89.8%
StarGAN GAN 74.81% 0.7831 58.7% 90.9%
DeepFaceLab Face Swap 69.37% 0.6805 73.5% 65.2%
StarGAN-v2 GAN 52.75% 0.6730 8.2% 97.2%
StyleClip Latent Edit 51.54% 0.6696 4.9% 98.2%
whichfaceisreal GAN 50.00% 0.6667 0.0% 100.0%

3. Aggregated Threat Vector Performance.

Threat Category Support (N) Accuracy ROC-AUC
EFS (MidJourney + CollabDiff) 1,542 87.93% 0.9830
Face Swap (HeyGen + DFL) 1,502 69.97% 0.7948
Total FinTech Relevant (EFS + FS) 3,044 79.07% 0.8901

Technical Architecture.

  • Backbone: EfficientNet-B2 (Pre-trained on ImageNet)
  • Custom Head: Dropout (p=0.3) -> Linear (1408 to 2 nodes)
  • Loss Function: CrossEntropyLoss
  • Optimizer: AdamW
  • Input Resolution: 260x260 RGB (Normalized to ImageNet mean/std)
  • Face Extraction: RetinaFace (Confidence Threshold > 0.90)

Author.

Thinod Wickramasinghe · University of Plymouth · 2026

GitHub: https://github.com/thinothw

Project Supervisor - Dr. Rasika Ranaweera.

Downloads last month

-

Downloads are not tracked for this model. How to track
Inference Providers NEW
This model isn't deployed by any Inference Provider. 🙋 Ask for provider support

Dataset used to train ThinothW/DFDS-EfficientNet-B2-Deepfake-Detection-Engine