Improving Vision Transformer for Deepfake Detection

General Information

ISSN: 2301-3699 (Print); 2972-3973 (Online)
Frequency: Bimonthly
Managing Editor: Ms. Inez Chan
DOI: 10.18178/joig
Abstracting/Indexing: Scopus (Since 2021), CNKI, Google Scholar, Crossref, etc.
APC: 500 USD
Average Days to Accept: 116 days
Acceptance Rate: 38%
E-mail: editor@joig.net
Journal Metrics:

Editor-in-Chief

Dr. Branislav Vuksanovic
Deputy Head of Department, Systems Engineering Department, Military Technological College, Muscat, Oman
I am very excited to serve as the first Editor-in-Chief of the International Journal of Image and Graphics (JOIG) and hope that the publication can enrich the readers’ experience... [Read More]

What's New

2026-06-04

The 2025 CiteScores have been released by Scopus. JOIG received the CiteScore 2025 with 4.3!

2026-04-30

Volume 14, No. 2 has been published now.

2026-02-27

Volume 14, No. 1 has been published now.

Home > Articles > All Issues > 2026 > Volume 14, No. 1, 2026 >

JOIG 2026 Vol.14(1):76-83
doi: 10.18178/joig.14.1.76-83

Orvis L. Siagian, Reinhard Ebenhaizer, Pandu Wicaksono *, and Zahra N. Izdihar

Computer Science Department, School of Computer Science, Bina Nusantara University, Jakarta, Indonesia
Email: orvis.siagian@binus.ac.id (O.L.S.); reinhard.ebenhaizer@binus.ac.id (R.E.); pandu.wicaksono005@binus.ac.id (P.W.); zahra.izdihar@binus.ac.id (Z.N.I.)
*Corresponding author

Manuscript received August 28, 2025; revised September 18, 2025; accepted October 30, 2025; published February 27, 2026.

Abstract—Machine learning is rapidly advancing across various fields and accelerating a paradigm shift in image and video manipulation. Deepfakes represent one of the challenges emerging from this development. Deepfakes are synthetically manipulated media using deep learning algorithms. Criminals have abused deepfakes as a weapon to spread false information. The distribution of deepfake videos or images may lead to some significant public risks, such as misleading information, privacy violation, and misuse in political and social realms. Therefore, the development of a counter for those threats is needed, namely a reliable deepfake detection method. One of the promising methods in the deepfake detection cases is the Vision Transformer (ViT). ViT is a deep learning architecture that uses self-attention mechanisms to understand complex relationships between images. Despite its potential, ViT needs a substantial amount of computational costs and a large dataset, which pose challenges for development. In this research, we present a rigorous evaluation of the ViT model with the use of the balanced FaceForensics++ dataset and 5-fold crossvalidation strategy to ensure a more reliable result. The result shows an average accuracy of 85.39%, meaning that the model achieves a robust and stable performance. The model also showed an excellent balance between precision score (85.40%) and recall score (85.39%), which suggests to us that it is a reliable method in detecting deepfakes without significant bias. These findings indicate that a properly trained ViT, particularly with a balanced dataset, can serve as an effective and powerful tool to combat the threats posed by deepfakes.

Keywords—vision transformer, deep learning, deepfake, machine learning, video manipulation

Cite: Orvis L. Siagian, Reinhard Ebenhaizer, Pandu Wicaksono, and Zahra N. Izdihar, "Improving Vision Transformer for Deepfake Detection," Journal of Image and Graphics, Vol. 14, No. 1, pp. 76-83, 2026.

Copyright © 2026 by the authors. This is an open access article distributed under the Creative Commons Attribution License which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited (CC-BY-4.0).

附件说明

Article Metrics in Dimensions

PREVIOUS PAPER

Enhancing Facial Expression Recognition: Leveraging MobileNetV3 for Periocular Analysis

NEXT PAPER

Edge Detection Using Clip ReLU-Based Enhanced Hybrid Network

Home

Articles

Author Guide

Editor Guide

Reviewer Guide

Topics and Special Issues

journal menu