Academic Article

Unmasking Deception: Empowering Deepfake Detection with Vision Transformer Network
Document Type
article
Source
Mathematics, Vol 11, Iss 17, p 3710 (2023)
Subject
deepfake
identification
Vision Transformer
pretrained
fine tuning
Mathematics
QA1-939
Language
English
ISSN
2227-7390
Abstract
With the development of image-generating technologies, significant progress has been made in facial manipulation techniques. These techniques allow people to easily modify media such as videos and images by substituting one person's identity or facial expression with the face of another. This has significantly increased the availability and accessibility of such tools and of the manipulated content termed 'deepfakes'. An accurate method for detecting fake images is therefore needed in a timely manner to prevent their misuse. This paper examines the capability of the Vision Transformer (ViT), specifically its extraction of global features, to detect deepfake images effectively. After conducting comprehensive experiments, our method demonstrates a high level of effectiveness, achieving detection accuracy, precision, recall, and F1 scores of 99.5% to 100% on both the original and the mixed datasets. To the best of our knowledge, this study is a research endeavor that incorporates real-world applications, specifically the examination of Snapchat-filtered images.
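The record does not include the authors' code. As a rough illustration of the setup the abstract and keywords describe (a pretrained Vision Transformer fine-tuned for binary real-vs-fake classification), the sketch below adapts torchvision's ViT-B/16; the model variant, optimizer, and learning rate are assumptions for illustration, not the authors' exact configuration.

```python
import torch
import torch.nn as nn
from torchvision.models import vit_b_16, ViT_B_16_Weights

# Load an ImageNet-pretrained ViT-B/16 and replace its classification head
# with a two-class head (0 = real, 1 = deepfake) for fine-tuning.
# Note: ViT-B/16 and the hyperparameters below are illustrative assumptions.
weights = ViT_B_16_Weights.IMAGENET1K_V1
model = vit_b_16(weights=weights)
model.heads.head = nn.Linear(model.heads.head.in_features, 2)

# Reuse the preprocessing (resize, crop, normalization) that matches the
# pretrained weights, so fine-tuning sees inputs in the expected format.
preprocess = weights.transforms()

optimizer = torch.optim.AdamW(model.parameters(), lr=1e-4)
criterion = nn.CrossEntropyLoss()

def train_step(images: torch.Tensor, labels: torch.Tensor) -> float:
    """One fine-tuning step on a batch of preprocessed face images."""
    model.train()
    optimizer.zero_grad()
    logits = model(images)           # [batch, 2] scores from the class token
    loss = criterion(logits, labels)
    loss.backward()
    optimizer.step()
    return loss.item()
```

In this sketch the transformer's global self-attention over image patches provides the "global features" the abstract refers to, with only the final linear head replaced before fine-tuning on real and manipulated face images.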