학술논문

Generalized Deepfake Video Detection Through Time-Distribution and Metric Learning
Document Type
Periodical
Source
IT Professional IT Prof. IT Professional. 24(2):38-44 Apr, 2022
Subject
Computing and Processing
Engineering Profession
Components, Circuits, Devices and Systems
Power, Energy and Industry Applications
Computer vision
Training data
Artificial intelligence
Computational modeling
Detectors
Feature extraction
Data mining
Deepfakes
Fake news
Language
ISSN
1520-9202
1941-045X
Abstract
Rapid advancements in the field of computer vision and AI have enabled the creation of synthesized images and videos known as deepfakes. Deepfakes are used as a source of spreading false news and misinformation. The constant evolution of generative models, used for creating deepfakes, makes it difficult and yet very important to find effective generalized solutions for such deepfake videos. We have designed a generalized deepfake detector by creating a two-stream network that uses CNN-LSTM as its backbone. Our contributions in this article are twofold: 1) using a time-distributed network to create representations using spatial and temporal information of a video, and 2) improving the discriminative ability of the extracted feature embeddings by using metric learning during training. Results gathered through extensive experiments show the effectiveness of our solution even on a cross-modal FaceForensic++ dataset proving the generalization ability of the solution.