On the effectiveness of handcrafted features for deepfake video detection

Recent developments in advanced generative deep learning techniques have led to considerable progress in deepfake technology. CNN-based deepfake detection approaches have demonstrated superior performance. The ability to learn meaningful representations generated by convolutional multilayer nonlinea...

وصف كامل

محفوظ في:

التفاصيل البيبلوغرافية
المؤلف الرئيسي:	Kaddar, Bachir (author)
مؤلفون آخرون:	Fezza, Sid Ahmed (author), Hamidouche, Wassim (author), Akhtar, Zahid (author), Hadid, Abdenour (author)
منشور في:	2023
الوصول للمادة أونلاين:	https://depot.sorbonne.ae/handle/20.500.12458/1449
الوسوم:	إضافة وسم لا توجد وسوم, كن أول من يضع وسما على هذه التسجيلة!

_version_	1857415062795321344
author	Kaddar, Bachir
author2	Fezza, Sid Ahmed Hamidouche, Wassim Akhtar, Zahid Hadid, Abdenour
author2_role	author author author author
author_facet	Kaddar, Bachir Fezza, Sid Ahmed Hamidouche, Wassim Akhtar, Zahid Hadid, Abdenour
author_role	author
dc.creator.none.fl_str_mv	Kaddar, Bachir Fezza, Sid Ahmed Hamidouche, Wassim Akhtar, Zahid Hadid, Abdenour
dc.date.none.fl_str_mv	2023-11-30T07:21:00Z 2023-11-30T07:21:00Z 2023
dc.identifier.none.fl_str_mv	1017-9909 https://depot.sorbonne.ae/handle/20.500.12458/1449 10.1117/1.JEI.32.5.053033
dc.language.none.fl_str_mv	en
dc.relation.none.fl_str_mv	Journal of Electronic Imaging
dc.title.none.fl_str_mv	On the effectiveness of handcrafted features for deepfake video detection
dc.type.none.fl_str_mv	Controlled Vocabulary for Resource Type Genres::text::periodical::journal::contribution to journal::journal article
description	Recent developments in advanced generative deep learning techniques have led to considerable progress in deepfake technology. CNN-based deepfake detection approaches have demonstrated superior performance. The ability to learn meaningful representations generated by convolutional multilayer nonlinear structures is the key to success. However, the black-box nature of such approaches has been a major concern for exploring hidden and complex characteristics as well as potential limitations of CNN-based models. To gain insights into the scope of the deepfake detection task, we investigate the effectiveness of handcrafted feature-based methods for deepfake video detection. First, we experiment with six top-performing handcrafted descriptors to extract the discriminating image features and then train SVMs on the extracted features to learn a suitable model. We also study the effect of selecting specific facial components on the detection performance. Specifically, we consider features extracted from the left eye, right eye, mouth, and entire face. Moreover, we propose a combination of these features and highlight the importance of this combination in terms of detection performance. Experimental results show that the SIFT feature descriptor achieves the best performance on deepfake videos generated by the neural texture technique, with a detection accuracy of 83.50%, which is better than deep learning-based methods. This is in contrast to the conventional understanding that deep learning methods systematically outperform handcrafted feature-based approaches. In addition, the obtained results on the FaceForensics++ dataset highlight the benefit of using some facial components to further boost the detection performance. Moreover, motivated by the effectiveness of the LBPTOP and SIFT in the deepfake detection task, we combined the LBPTOP and SIFT to best characterize the specific spatiotemporal inconsistencies commonly found in fake videos for boosting deepfake detection performance. Finally, we show the strengths and weaknesses of methods based on handcrafted features for deepfake detection and provide directions for future research.
id	sorbonner_3692adfc5791622332f245f4c6a8a032
identifier_str_mv	1017-9909 10.1117/1.JEI.32.5.053033
language_invalid_str_mv	en
network_acronym_str	sorbonner
network_name_str	Sorbonne University Abu Dhabi repository
oai_identifier_str	oai:depot.sorbonne.ae:20.500.12458/1449
publishDate	2023
repository.mail.fl_str_mv
repository.name.fl_str_mv
repository_id_str
spelling	On the effectiveness of handcrafted features for deepfake video detectionKaddar, BachirFezza, Sid AhmedHamidouche, WassimAkhtar, ZahidHadid, AbdenourRecent developments in advanced generative deep learning techniques have led to considerable progress in deepfake technology. CNN-based deepfake detection approaches have demonstrated superior performance. The ability to learn meaningful representations generated by convolutional multilayer nonlinear structures is the key to success. However, the black-box nature of such approaches has been a major concern for exploring hidden and complex characteristics as well as potential limitations of CNN-based models. To gain insights into the scope of the deepfake detection task, we investigate the effectiveness of handcrafted feature-based methods for deepfake video detection. First, we experiment with six top-performing handcrafted descriptors to extract the discriminating image features and then train SVMs on the extracted features to learn a suitable model. We also study the effect of selecting specific facial components on the detection performance. Specifically, we consider features extracted from the left eye, right eye, mouth, and entire face. Moreover, we propose a combination of these features and highlight the importance of this combination in terms of detection performance. Experimental results show that the SIFT feature descriptor achieves the best performance on deepfake videos generated by the neural texture technique, with a detection accuracy of 83.50%, which is better than deep learning-based methods. This is in contrast to the conventional understanding that deep learning methods systematically outperform handcrafted feature-based approaches. In addition, the obtained results on the FaceForensics++ dataset highlight the benefit of using some facial components to further boost the detection performance. Moreover, motivated by the effectiveness of the LBPTOP and SIFT in the deepfake detection task, we combined the LBPTOP and SIFT to best characterize the specific spatiotemporal inconsistencies commonly found in fake videos for boosting deepfake detection performance. Finally, we show the strengths and weaknesses of methods based on handcrafted features for deepfake detection and provide directions for future research.2023-11-30T07:21:00Z2023-11-30T07:21:00Z2023Controlled Vocabulary for Resource Type Genres::text::periodical::journal::contribution to journal::journal article1017-9909https://depot.sorbonne.ae/handle/20.500.12458/144910.1117/1.JEI.32.5.053033enJournal of Electronic Imagingoai:depot.sorbonne.ae:20.500.12458/14492023-11-30T07:21:00Z
spellingShingle	On the effectiveness of handcrafted features for deepfake video detection Kaddar, Bachir
title	On the effectiveness of handcrafted features for deepfake video detection
title_full	On the effectiveness of handcrafted features for deepfake video detection
title_fullStr	On the effectiveness of handcrafted features for deepfake video detection
title_full_unstemmed	On the effectiveness of handcrafted features for deepfake video detection
title_short	On the effectiveness of handcrafted features for deepfake video detection
title_sort	On the effectiveness of handcrafted features for deepfake video detection
url	https://depot.sorbonne.ae/handle/20.500.12458/1449

On the effectiveness of handcrafted features for deepfake video detection

مواد مشابهة