A study on Speaker Recognition System

"The huge development in information technology opened the door for finding an increasing number of security gaps in the daily used systems like email accounts. Security systems developers and manufacturers are trying hardly to cope with the increasing security breaching attacks. The need to ov...

وصف كامل

محفوظ في:
التفاصيل البيبلوغرافية
المؤلف الرئيسي: Bakkar, Hazem Wa'il Mohammed (author)
منشور في: 2015
الموضوعات:
الوصول للمادة أونلاين:https://bspace.buid.ac.ae/handle/1234/1699
الوسوم: إضافة وسم
لا توجد وسوم, كن أول من يضع وسما على هذه التسجيلة!
_version_ 1862980617699852288
author Bakkar, Hazem Wa'il Mohammed
author_facet Bakkar, Hazem Wa'il Mohammed
author_role author
dc.creator.none.fl_str_mv Bakkar, Hazem Wa'il Mohammed
dc.date.none.fl_str_mv 2015-05
2020-11-15T13:19:10Z
2020-11-15T13:19:10Z
dc.format.none.fl_str_mv application/pdf
dc.identifier.none.fl_str_mv 2013128078
https://bspace.buid.ac.ae/handle/1234/1699
dc.language.none.fl_str_mv en
dc.publisher.none.fl_str_mv The British University in Dubai (BUiD)
dc.subject.none.fl_str_mv information technology
speaker recognition system
security systems
human biometrics
dc.title.none.fl_str_mv A study on Speaker Recognition System
dc.type.none.fl_str_mv Dissertation
description "The huge development in information technology opened the door for finding an increasing number of security gaps in the daily used systems like email accounts. Security systems developers and manufacturers are trying hardly to cope with the increasing security breaching attacks. The need to overcome this challenge forced many researchers and manufacturers to think about adding extra levels of security to protect information and resources; these extra levels of security are mainly involve around using the human biometrics in order to identify the real identity of the user. Speaker recognition methods are considered a leading approach in applying biometric security systems. In this thesis we aimed to develop a unique speaker recognition system with a user friendly interface. The proposed system was mainly developed using Python (Python.org, 2015). This system was used to implement and study several methods and techniques in speaker recognition domain. Another main goal for conducting this research is to make a scientific comparison between tools and methods that are related to speaker recognition domain, the following are the techniques that were studied : 1) Energy based tool and Long-Term Spectral Divergence (LTSD) in the preprocessing module of the system, 2) Mel Frequency Cepstral Coefficients (MFCC) and Linear Predictive Cepstral Coefficients (LPCC) in the feature extraction module, and 3) scikit-learn Gaussian Mixture Model (GMM), Universal Background Model (UBM), Continuous Restricted Boltzmann Machine (CRBM) and Joint Factor Analysis (JFA) in the recognition module. Finally, we proposed a new GMM in this thesis which was compared with the famous scikit-learn GMM technique. All the mentioned tools and methods were tested and experimented in this thesis. Findings of the experiments showed that: 1) LTSD for voice activity detection is faster and more practical than the energy based tools, 2) MFCC is computationally more expensive than LPCC but MFCC is faster and more accurate, also LPCC needs double size utterance to achieve the same accuracy MFCC generates. 3) The new GMM showed that it is five times faster than scikit-learn GMM, also the proposed GMM outperforms all other techniques studied in this thesis. As a result, to build a user-friendly speaker recognition system, it is better to use LSTD for preprocessing, MFCC for feature extraction, and our enhanced GMM for speaker testing and recognition."
id budr_51922a7c097645c89c05fdfd3863282f
identifier_str_mv 2013128078
language_invalid_str_mv en
network_acronym_str budr
network_name_str The British University in Dubai repository
oai_identifier_str oai:bspace.buid.ac.ae:1234/1699
publishDate 2015
publisher.none.fl_str_mv The British University in Dubai (BUiD)
repository.mail.fl_str_mv
repository.name.fl_str_mv
repository_id_str
spelling A study on Speaker Recognition SystemBakkar, Hazem Wa'il Mohammedinformation technologyspeaker recognition systemsecurity systemshuman biometrics"The huge development in information technology opened the door for finding an increasing number of security gaps in the daily used systems like email accounts. Security systems developers and manufacturers are trying hardly to cope with the increasing security breaching attacks. The need to overcome this challenge forced many researchers and manufacturers to think about adding extra levels of security to protect information and resources; these extra levels of security are mainly involve around using the human biometrics in order to identify the real identity of the user. Speaker recognition methods are considered a leading approach in applying biometric security systems. In this thesis we aimed to develop a unique speaker recognition system with a user friendly interface. The proposed system was mainly developed using Python (Python.org, 2015). This system was used to implement and study several methods and techniques in speaker recognition domain. Another main goal for conducting this research is to make a scientific comparison between tools and methods that are related to speaker recognition domain, the following are the techniques that were studied : 1) Energy based tool and Long-Term Spectral Divergence (LTSD) in the preprocessing module of the system, 2) Mel Frequency Cepstral Coefficients (MFCC) and Linear Predictive Cepstral Coefficients (LPCC) in the feature extraction module, and 3) scikit-learn Gaussian Mixture Model (GMM), Universal Background Model (UBM), Continuous Restricted Boltzmann Machine (CRBM) and Joint Factor Analysis (JFA) in the recognition module. Finally, we proposed a new GMM in this thesis which was compared with the famous scikit-learn GMM technique. All the mentioned tools and methods were tested and experimented in this thesis. Findings of the experiments showed that: 1) LTSD for voice activity detection is faster and more practical than the energy based tools, 2) MFCC is computationally more expensive than LPCC but MFCC is faster and more accurate, also LPCC needs double size utterance to achieve the same accuracy MFCC generates. 3) The new GMM showed that it is five times faster than scikit-learn GMM, also the proposed GMM outperforms all other techniques studied in this thesis. As a result, to build a user-friendly speaker recognition system, it is better to use LSTD for preprocessing, MFCC for feature extraction, and our enhanced GMM for speaker testing and recognition."The British University in Dubai (BUiD)2020-11-15T13:19:10Z2020-11-15T13:19:10Z2015-05Dissertationapplication/pdf2013128078https://bspace.buid.ac.ae/handle/1234/1699enoai:bspace.buid.ac.ae:1234/16992021-10-17T13:11:42Z
spellingShingle A study on Speaker Recognition System
Bakkar, Hazem Wa'il Mohammed
information technology
speaker recognition system
security systems
human biometrics
title A study on Speaker Recognition System
title_full A study on Speaker Recognition System
title_fullStr A study on Speaker Recognition System
title_full_unstemmed A study on Speaker Recognition System
title_short A study on Speaker Recognition System
title_sort A study on Speaker Recognition System
topic information technology
speaker recognition system
security systems
human biometrics
url https://bspace.buid.ac.ae/handle/1234/1699