DSpace Repository

Scale-invariant MFCCs for speech/speaker recognition

Show simple item record

dc.contributor.author Tufekci, Zekeriya
dc.contributor.author Disken, Gokay
dc.date.accessioned 2019-11-29T12:46:53Z
dc.date.available 2019-11-29T12:46:53Z
dc.date.issued 2019
dc.identifier.citation Tufekci, Z., & Disken, G. (2019). Scale-invariant MFCCs for speech/speaker recognition. Turkish Journal of Electrical Engineering and Computer Sciences, 27(5), 3758-3762. https://doi.org/10.3906/elk-1901-231 tr_TR
dc.identifier.issn 1300-0632
dc.identifier.issn 1303-6203
dc.identifier.uri http://openaccess.adanabtu.edu.tr:8080/xmlui/handle/123456789/625
dc.identifier.uri https://doi.org/10.3906/elk-1901-231
dc.description WOS indeksli yayınlar koleksiyonu. / WOS indexed publications collection. TR Dizin indeksli yayınlar koleksiyonu. / TR Dizin indexed publications collection.
dc.description.abstract The feature extraction process is a fundamental part of speech processing. Mel frequency cepstral coefficients (MFCCs) are the most commonly used feature types in the speech/speaker recognition literature. However, the MFCC framework may face numerical issues or dynamic range problems, which decreases their performance. A practical solution to these problems is adding a constant to filter-bank magnitudes before log compression, thus violating the scale-invariant property. In this work, a magnitude normalization and a multiplication constant are introduced to make the MFCCs scale-invariant and to avoid dynamic range expansion of nonspeech frames. Speaker verification experiments are conducted to show the effectiveness of the proposed scheme. tr_TR
dc.language.iso en tr_TR
dc.publisher TURKISH JOURNAL OF ELECTRICAL ENGINEERING AND COMPUTER SCIENCES / TUBITAK SCIENTIFIC & TECHNICAL RESEARCH COUNCIL TURKEY tr_TR
dc.relation.ispartofseries 2019;Volume: 27 Issue: 5
dc.subject Feature extraction tr_TR
dc.subject speaker recognition
dc.subject speech recognition
dc.subject SPEECH
dc.subject Computer Science
dc.subject Artificial Intelligence
dc.subject Engineering
dc.subject Electrical & Electronic
dc.title Scale-invariant MFCCs for speech/speaker recognition tr_TR
dc.type Article tr_TR


Files in this item

Files Size Format View

There are no files associated with this item.

This item appears in the following Collection(s)

Show simple item record

Search DSpace


Advanced Search

Browse

My Account