Automatic Speech Recognition by Using Neural Network Based on Mel Frequency Cepstral Coefficient
DOI:
https://doi.org/10.62019/vs3esy64Keywords:
Speech Recognition, Short Term Energy, Zero Crossing Rate, Mel frequency Cepstral coefficient, Neural Networks, Feed forward Neural Networks, Back PropagationAbstract
This paper deliberated and estimated the Neural Networks Automatic Speech Recognition (ASR) system based on an isolated small vocabulary speaker-independent manual cropping technique, from the training stage to the recognition stage. Besides this, the paper also examines three distinct blocks of speech recognition, i.e., Speech Preprocessor, Feature Extractor, and a Recognizer. Speech preprocessing involves windowing, framing, Short Term and Zero Crossing threshold energy, and End Point Detection calculation. Mel Frequency Cepstral Coefficients (MFCC) are extracted to represent the speech signal in frames and then passed through a Mel frequency filter. Multi-layer feed-forward network trained by the back-propagation method.

Downloads
Published
Issue
Section
License
Copyright (c) 2025 Muhammad Daud Abbasi, Zubair Sajid, Shahzad Karim Khawer, Syed Zain Mir, Abdul Basit, Muhammad Kashif

This work is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License.