Speech To Text Using Deep Learning

Authors

  • Piyush Arora DEPARTMENT OF ARTIFICIAL INTELLIGENCE AND DATA SCIENCE, Dr. Akhilesh Das Institute of Professional Studies, Delhi Author

DOI:

https://doi.org/10.70849/IJSCI

Keywords:

Artificial Intelligence (AI),Deep Learning, Automatic Speech Recognition (ASR) , Intelligent Assistant, Natural Language Processing (NLP)

Abstract

The surge in huge audio data and progress in deep learning technologies has opened up possibilities for the development of highly accurate Automatic Speech Recognition (ASR) systems. These systems promise to automate transcription, provide real-time accessibility, and ultimately empower seamless Human-Computer Interaction (HCI). This paper outlines a streamlined workflow for training and deploying neural ASR models. We synthesize findings from previous studies on statistical modelling in the auditory domain, detailing of our system's architecture using Composite Neural Framework, and report on experimental results from simulation studies. Our findings suggest that integrating advanced audio processing with intelligent models which helps to reduces the Word Error Rate (WER) and enhances accessibility.

Downloads

Published

28-11-2025

How to Cite

[1]
Piyush Arora, “Speech To Text Using Deep Learning”, Int. J. Sci. Inno. Eng., vol. 2, no. 11, pp. 1531–1539, Nov. 2025, doi: 10.70849/IJSCI.