International Journal of Electrical and Computer Engineering (IJECE)
Vol. 9, No. 5, October 2019, pp. 3642~3648
ISSN: 2088-8708, DOI: 10.11591/ijece.v9i5.pp3642-3648 3642
Journal homepage: http://iaescore.com/journals/index.php/IJECE
Speech to text conversion and summarization for effective
understanding and documentation
Vinnarasu A., Deepa V. Jose
Department of Computer Science, CHRIST (Deemed to be University), India
Article Info ABSTRACT
Article history:
Received Jan 17, 2019
Revised Apr 1, 2019
Accepted Apr 10, 2019
Speech, is the most powerful way of communication with which human
beings express their thoughts and feelings through different languages.
The features of speech differs with each language. However, even while
communicating in the same language, the pace and the dialect varies with
each person. This creates difficulty in understanding the conveyed message
for some people. Sometimes lengthy speeches are also quite difficult to
follow due to reasons such as different pronunciation, pace and so on. Speech
recognition which is an inter disciplinary field of computational linguistics
aids in developing technologies that empowers the recognition and
translation of speech into text. Text summarization extracts the utmost
important information from a source which is a text and provides
the adequate summary of the same. The research work presented in this paper
describes an easy and effective method for speech recognition. The speech is
converted to the corresponding text and produces summarized text. This has
various applications like lecture notes creation, summarizing catalogues for
lengthy documents and so on. Extensive experimentation is performed to
validate the efficiency of the proposed method.
Keywords:
Feature extraction
Natural language processing
Natural language toolkit
Speech recognition
Text summarization
Copyright © 2019 Institute of Advanced Engineering and Science.
All rights reserved.
Corresponding Author:
Vinnarasu A.,
Department of Computer Science,
CHRIST (Deemed to be University),
Bengaluru, Karnataka, India.
Email: vinnarasu.a@cs.christuniversity.in
1. INTRODUCTION
Speech is the most important part of communication between human beings. Though there are
different means to express our thoughts and feeling, speech is considered as the main meduim for
communication. Speech recognition is the process of making a machine recognize the speech of diffrent
people based on certain words or phrases. Variations in the pronunciation are quite evident in each
individual’s speech. The original form of the speech is a signal, and a signal is processed such that all the
information present in the signal is converted in to the text format. The feature extraction is the process of
taking a signal and converting it to the required format with certain logic. Even though speech is the easiest
way of communication, there exist some problems with speech recognition like the fluency, pronunciation,
broken words, stuttering issues etc. All these have to be addressed while processing a speech.
Text summarization is one of the major concepts used in the field of documentation. Lengthy documents are
difficult to read and understand as it consumes lot of time. Text summarisation solves this problem
by providing a shortened summary of it with semantics.
In the proposed work a combination of speech to text conversion and text summarisation is
implemented. This hybrid method will aid applications that require brief summary of lengthy speeches which
is quite useful for documentation. The flow diagram of the proposed approach is mentioned in Figure 1,
in which the speech recognition and text summarization is given as two different modules. The combination