I nternational Journal of Emerging Trends& Technology in Computer Science (I JETTCS) Web Site: www.ijettcs.org Email: editor@ijettcs.org Volume 3, Issue 6, November-December 2014 ISSN 2278-6856 Volume 3, Issue 6, November-December 2014 Page 273 Abstract This paper is a culmination of two dynamic fields which is of major importance in the practical world Voice Recognition and Music Therapy. Voice Recognition, which is a very important field and is under development by various commercial and research organizations all over the world. It is implemented in critical applications such as healthcare, military, to regular applications such as smart phones, microwaves, biometric security. However, a portion of our speech which is less quantitative in nature is the emotion involved in speech. It is that portion of our speech which adds dynamism to our speech. This paper is an attempt to extract the emotional part of a speech and detect the emotional content of the speech. The detection of emotion is independent of any adjective-oriented word spoken but rather the emphasis on the pitch, tonality and stress on particular words. Keywords:- DCT, DFT, FFT, MFCC, Cepstrum, Mel Scale, Mel Spectrum, Hamming Window, Music Therapy, Emotion Detection. 1. INTRODUCTION 1.1 History of Music Therapy Music Therapy [6] is the systematic application of music in the treatment of the physiological and psychosocial aspects of an illness or disability. It focuses on the acquisition of non-musical skills and behaviors, as determined by a board certified music therapist through systematic assessment and treatment planning. Therefore, it is an allied health profession and one of the expressive therapies, consisting of an interpersonal process in which a certified music therapist uses music and all of its facets—physical, emotional, mental, social, aesthetic, and spiritual—to help clients to improve or maintain their health. Music therapy in the United States of America began in the late 18th century. However, using music as a healing medium dates back to ancient times. This is evident in biblical scriptures and historical writings of ancient civilizations such as Egypt, China, India, Greece and Rome. Today, the power of music remains the same but music is used much differently than it was in ancient times. The profession of music therapy in the United States began to develop during W.W.I and W.W. II, when music was used in Veterans Administration Hospitals as an intervention to address traumatic war injuries. Veterans actively and passively engaged in music activities that focused on relieving pain perception. Numerous doctors and nurses witnessed the effect music had on veterans' psychological, physiological, cognitive, and emotional state. Since then, colleges and universities developed programs to train musicians how to use music for therapeutic purposes. In 1950, a professional organization was formed by a collaboration of music therapists that worked with veterans, mentally retarded, hearing/visually impaired, and psychiatric populations. This was the birth of the National Association for Music Therapy (NAMT). In 1998, NAMT joined forces with another music therapy organization to become what is now known as the American Music Therapy Association (AMTA). 1.2 History of Voice Recognition Voice Recognition or a more accurate term which we can say is speech recognition is a form of technology which was developed solely to remove the concept of typing or writing or rather introduces the human voice as an input. Speech Recognition was first developed in the institute which is the mother of all computer oriented inventions, the Bell Labs. The first voice operated system developed was AUDREY in the year 1952, which had the ability to identify the spoken digits. Exactly 10 years from then, the leading corporate giants and another powerhouse of innovation, IBM, first demonstrated the ‘ShoeBox’, voice recognition software which had the ability to recognized 16 spoken English words. Slowly the idea of voice or rather speech recognition was spreading far and wide and laboratories in United States, Japan, England, and Soviet Union were developing voice recognition systems and also developing dedicated hardware to support such systems. Despite how little these efforts might sound but it was impressive beginning given to the fact that computation was quite primitive and not so developed. In 1970s, the US Department of Defense started taking interest in voice recognition and a Speech Understanding Research (SUR) cell was formed. Research work was going on all over the world when Carnegie-Mellon University first developed “HARPY” speech understanding system which was capable or recognizing 1011 words. 1011 words could be supposed to be an average vocabulary of a 3 year old child. The most interesting point in HARPY was the search techniques involved a heuristic search based algorithm called the BEAM SEARCH, which provided an optimal Stress management with Music Therapy Sougata Das 1 and Ayan Mukherjee 2 1 Senior Systems Engineer, IBM India Private Limited, Kolkata, India 2 Assistant Professor, Dept. of MCA, Brainware Group of Institutions, Barasat, Kolkata, India