Computer Science Project Topics

Design and Implementation of Text – to – Speech/audio System.

Design and Implementation of Text-to-Speech audio System.

Design and Implementation of Text-to-Speech/audio System.

CHAPTER ONE

OBJECTIVES OF STUDY

The main objective of the paper is to design and implement a Text-to-Speech/Audio System. The Speech/Audio system focuses precisely on the following objectives:

  1. To Design and Implement a Speech synthesizer that converts text to audio.
  2. To Design and Implement a System that can read out text in any frequency that user specifies.
  3. To design and implement a speech synthesizer that can read out text in both female and male voices.

CHAPTER TWO

LITERATURE REVIEW

A Text-To-Speech (TTS) synthesizer is a computer-based system that should be able to read any text aloud, whether it was directly introduced in the computer by an operator or scanned and submitted to an Optical Character Recognition (OCR) system. Let us try to be clear. There is a fundamental difference between the system we are about to discuss here and any other talking machine (as a cassette-player for example) in the sense that we are interested in the automatic production of new sentences. This definition still needs some refinements. Systems that simply concatenate isolated words or parts of sentences, denoted as Voice Response Systems, are only applicable when a limited vocabulary is required (typically a few one hundreds of words), and when the sentences to be pronounced respect a very restricted structure, as is the case for the announcement of arrivals in train stations for instance. In the context of TTS synthesis, it is impossible (and luckily useless) to record and store all the words of the language. It is thus more suitable to define Text-To-Speech as the automatic production of speech, through a grapheme-to-phoneme transcription of the sentences to utter.

At first sight, this task does not look too hard to perform. After all, is not the human being potentially able to correctly pronounce an unknown sentence, even from his childhood ? We all have, mainly unconsciously, a deep knowledge of the reading rules of our mother tongue. They were transmitted to us, in a simplified form, at primary school, and we improved them year after year. However, it would be a bold claim indeed to say that it is only a short step before the computer is likely to equal the human being in that respect. Despite the present state of our knowledge and techniques and the progress recently accomplished in the fields of Signal Processing and Artificial Intelligence, we would have to express some reservations. As a matter of fact, the reading process draws from the furthest depths, often unthought of, of the human intelligence.

 

CHAPTER THREE

METHODOLOGY AND ANALYSIS OF THE EXISTING

GENERAL DESCRIPTION OF THE EXISTING SYSTEM

In today’s world where most information is shared digitally, visually impaired persons always require their reading glasses to have access to this information, in a situation where they somehow forgot their reading glasses, they won’t be to have access this information. But with text to speech system digital information can be read out to a visually impaired person.

CHAPTER FOUR

DESIGN AND IMPLEMENTATION OF THE NEW SYSTEM

DESIGN STANDARD

OUTPUT SPECIFICATION AND DESIGN

The output design was based on the inputs. The report generated gives a meaningful report. These outputs can be generated as softcopy or printed in hard copy.

CHAPTER FIVE

SUMMARY, CONCLUSION AND RECOMMENDATIONS

SUMMARY

In summary, this Academic Work project has done a great deal of giving a broad knowledge of what Text-To-Speech system is all about and how it can be operated.

CONCLUSION

From this Academic Work, I have been able to show the application of Text-to-Speech and how text can be synthesized for the visually impaired and children to read easily.

RECOMMENDATION

I hereby recommend this Academic work to be used by staff and management of ……. and indeed any other Institution with similar structure and organizational framework for the following reasons:

  1. The academic work has been able to solve the problem related to easy access of contact information.
  2. It has aided the visually impaired to read without stress.
  3. It has aided Children to learn how to read and pronounce English words.

References

  • [Abrantes et al. 91]   A.J. ABRANTES, J.S. MARQUES, I.M. TRANSCOSO, “Hybrid Sinusoïdal Modeling of Speech without Voicing Decision”, EUROSPEECH 91, pp. 231-234.
  • [Allen 85]   J. ALLEN, “A Perspective on Man-Machine Communication by Speech”, Proceedings of the IEEE, vol. 73, n�11, November 1985, pp. 1541-1550.
  • [Allen et al. 87]   J. ALLEN, S. HUNNICUT, D. KLATT, From Text To Speech, The MITTALK System, Cambridge University Press, 1987, 213 pp.
  • [Bachenko & Fitzpatrick 90]   J. BACHENKO, E. Fitzpatrick, “Acomputational grammar of discourse-neutral prosodic phrasing in English”, Computational Linguistics, n�16, September 1990, pp. 155-167.
  • [Belrhali et al. 94]  R. BELRHALI, V. AUBERGE, L.J. BOE, “From lexicon to rules : towards a descriptive method of French text-to-phonetics transcription”, Proc. ICSLP 92, Alberta, pp. 1183-1186.
  • [Benello et al. 88]   J. BENELLO, A.W. MACKIE, J.A. ANDERSON, “Syntactic category disambiguation with neural networks”, Computer Speech and Language, 1989, n�3, pp. 203-217.
WeCreativez WhatsApp Support
Our customer support team is here to answer your questions. Ask us anything!