From data to speech: a general approach

M. Theune, E.A.M. Klabbers, J.E.J.M. Odijk, J.R. Pijper, de, E.J. Krahmer

    Research output: Contribution to journalArticleAcademicpeer-review

    31 Citations (Scopus)
    1 Downloads (Pure)

    Abstract

    We present a data-to-speech system called D2S, which can be used for the creation of datato-speech systems in different languages and domains. The most important characteristic of a data-to-speech system is that it combines language and speech generation: language generation is used to produce a natural language text expressing the system's input data, and speech generation is used to make this text audible. In D2S, this combination is exploited by using linguistic information available in the language generation module for the computation of prosody. This allows us to achieve a better prosodic output quality than can be achieved in a plain text-to-speech system. For language generation in D2S, the use of syntactically enriched templates is guided by knowledge of the discourse context, while for speech generation pre-recorded phrases are combined in a prosodically sophisticated manner. This combination of techniques makes it possible to create linguistically sound but efficient systems with a high quality language and speech output.
    Original languageEnglish
    Pages (from-to)47-86
    Number of pages40
    JournalNatural Language Engineering
    Volume7
    Issue number1
    Publication statusPublished - 2001

    Fingerprint Dive into the research topics of 'From data to speech: a general approach'. Together they form a unique fingerprint.

    Cite this