Presentation: Text to speech synthesis
- File type: ppt / pptx (PowerPoint)
- Total slides: 37
- For grades: 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11
- File size: 4.57 MB
- Author: unknown
Slides and text of this presentation:
Slide 11
Slide content: NLP and DSP Modules
The NLP module produces a phonetic transcription of the text to be read, together with the desired intonation and rhythm. It takes text as input and outputs a narrow phonetic transcription, which is passed on to the DSP module. The DSP module transforms the symbolic information it receives into natural-sounding speech. The form of this intermediate “narrow phonetic transcription” varies from one synthesizer to another.
Slide 15
Slide content: Preprocessor
Takes in text as strings of ASCII characters
Transforms the text into Broad Segmentation Units (BSUs), each of which is one of the following:
A sequence of alphabetic characters
A sequence of digits
A single punctuation mark or other special character
A sequence of white space characters
E.g.: (I)()(know)()(1)(,)(000)()(words)(,)()(Dr)(.)()(Jones)(.)
Rewrites the BSUs into a list of word-like units and syntax-bearing punctuation marks, called Final Segmentation Units (FSUs).
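The BSU segmentation above can be sketched with a single regular expression. This is a minimal illustration, not the preprocessor of any particular system: the names `BSU_PATTERN`, `segment_bsus` and `show` are hypothetical, and "a sequence of characters" is assumed to mean a run of ASCII letters, as the slide's example suggests.

```python
import re

# One alternative per BSU class from the slide, tried in this order:
# letters, digits, white space, then any single remaining character.
BSU_PATTERN = re.compile(r"[A-Za-z]+|[0-9]+|\s+|.", re.DOTALL)

def segment_bsus(text):
    """Split ASCII text into Broad Segmentation Units (BSUs)."""
    return BSU_PATTERN.findall(text)

def show(bsus):
    """Render BSUs in the slide's notation; white space is shown as ()."""
    return "".join("()" if unit.isspace() else "(" + unit + ")" for unit in bsus)

print(show(segment_bsus("I know 1,000 words, Dr. Jones.")))
# (I)()(know)()(1)(,)(000)()(words)(,)()(Dr)(.)()(Jones)(.)
```

Note that "1,000" and "Dr." are still split at this stage; joining them is the job of the later rewrite into FSUs.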
Slide 16
Slide content: Preprocessor
Sentence end detection (a colon may mark a ratio or a time; a period may mark a decimal point or a sentence ending)
Abbreviations (e.g. “e.g.” for “for instance”)
Changed to their full form with the help of lexicons
Acronyms (I.B.M. can be read as a sequence of letters, whereas NASA can be read as an ordinary word)
Numbers (once detected, interpreted as a rational number, time of day, date or ordinal, depending on context)
Idioms (e.g. “in spite of”, “as a matter of fact”: these are combined into a single FSU using a special lexicon)
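Two of the steps above, abbreviation expansion and idiom merging, can be sketched as lexicon lookups. The lexicon contents, the function names and the assumption that the input is already split into word-like tokens are all illustrative and are not taken from the slides.

```python
# Hypothetical lexicons; a real system would load far larger ones.
ABBREVIATIONS = {"e.g.": "for instance", "Dr.": "Doctor"}
IDIOMS = [("in", "spite", "of"), ("as", "a", "matter", "of", "fact")]

def expand_abbreviations(tokens):
    """Replace each known abbreviation with its full form."""
    return [ABBREVIATIONS.get(token, token) for token in tokens]

def merge_idioms(tokens):
    """Combine each known idiom into a single FSU (case-insensitive match)."""
    result = []
    i = 0
    while i < len(tokens):
        for idiom in IDIOMS:
            n = len(idiom)
            window = tuple(t.lower() for t in tokens[i:i + n])
            if window == idiom:
                result.append(" ".join(tokens[i:i + n]))
                i += n
                break
        else:
            # No idiom starts at position i: keep the token as it is.
            result.append(tokens[i])
            i += 1
    return result

tokens = "In spite of rain , Dr. Jones came".split()
print(merge_idioms(expand_abbreviations(tokens)))
```

Sentence end detection and number interpretation need context (for instance, whether a period follows a known abbreviation), so they are left out of this sketch.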
Slide 19
Slide content: Content Words
Content words: effectively infinite in number
They require morphology: the part of linguistics that describes word forms as a function of a reduced set of abstract meaning-bearing units called morphemes.
Inflected, derived and compound words (content words) are decomposed into their elementary graphemic units (morphemes)
Uses regular grammars that exploit lexicons of stems and affixes, the only workable approach given the unbounded number of content words
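As a rough illustration of decomposition with stem and affix lexicons, the sketch below accepts words of the regular form "optional prefix, stem, any number of suffixes". The tiny lexicons and the function names are hypothetical, and spelling changes at morpheme boundaries (such as happy + ness = happiness) are deliberately ignored.

```python
# Hypothetical toy lexicons; tuples keep the search order deterministic.
PREFIXES = ("un", "re")
STEMS = ("read", "think", "friend")
SUFFIXES = ("s", "ed", "ing", "able", "ly", "ness")

def split_suffixes(rest):
    """Return rest as a list of known suffixes, or None if that is impossible."""
    if not rest:
        return []
    for suffix in SUFFIXES:
        if rest.startswith(suffix):
            tail = split_suffixes(rest[len(suffix):])
            if tail is not None:
                return [suffix] + tail
    return None

def decompose(word):
    """Decompose word into [prefix?] + [stem] + suffixes, or return None."""
    for prefix in ("",) + PREFIXES:
        if not word.startswith(prefix):
            continue
        body = word[len(prefix):]
        for stem in STEMS:
            if body.startswith(stem):
                suffixes = split_suffixes(body[len(stem):])
                if suffixes is not None:
                    return ([prefix] if prefix else []) + [stem] + suffixes
    return None

print(decompose("unreadable"))   # ['un', 'read', 'able']
print(decompose("rethinking"))   # ['re', 'think', 'ing']
```

Words that cannot be decomposed return `None`, which is where a real system would fall back to another strategy.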
Slide 20
Slide content: Contextual Analysis
Considers words in their context
Reduces the list of possible part-of-speech categories for each word to a very small number of highly probable hypotheses, given the possible parts of speech of neighboring words.
Achieved with N-grams, multi-layer perceptrons (neural networks), local stochastic grammars (provided by expert linguists), etc.
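A bigram version of this idea can be sketched as follows: every word has a list of candidate tags, and the tag sequence with the highest product of tag-to-tag probabilities wins. The candidate lists and all probabilities below are invented for illustration, and the exhaustive search stands in for the dynamic programming a real tagger would use.

```python
from itertools import product

# Hypothetical candidate tags per word.
CANDIDATES = {
    "they": ["PRON"],
    "the": ["DET"],
    "record": ["NOUN", "VERB"],
    "music": ["NOUN"],
}

# Hypothetical bigram probabilities P(current tag | previous tag).
BIGRAM = {
    ("PRON", "VERB"): 0.6,
    ("PRON", "NOUN"): 0.05,
    ("DET", "NOUN"): 0.7,
    ("DET", "VERB"): 0.01,
    ("VERB", "NOUN"): 0.4,
    ("NOUN", "NOUN"): 0.2,
}
UNSEEN = 1e-4  # small probability for tag pairs missing from the table

def sequence_score(tags):
    """Product of bigram probabilities along a tag sequence."""
    score = 1.0
    for previous, current in zip(tags, tags[1:]):
        score *= BIGRAM.get((previous, current), UNSEEN)
    return score

def best_tags(words):
    """Pick the highest-scoring tag sequence among all candidate combinations."""
    options = [CANDIDATES[word] for word in words]
    return list(max(product(*options), key=sequence_score))

print(best_tags(["they", "record", "music"]))  # ['PRON', 'VERB', 'NOUN']
print(best_tags(["the", "record"]))            # ['DET', 'NOUN']
```

The same word, "record", receives different tags in the two sentences purely because of its neighbors.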
Slide 21
Slide content: Letter to Sound Module
The LTS module is responsible for the automatic determination of the phonetic transcription of the incoming text
It cannot simply look words up in a pronunciation dictionary
Spelling does not follow the rule “one character = one phoneme”
Examples
A single character can correspond to two phonemes: x as /ks/
Several characters can produce one phoneme:
gh in thought
A single character can be pronounced in different ways:
c in ancestor, ancient, epic
A single phoneme can have several spellings:
sh in dish, t in action, c in ancient
Slide 22
Slide content: Letter to Sound Module
Some of the cases to consider
Consonants may be reduced or deleted in clusters (e.g. t in softness)
Assimilation, which originates in articulatory constraints and changes some phonological features of a given phoneme (e.g. obstacle)
Heterophonic homographs, which are pronounced differently even though they have the same spelling (e.g. record, contrast)
Phonetic liaisons, which affect the final consonants of French words immediately followed by a vowel sound, resulting in the pronunciation of characters that are otherwise silent, or in a change of pronunciation
Schwas (unstressed vowels are reduced to short central vowels or simply deleted, as in thoughtful and interesting)
Vowel lengthening, new words, and proper nouns, whose correct pronunciation depends heavily on the language of origin.
Slide 24
Slide content: Dictionary Based
The dictionary-based strategy stores a maximum of phonological knowledge in a lexicon. Entries are generally restricted to morphemes, and the pronunciation of surface forms is accounted for by inflectional, derivational and compounding morphophonemic rules, which describe how the phonetic transcriptions of morphemic constituents are modified when they are combined into words. Words that are not in the lexicon are transcribed by rule.
Slide 25
Slide content: Rule Based
The rule-based strategy transfers most of the phonological competence of dictionaries into a set of letter-to-sound (grapheme-to-phoneme) rules. Words that are pronounced in such a particular way that they constitute a rule of their own are stored in an exceptions dictionary.
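The rule-based strategy can be illustrated with a toy transcriber that checks an exceptions dictionary first and otherwise applies ordered letter-to-sound rules. The rule set, the exception entry and the function name are hypothetical; real rule sets are context-sensitive and far larger than this context-free sketch.

```python
# Hypothetical exceptions dictionary: words that are "a rule of their own".
EXCEPTIONS = {"thought": ["θ", "ɔː", "t"]}

# Hypothetical ordered rules; multi-letter graphemes come before single letters.
RULES = [
    ("sh", ["ʃ"]),      # several characters, one phoneme
    ("x", ["k", "s"]),  # one character, two phonemes
    ("b", ["b"]),
    ("d", ["d"]),
    ("i", ["ɪ"]),
    ("o", ["ɒ"]),
]

def transcribe(word):
    """Return a list of phonemes for word, using exceptions first, then rules."""
    word = word.lower()
    if word in EXCEPTIONS:
        return list(EXCEPTIONS[word])
    phonemes = []
    i = 0
    while i < len(word):
        for grapheme, phones in RULES:
            if word.startswith(grapheme, i):
                phonemes.extend(phones)
                i += len(grapheme)
                break
        else:
            raise ValueError("no rule for " + repr(word[i]))
    return phonemes

print(transcribe("dish"))     # ['d', 'ɪ', 'ʃ']
print(transcribe("box"))      # ['b', 'ɒ', 'k', 's']
print(transcribe("thought"))  # ['θ', 'ɔː', 't']
```

The dictionary-based strategy inverts the balance: most knowledge lives in the lexicon, and rules like these are only the fallback for unknown words.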
Slide 28
Slide content: Morpho-Phonemic Module in Dictionary based
This module deals with phonological changes. It distinguishes the following:
Rules for changing phonological features (e.g. ion and ure in completion and exposure)
Rules for deleting or inserting phonemes (e.g. buses or landed)
Rules that account for stress shift in languages such as English or German (e.g. adApt + ation = adaptAtion, whereas the stress does not move in abOrt + ion = abOrtion).
These are implemented with rewrite rules and with two-level rules [Koskenniemi, 1983].
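The phoneme-insertion case (buses) can be sketched as a rewrite rule on the English plural suffix: a vowel is inserted after a sibilant, and otherwise the suffix agrees in voicing with the final phoneme of the stem. The phoneme sets and the function name are illustrative assumptions, and the input is assumed to be a list of phoneme symbols.

```python
# Phoneme classes used by the rule (illustrative, not exhaustive).
SIBILANTS = {"s", "z", "ʃ", "ʒ", "tʃ", "dʒ"}
VOICELESS = {"p", "t", "k", "f", "θ"}

def add_plural(stem_phonemes):
    """Attach the plural suffix to a stem given as a list of phonemes."""
    last = stem_phonemes[-1]
    if last in SIBILANTS:
        suffix = ["ɪ", "z"]  # vowel insertion, as in buses
    elif last in VOICELESS:
        suffix = ["s"]       # voiceless suffix, as in cats
    else:
        suffix = ["z"]       # voiced suffix, as in dogs
    return stem_phonemes + suffix

print(add_plural(["b", "ʌ", "s"]))  # ['b', 'ʌ', 's', 'ɪ', 'z']
print(add_plural(["k", "æ", "t"]))  # ['k', 'æ', 't', 's']
print(add_plural(["d", "ɒ", "g"]))  # ['d', 'ɒ', 'g', 'z']
```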
Slide 30
Slide content: Phonetic Post Processing
To increase the intelligibility and naturalness of synthetic speech, some phonetic post-processing is required. Once a first phonemic transcription of each word has been obtained, post-processing is applied to account for coarticulatory smoothing. This smoothing improves the quality of the resulting speech.
Slide 31
Slide content: Syntactic Prosodic Parser
Prosody refers to properties of the speech signal that are related to audible changes in pitch, loudness and syllable length. It is also referred to as intonation. Prosodic features signal focus, relationships between words, and finality, each of which has a specific function in speech communication.
Slide 34
Slide content: Syntactic Prosodic Parser
These prosodic groups can be obtained with a very crude algorithm called chinks ’n chunks, by Liberman and Church [1992], in which prosodic phrases are accounted for by the simple regular rule:
A (minor) prosodic phrase = a sequence of chinks followed by a sequence of chunks
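The rule above can be sketched in a few lines: a new phrase starts whenever a chink follows a chunk. The slide does not define the two classes, so this sketch assumes that chinks are function words and chunks are content words; the word list and the function name are hypothetical.

```python
# Hypothetical, very small list of chinks (assumed here to be function words).
CHINKS = {"the", "a", "an", "on", "in", "of", "to", "and"}

def prosodic_phrases(words):
    """Group words into phrases of the form: chinks followed by chunks."""
    phrases = []
    current = []
    previous_was_chunk = False
    for word in words:
        is_chink = word.lower() in CHINKS
        # A chink right after a chunk closes the current phrase.
        if is_chink and previous_was_chunk:
            phrases.append(current)
            current = []
        current.append(word)
        previous_was_chunk = not is_chink
    if current:
        phrases.append(current)
    return phrases

sentence = "the cat sat on the mat in the house".split()
print(prosodic_phrases(sentence))
# [['the', 'cat', 'sat'], ['on', 'the', 'mat'], ['in', 'the', 'house']]
```

Because the rule only looks at word classes, it needs no syntactic parse, which is what makes the algorithm crude and also cheap.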
Slide 36
Slide content: Why we need a TTS system
A high-quality text-to-speech synthesis system has several applications
It is widely useful in telecommunications, relay services, language education, aids for people with disabilities, talking books and toys, vocal monitoring, multimedia, man-machine communication, etc.