Help:Triphone voicebanks

An extension of the typical syllables based on a particular language is typically called triphones.

Triphones in UTAU are designed to pre-render blendings in pronunciations when UTAU is not enough to reproduce this. Quadphones are the same, but comprising of 4 letters.

=Triphones=

Introduction
Triphones started 1 year ago, as an experiment, to help with pronunciation blending. At that time, UTAU was still considered to be a VOCALOID knockoff, and so a solution to be at par with VOCALOID in terms of quality are being formulated.

How it works
Triphones are customized syllable combinations that go beyond the prescribed word syllables in a particular language. These are recorded like regular syllables in UTAU.

Uses
Triphones help make a voicebank sound more natural. Instead of blending the notes in UTAU, these syllables are pre-blended from the voice samples themselves.

Identifying a triphonic voicebank
How do you know if the voicebank is triphonic? You will need to analyze the file list of the voicebank.


 * 1) Does the voicebank have more 3-word syllable combinations than what the language normally has?
 * 2) When analyzing the syllable, does it comprise of a Consonant-Vowel-Vowel combination? (eg. sua, sui, sue, suo)
 * 3) Are there Vowel-Consonant-Vowel combination present as well? (eg. ika, iki, iku, ike, iko)

If 2 and 3 are true, the voicebank is triphonic.

Partial VS Full triphonic voicebanks
Due to the sheer amount of letter combinations in a triphonic voicebank, some voicebank authors may find it impractical to record every single triphonic letter combination. Because of this, some will record triphones according to what is difficult for the voicebank to blend.

However, this does not mean a voicebank with full triphones has never been attempted.

=Quadphones=

Introduction
When two syllables do not blend well in UTAU, the answer is to prerecord quadphones. While this is still not recognized, it is primarily coined to address blending problems in voicebanks.