Description
INTSINT codes the intonation of an utterance by means of an alphabet of 8 discrete symbols constituting a surface phonological representation of the intonation: ::T (Top), H (Higher), U (Upstepped), S (Same), M (mid), D (Downstepped), L (Lower), B (Bottom). These tonal symbols are considered ''phonological'' in that they represent discrete categories and ''surface'' since each tonal symbol corresponds to a directly observable property of the speech signal.Tonal alignment
The tones can be aligned with phonological constituents by means of the following alignment diacritics following the tonal symbol: :: (late), ">(initial), < (early), : (medial), > (late), (final) The relevant phonological constituent with which the tonal segments are aligned can be taken as the sequence of symbols between the following pair of slashes /.../. The following is an example of a transcription using thePhonetic interpretation
The phonetic interpretation of the INTSINT tonal segments can be carried out using two speaker dependent (or even utterance dependent) parameters. ::*key: like a musical key, this establishes an absolute point of reference defined by a fundamental frequency value (in hertz). ::*range: this determines the interval between the highest and lowest pitches of the utterance. In the current algorithm (Hirst 2004, 2005) the tonal segments can be converted to target points, like those generated by the Momel algorithm, using the following equivalences. P(i) in the following formulae refers to the current Pitch target, P(i-1) to the preceding pitch target. Pitch targets are normally calculated on a logarithmic scale. The targets T, M and B are defined 'absolutely' without regard to the preceding targets ::*T: P(i) := key + range/2 ::*M: P(i) := key ::*B: P(i) := key - range/2 Other targets are defined with respect to the preceding target: ::*H: P(i) := (P(i-1) + T) / 2 ::*U: P(i) := (3*P(i-1) + T) / 4 ::*S: P(i) := P(i-1) ::*D: P(i) := (3*P(i-1) + B) / 4 ::*L: P(i) := (P(i-1) + B) / 2Example
A sequence of tonal targets such as: :: T L H L H D Bassuming values for a female speaker of ''key'' as 240 Hz and ''range'' as 1 octave, would be converted to the following F0 targets: ::References
* Hirst, D.J. & Di Cristo, A. (eds) 1998. ''Intonation Systems. A survey of Twenty Languages''. (Cambridge, Cambridge University Press). 0-521-39513-5 (Hardback); 052139550X (Paperback) * Hirst, D.J. 2004. Lexical and Non-lexical Tone and Prosodic Typology. in ''Proceedings of International Symposium on Tonal Aspects of Languages''. Beijing, March 2004, 81-88 * Hirst, D.J. 2005. Form and function in the representation of speech prosody. in K.Hirose, D.J.Hirst & Y.Sagisaka (eds) ''Quantitative prosody modeling for natural speech description and generation'' (=''Speech Communication'' 46 (3-4)), 334-347External links
INTSINT automatic annotation can be performed with SPPAS