"An Annotation Scheme of Spoken Dialogues with Topic Break Indexes"
Y.Yamashita and M.Murai
Proc. of 6th International Conference on Spoken Language Processing (ICSLP2000), Beijing, 1, pp.569-572 (2000).
This paper proposes a scheme for annotating spoken dialogues with discourse-level information in terms of discourse segments. Instead of marking the beginning and ending utterances of each segment, dialogues are coded with a topic break index (TBI), which indicates the degree of topic break between adjacent discourse segments. TBI has two levels, 1 and 2, where TBI=2 indicates a large change of topic. Two methods are tried for assigning a TBI value to each segment boundary. In method-I, the coder assigns the TBI directly, according to the difference in content between the adjacent segments. In method-II, the coder classifies the relative change of the topic break between adjacent segments into three categories; the relative changes are then automatically converted into TBIs by extracting the local maxima of the topic break. The two annotation methods are evaluated in terms of their agreement scores and their relation to prosodic parameters.
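The conversion step in method-II can be illustrated with a minimal sketch. The abstract does not give the details, so everything specific here is an assumption: the three categories are encoded as +1 (larger break than at the previous boundary), 0 (about the same), and -1 (smaller); the codes are accumulated into a relative level per boundary; and a boundary receives TBI=2 when its level is a local maximum, TBI=1 otherwise. The function name `relative_changes_to_tbi` is hypothetical, and the paper's actual procedure may differ.

```python
def relative_changes_to_tbi(changes):
    """Convert per-boundary relative-change codes into TBI values.

    Assumed encoding (not taken from the paper): changes[i] is +1, 0, or -1,
    meaning the topic break at boundary i is larger than, similar to, or
    smaller than the break at boundary i-1.
    """
    # Accumulate the relative codes into a relative "break level" per boundary.
    levels = []
    level = 0
    for code in changes:
        if code not in (-1, 0, 1):
            raise ValueError(f"unexpected category: {code!r}")
        level += code
        levels.append(level)

    # A boundary is a local maximum if its level rose from the left neighbour
    # and does not rise further to the right (assumed plateau handling).
    tbis = []
    for i, lv in enumerate(levels):
        left = levels[i - 1] if i > 0 else float("-inf")
        right = levels[i + 1] if i < len(levels) - 1 else float("-inf")
        is_local_max = lv > left and lv >= right
        tbis.append(2 if is_local_max else 1)
    return tbis


if __name__ == "__main__":
    # Seven hypothetical segment boundaries coded by one annotator.
    print(relative_changes_to_tbi([0, 1, -1, 0, 1, 1, -1]))
```

Under these assumptions, only the boundaries where the accumulated level peaks are marked as large topic changes; a different treatment of plateaus or of the first and last boundaries would change the result.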
ftp article (gzipped ps-file, 4 pages, 38821 bytes)
ftp article (PDF-file, 4 pages, 100390 bytes)