1 / 20

Semantic annotation of a dialog corpus

Semantic annotation of a dialog corpus. Silvie Cinková Institute of Formal and Applied Linguistics Charles University in Prague , Czech Republic COMPANIONS ( www.companions-project.org )

gen
Download Presentation

Semantic annotation of a dialog corpus

An Image/Link below is provided (as is) to download presentation Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author. Content is provided to you AS IS for your information and personal use only. Download presentation by click this link. While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server. During download, if you can't get a presentation, the file might be deleted by the publisher.

E N D

Presentation Transcript


  1. Semantic annotationof a dialog corpus Silvie Cinková Institute of Formal and Applied Linguistics Charles University in Prague, Czech Republic COMPANIONS (www.companions-project.org) European Commission Sixth Framework Programme Information Society Technologies Integrated Project IST-34434

  2. Data for machine learning • audio-synchronized transcription • linguistic annotation • Charles University (Czech Republic) • Napier University (Edinburgh, UK) • University of Sheffield (UK) • Oxford University (UK)

  3. Functional Generative Description • formal language description • Prague structuralism + computational ling. • since 1960's • stratifies language • phonology • morphology • surface syntax • underlying syntax (tectogrammatics) • transition between syntax and semantics • a "poor men's interlingua" 

  4. Tectogrammatical representation "Underlying syntax" • linguistic meaning • syntactic and semantic relations parent-child node(s) • valency • ellipsis restoration • coreference across sentence boundaries • information structure (TFA) • synonymous function  identical representation

  5. Tectogrammatical representation Is that Jess on the left?

  6. Tectogrammatical representation • ellipsis restoration • coreference Yes it is, laughing.

  7. written Prague Dependency Treebank Czech newspapers 800 k words manually LDC 2006 Wall Street Journal in progress, 15% so far monolog reporting standard language spoken dialogs real time interaction clause fragments exophora, deixis (syntax deviations) and challenges Current...

  8. Non-sentential utterances (NSU) • phrases (NP, PP, ADVP, ADJP) • Me. • At 5 o'clock. • Blue. • interjections • Mhm. • Oh, no! • interjections attached to phrases • No, Billy. • Oh, sure. • subordinate clause without main clause • If he goes with me. • Skiing. • phrase combinations in coordination or apposition • With Mary in the morning or shopping at Tesco. • Or without.

  9. Utterance-response pair response NSU utterance U "Who's that?" "Peggy." UPred UMods Functors (semantic labels)

  10. Utterance-response pair Who's that? [Peggy.] ("That is Peggy"). Peggy.

  11. Predicate with interjections Mhm. Yes. No, Billy.

  12. NSUMods versus UMods • attribute: response_type • values: • overrules • bridging • wh-path • other • form: reference (arrow) to antecedent node

  13. Non-conflicting Modifier addition [It will be] probably not [worth getting]. Yes [I brought the book].

  14. Overruling I'm at a little place called Ellenthorpe. Hellenthorpe.

  15. Bridging There are only two people in the class. Two students?

  16. Wh-path A: "Who's that?" B: "Peggy."

  17. Wh-path - different functor matches • up to the annotator • we expect regular alternation patterns Where would you like to go tomorrow? Shopping with Mary.

  18. Other A: He entered the largest room. B: Room 128? A: I don't know the number.

  19. Summary • U-NSU pairs • NSU inherits the predicate of U (coreference) • NSU inherits all modifiers of U • NSU's own modifiers overrule the inherited • overrule • bridging • wh-path • other

  20. References • Raquel Fernández, Jonathan Ginzburg, and Shalom Lappin (2007): Classifying Non-Sentential Utterances in Dialogue: A Machine Learning Approach. Computational Linguistics, Volume 33, Nr. 3. MIT Press for the Association for Computational Linguistics • Eva Hajičová (ed) (1995): Text-And-Inference-Based Approach to Question Answering, Prague, 1995

More Related