1 / 11

Noriko Kando National Institute of Informatics

Noriko Kando National Institute of Informatics. Roadmap for Language Resources in the viewpoint from Information Access Technology Evaluation. Presented at: Roadmap for Language Resources and Evaluation In a Multilingual Environment: Organised by COCOSDA and WRITE Genoa, 28 May 2006

radley
Download Presentation

Noriko Kando National Institute of Informatics

An Image/Link below is provided (as is) to download presentation Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author. Content is provided to you AS IS for your information and personal use only. Download presentation by click this link. While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server. During download, if you can't get a presentation, the file might be deleted by the publisher.

E N D

Presentation Transcript


  1. Noriko KandoNational Institute of Informatics Roadmap for Language Resources in the viewpoint from Information Access Technology Evaluation Presented at: Roadmap for Language Resources and Evaluation In a Multilingual Environment: Organised by COCOSDA and WRITE Genoa, 28 May 2006 In Conjunction with LREC 2006

  2. Issues • LR and Information Access • Multi-linguality across Culture • Emerging Areas: Genres, Opinion, Subjectivity, Community-based Ontology

  3. LR and Information Access • Information Access (IA)Technologies (Information Retrieval, Question Answering, Summarization, Text mining, etc) needs better LR: coverage, richness, quality. • Evaluation • Development Ex. AQUAINT (Advanced QA) Program has supported Resources (WordNet, Gazetteer, etc.), Component Modules, and QA systems.

  4. LR and Information Access– cont’d • Extrinsic (Task-based) LR Evaluation • So far LR evaluation had placed emphasis on intrinsic evaluation. Eg. Accuracy, consistency, standards, etc. • Extrinsic LR evaluation: How LR improved the effectiveness of the IA technologies? • Good ways to appeal LR’s social importance. Easy to be understood by non-experts and sponsors

  5. LR and Information Access– cont’d • LR can be enriched or created through usage in IA systems Ex. Search Engine Query logs Users’ Relevance judgments, click through, etc.

  6. Multi-linguality • Axes to characterize CLIA systems • Languages • Type of media • Tasks and users • Success criteria or relevance judgments • Document genres • Layers of CLIR technologies • Information access process [Kando 2002; Gey, Kando & Peters 2002]

  7. Layers of Cross-Lingual IA Technologies; pragmatic layer: cultural & social aspects, semantic layer: concept mapping syntactic lager: lexical layer: language identify, indexing symbol layer: character codes physical layer: network [Kando 1999; 2002; Gey, Kando & Peters 2002]

  8. Multi-linguality in Pragmatic layer • Pragmatic layer of CLIA technologies • include issues related to text structure, intra & inter- text relationship • identifying the differences of the viewpoints across the languages or cultures is also critical.

  9. Emerging Areas • Esp. Conjunction with WEB, • Heterogeneous Document Genres • Subjectivity, Opinion, Emotion, etc. • Community-based or Domain-specific Ontology • Multi-faceted Ontology • Interactivity • Multi-modal

  10. Summary • LR and Information Access • Multi-linguality across Culture • Emerging Areas: Genres, Opinion, Subjectivity, Community-based Ontology

  11. Thanks Merci Danke schön Gracie Gracias Ta! Tack Köszönöm Kiitos Terima Kasih Khap Khun Ahsante Tak 謝謝 ありがとう

More Related