
Status report — UNL’02




  1. Status report — UNL’02 French team

  2. UNL evangelisation: papers (1)
     [1] Boitet C. (2002) Advantages of the UNL language and format for web-oriented crosslingual applications. Proc. Seminar on linguistic meaning representation and their applications over the World Wide Web, Penang, 19-21/8/02, USM/UTMK, E. K. Tang ed., 4 p.
     [2] Boitet C. & Tsai W.-J. (2002) Coedition to share text revision across languages. Proc. COLING-02 WS on MT, Taipei, 1/9/2002, 8 p.
     [3] Boitet C. (2002) The translation of examples, citations, definitions and glosses in the Papillon project. Proc. Papillon-02 seminar on multilingual lexical data bases, Tokyo, 16-18/7/02, NII, M. Mangeot ed., 10 p.
     [4] Boitet C., Mangeot M. & Sérasset G. (2002) The PAPILLON project: cooperatively building a multilingual lexical data-base to derive open source dictionaries & lexicons. Proc. LREC-02 (International Conference on Language Resources and Evaluation), Las Palmas, 26-31/5/2002, A. Zampolli & I. Prodanoff ed., 7 p. (accepted)
     [5] Boitet C. (2002) A rationale for using UNL as an interlingua and more in various domains. Proc. LREC-02 First International Workshop on UNL, other Interlinguas, and their Applications, Las Palmas, 26-31/5/2002, ELRA/ELDA, J. Cardeñosa ed., pp. 23-26.
     [6] Lafourcade M. & Boitet C. (2002) UNL Lexical selection with conceptual vectors. Proc. LREC-02 (International Conference on Language Resources and Evaluation), Las Palmas, 26-31/5/2002, A. Zampolli & I. Prodanoff ed., vol. VI/VI, pp. 1958-1964.

  3. UNL evangelisation: papers (2)
     [7] Blanc É. & Sérasset G. (2001) From Graph to Tree: Processing UNL Graphs using an Existing MT System. Proc. The first UNL open Conference, Suzhou, China, 22-26 November 2001, UNDL.
     [8] Sérasset G. & Mangeot-Lerebours M. (2001) Papillon Lexical Database Project: Monolingual Dictionaries & Interlingual Links. Proc. NLPRS-2001, NII, Tokyo, 27-30 November 2001, pp. 119-125.
     [9] Blanc É. (2001) An interactive hypertextual environment for MT training. Proc. MT Summit VIII, Workshop on Teaching Machine Translation, Santiago de Compostela, 20-22 Sept. 2001, European Association for Machine Translation (c/o ISSCO), M. L. Forcada, J. A. Perez-Ortiz & D. R. Lewis ed., pp. 51-55.
     [10] Boitet C. (2001) Die Zukunft der maschinellen Uebersetzung von Texten und gesprochenen Dialogen. Vier technische und organizatorische Schlüssel, um mehr Sprachen zu behandeln und die Qualität zu verbessern. Proc. 2. Hungarian Humboldt Conference, Budapest, 16-18/11/2001, Humboldt association, 8 p. (text in English, slides in German)
     [11] Tsai W.-J. (2001) An introduction to UNL (Universal Networking Language). Proc. 22nd Verein der Chinesen (Taiwanesen) in Deutschland für Forschung und Lehre (VCTDFL), Strasbourg, France, 1-4 Nov. 2001, Verein der Chinesen (Taiwanesen) in Deutschland für Forschung und Lehre (VCTDFL).

  4. UNL evangelisation: papers (3)
     [12] Tsai W.-J. (2001) SWIIVRE - a web site for the Initiation, Information, Validation, Research and Experimentation on UNL (Universal Networking Language). Proc. First UNL Open Conference - Building Global Knowledge with UNL, Suzhou, China, 18-20 Nov. 2001, GETA, CLIPS, IMAG, G. UNDL ed., 8 p.
     [13] Boitet C. (2001) À propos de la traduction automatique. La TILV (Tribune Internationale des Langues Vivantes) 30 (2001), pp. 16-17.
     [14] Boitet C. (2001) Machine Translation. In "Encyclopedia of Cognitive Science", Nature Publishing Group, London, 24 p. (manuscript form) (to appear)
     [15] Tomokiyo M., Al-Assimi A.-B. & Boitet C. (2001) Multilingual documents management by using Universal Networking Language UNL on Alignment Gestion Tool OGA. Proc. PACLING'01, Fukuoka, 11-14/9/2001, H. Sakaki ed., 7 p.

  5. UNL evangelisation: actions
     • Drawing new members to UNL
       • Malay (Zarin & Tang, USM, Penang)
       • Vietnamese (Phan, IT, Da Nang)
     • Introducing reference/usage of UNL
       • Papillon project: reference to UWs, graphs
     • Scientific promotion
       • Seminars: TALN-02, LREC-02, SSLMR-02, COLING-02

  6. Current state of affairs (1)
     • Almost no external funding
       • UNL project: 30% in 1997, 15% in 1998, some trips from 1999 on
       • French bodies (MEA, SFERE): some trips from 2000 on
       • No success in about 12 proposals to the EU… but the tide seems about to change
     • Internal funding from our lab
       • Our own work: server, deconverter, enconverter, SWIIVRE
       • Student work: master's and BA levels, linguistics and CS
     • Some money from other projects

  7. Current state of affairs (2)
     • Deconverter available on server
       • ≈50,000 UWs connected to French lemmas (KB unavailable)
       • When a UW is unknown, the development environment puts everything in the Ariane tree and gets quite good results
       • But the server lags behind and still chooses the first possible French lemma (from HW)
     • Work on the enconverter has begun
       • One Ph.D. student is trying data-driven methods
       • E. Blanc is using the rule-based method
     • The SWIIVRE web site offers some services
       • Information about available UNL-LLL servers
       • Multilingual deconversion
       • Graphical editing of UNL-trees

  8. Example of deconversion
     L’Université des Nations Unies est une organisation internationale académique des Nations Unies et rend compte à l’assemblée générale des Nations Unies et à la conférence générale de l’UNESCO. sa mission est que des problèmes globaux urgent? qui sont les responsabilités des Nations Unies et de ses états membres soient abordés. l’UNU apporte ensemble pour remplir sa mission la contribution d’universitaires de pays développés et en voie de développement et d’institutions. Il fonctionne par un réseau d’un centre de recherche de formation (un <RTC>) ou d’un programme de recherche et de formation (un RTP). la structure <organisationnelle> de l’UNU comprend (un tableau est vu) des corps constituants suivants. le conseil d’université fixe des principes et des politiques pour l’université. le président présent est <dr> Jairam_Reddy. le <recteur> académique et administratif général de l’université est le responsable avec la responsabilité de la direction de son programme global et d’une organisation et d’une administration. le recteur présent est H_van_Ginkel. le <University_Center> assiste bien que les relations bien que les relations <institutionnel> et <programmatif> de l’université de <recteur> dans le développement global et un <planning> et une coordination et des activités. la conférence d’un directeur (une <CONDIR>) rassemble les directeurs des <RTC> et des <RTP> et il est appelé pour des programmes d’une recherche qui est entreprise discuter et évaluer et le <recteur> est assister dans l’amélioration de programmes présents.

  9. Current development

  10. Current development
     • The dictionary server is finished
       • Multithreaded server
       • Data is indexed by an AVL tree
     • The deconversion server has been operational since 2001
       • Multithreaded server
       • Reentrant flex/bison parser for input validation
       • Able to process whole UNL-html documents
       • But it now lags behind the development environment
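The slide only states that the dictionary data is indexed by an AVL tree. As a rough illustration of what such an index could look like, here is a minimal sketch of an AVL tree keyed on UW strings, each node holding its candidate French lemmas. The names `Node`, `insert` and `lookup` are hypothetical, and the `book(icl>publication)` and `eat(icl>do)` entries are invented sample data; this is not the server's actual code.

```python
from dataclasses import dataclass
from typing import List, Optional


@dataclass
class Node:
    key: str                       # a UW string, e.g. "cookie(fld>internet)"
    lemmas: List[str]              # French lemmas attached to that UW
    left: Optional["Node"] = None
    right: Optional["Node"] = None
    height: int = 1


def height(n: Optional[Node]) -> int:
    return n.height if n else 0


def update(n: Node) -> None:
    n.height = 1 + max(height(n.left), height(n.right))


def rotate_right(y: Node) -> Node:
    x = y.left
    y.left, x.right = x.right, y
    update(y)
    update(x)
    return x


def rotate_left(x: Node) -> Node:
    y = x.right
    x.right, y.left = y.left, x
    update(x)
    update(y)
    return y


def insert(node: Optional[Node], key: str, lemma: str) -> Node:
    """Insert (key, lemma) and return the new, rebalanced subtree root."""
    if node is None:
        return Node(key, [lemma])
    if key == node.key:
        node.lemmas.append(lemma)  # several lemmas may share one UW
        return node
    if key < node.key:
        node.left = insert(node.left, key, lemma)
    else:
        node.right = insert(node.right, key, lemma)
    update(node)
    balance = height(node.left) - height(node.right)
    if balance > 1:                # left-heavy
        if height(node.left.left) < height(node.left.right):
            node.left = rotate_left(node.left)
        return rotate_right(node)
    if balance < -1:               # right-heavy
        if height(node.right.right) < height(node.right.left):
            node.right = rotate_right(node.right)
        return rotate_left(node)
    return node


def lookup(node: Optional[Node], key: str) -> List[str]:
    """Return the lemmas stored for a UW, or an empty list if unknown."""
    while node:
        if key == node.key:
            return node.lemmas
        node = node.left if key < node.key else node.right
    return []


# Invented sample entries, except the cookie example taken from the slides.
index = None
for uw, lemma in [
    ("book(icl>publication)", "livre"),
    ("cookie(fld>internet)", "témoin de connexion"),
    ("eat(icl>do)", "manger"),
]:
    index = insert(index, uw, lemma)
```

Because the tree rebalances on every insertion, lookups stay logarithmic in the number of UWs even when entries are loaded in sorted order, which is the usual reason for preferring an AVL tree over a plain binary search tree.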

  11. Improvements available (2000-)
     • Extension of the UNL dictionary model
       • A tree may be specified for a UW translation
       • This allows the translation of UWs like: cookie(fld>internet) --> témoin de connexion
     • Possibility to translate texts using UWs like "United Nation Board of Directors(…)"
       • …even though these UWs seem to go against the spirit of UNL! (but see proposals for encoding them)
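The extended dictionary model can be pictured as an entry whose translation is either a plain lemma or a small tree. The sketch below is only an illustration under that assumption: `TreeNode`, `Translation` and `linearize` are hypothetical names, the tree shape chosen for the cookie example is a guess, and a real Ariane tree carries far more information than a lemma per node.

```python
from dataclasses import dataclass, field
from typing import Dict, List, Union


@dataclass
class TreeNode:
    lemma: str
    children: List["TreeNode"] = field(default_factory=list)


# A translation is either a single lemma or a tree of lemmas (assumed model).
Translation = Union[str, TreeNode]


def linearize(t: Translation) -> str:
    """Naive head-first linearization: the node's lemma, then its children."""
    if isinstance(t, str):
        return t
    return " ".join([t.lemma] + [linearize(c) for c in t.children])


# The cookie example comes from the slide; the second entry is invented.
dictionary: Dict[str, Translation] = {
    "cookie(fld>internet)": TreeNode(
        "témoin", [TreeNode("de", [TreeNode("connexion")])]
    ),
    "book(icl>publication)": "livre",
}

rendered = {uw: linearize(t) for uw, t in dictionary.items()}
```

Keeping both cases behind one `linearize` function means the rest of a deconverter would not need to know whether a UW maps to one word or to a multiword expression.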

  12. Available deconverters
     • Servers (according to the SWIIVRE-UNL robot)
       • Brazilian Portuguese (on UNDL site)
       • French
       • Italian
       • Russian
       • Spanish
     • No longer available: Arabic, Indonesian, Thai
     • Chinese: no deconversion site, but downloadable deconverter for Windows
     • Never available (never gave a working URL)
       • English, German, Hindi, Japanese, Mongolian

  13. Perspectives of UNL-FR (1/2)
     • Finding funds for concrete projects and developments: very hard!
       • already 12 unsuccessful proposals to the EU (see J. Cardeñosa's paper at LREC-02)
       • participation in FP6 (PCRD-6) proposals
     • Finding and funding permanent linguists: hard!
       • S. Helme for 6 months in 2002, again 6 months in 2003 (from another project!)
       • E. Blanc retired in Nov. 2001 but actively continues his research on UNL

  14. Perspectives of UNL-FR (2/2)
     • Continuing to work with local resources
       • Linguistics students for development/debugging of DeCo-FR
       • CS students working on parts of the server
       • WJ Tsai (Ph.D. student) works on
         • "UNL usage/usability web site"
         • interactive improvement of UNL-graphs from any (target) language by "text-graph coedition"

  15. 4 suggestions for UNLP
     • 2 suggestions from the Geneva 2001 meeting
       • Let UNLP develop like the W3C
       • Start group(s) for ENGLISH
         • big obstacle to prepare UNL-related projects
         • how many? at least 1 in Europe, 1 in the US…
     • 2 new suggestions
       • XML-ify the UNL format
         • Officially for a "minimal" format equivalent to UNL-html (we may start from Tsai WJ's proposal)
         • Guidelines for building specialized UNL XML tags & attributes
       • Continue to get scientific respectability
         • New presence of UNLP in public forums: GOOD!
         • Will lead to more competent & "free" contributions
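To make the "XML-ify" suggestion concrete, here is a minimal sketch of how one UNL sentence, given as a list of binary relations, might be serialized to XML. The element and attribute names (`unl-sentence`, `relation`, `source`, `target`, `label`, `id`) are assumptions made for this illustration only; they are not taken from Tsai WJ's proposal or from any official UNL format, and the sample relation is invented.

```python
import xml.etree.ElementTree as ET
from typing import List, Tuple

# One relation: (label, source UW, target UW), e.g. agt(eat..., John...).
Relation = Tuple[str, str, str]


def unl_to_xml(sentence_id: str, relations: List[Relation]) -> str:
    """Serialize a list of UNL relations into a hypothetical XML layout."""
    root = ET.Element("unl-sentence", {"id": sentence_id})
    for label, source, target in relations:
        rel = ET.SubElement(root, "relation", {"label": label})
        ET.SubElement(rel, "source").text = source
        ET.SubElement(rel, "target").text = target
    # ElementTree escapes characters such as ">" inside UWs automatically.
    return ET.tostring(root, encoding="unicode")


# Invented sample sentence with a single agent relation.
sample_xml = unl_to_xml(
    "1", [("agt", "eat(icl>do).@entry", "John(iof>person)")]
)
```

A practical benefit of such a layout is that standard XML tooling handles escaping and validation, so characters like `>` in UWs need no special treatment by the application.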
