1 / 49

Biblia Medieval: a parallel corpus of medieval Spanish . Andrés Enrique-Arias Laura Camargo Fernández Universitat de les

Biblia Medieval: a parallel corpus of medieval Spanish . Andrés Enrique-Arias Laura Camargo Fernández Universitat de les Illes Balears andres.enrique@uib.es laura.camargo@uib.es. www.bibliamedieval.es. Problems in the study of variation and change through written sources:.

Gabriel
Download Presentation

Biblia Medieval: a parallel corpus of medieval Spanish . Andrés Enrique-Arias Laura Camargo Fernández Universitat de les

An Image/Link below is provided (as is) to download presentation Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author. Content is provided to you AS IS for your information and personal use only. Download presentation by click this link. While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server. During download, if you can't get a presentation, the file might be deleted by the publisher.

E N D

Presentation Transcript


  1. Biblia Medieval: a parallel corpus of medieval Spanish. Andrés Enrique-Arias Laura Camargo Fernández Universitat de les Illes Balears andres.enrique@uib.es laura.camargo@uib.es www.bibliamedieval.es

  2. Problems in the study of variation and change through written sources: • 1) it is rather difficult to identify and to define the contexts of occurrenceof morphosyntactic variables: they are normally conditioned by a complex set of factors (syntactic, semantic, pragmatic). • 2) difficulty in identifying and controlling contextual dimensions that potentially condition variation in older texts (text type and genre, poeticality, orality, dialect, writer demographics, etc.)

  3. Problems in the study of variation and change through written sources: • 1) it is rather difficult to identify and to define the contexts of occurrenceof morphosyntactic variables: they are normally conditioned by a complex set of factors (syntactic, semantic, pragmatic). • 2) difficulty in identifying and controlling contextual dimensions that potentially condition variation in older texts (text type and genre, poeticality, orality, dialect, writer demographics, etc.)

  4. 3) to minimize these problems it would be necessary to locate and to examine a large number of occurrences of the same linguistic structure in versions produced at different time periods and in texts influenced by the same textual conventions. • However conventional corpora are not well suited to do this • 4) Furthermore, historical texts often come down to us devoid of their social context • AS A RESULT • the linguist cannot control the contextual dimensions that characterize each individual text facing the risk of a partial or misleading analysis

  5. 3) to minimize these problems it would be necessary to locate and to examine a large number of occurrences of the same linguistic structure in versions produced at different time periods and in texts influenced by the same textual conventions. • However conventional corpora are not well suited to do this • 4) Furthermore, historical texts often come down to us devoid of their social context • AS A RESULT • the linguist cannot control the contextual dimensions that characterize each individual text facing the risk of a partial or misleading analysis

  6. The adverse consequences of such risks are reduced when the linguist has access to what have been named parallel corpora: texts which are translational equivalents of a single original but have been composed at different time periods. a) have the same underlying content b) are influenced by similar textual conventions • Advantages of a parallel corpus: a) the linguist can observe the historical evolution of structural phenomena (phonetic, morphosyntactic, etc) and arrive at more nuanced generalizations b) access to the underlying text and its translational equivalents facilitates the interpretation of linguistic structures whose meaning or function is not clear.

  7. The adverse consequences of such risks are reduced when the linguist has access to what have been named parallel corpora: texts which are translational equivalents of a single original but have been composed at different time periods. a) have the same underlying content b) are influenced by similar textual conventions • Advantages of a parallel corpus: a) the linguist can observe the historical evolution of structural phenomena (phonetic, morphosyntactic, etc) and arrive at more nuanced generalizations b) access to the underlying text and its translational equivalents facilitates the interpretation of linguistic structures whose meaning or function is not clear.

  8. Biblia Medieval: a parallel corpus of Medieval Spanish • Biblical translations are especially useful material for compiling a historical parallel corpus of Spanish : • they are the only texts for which there exist versions composed at every period of the language’s history . However… • biblical translations have barely been used as sources of data in the diachronic study of Spanish. • Reason: lack of medieval biblical versions readily available to philologists and linguists (many such texts have remained unpublished until recently or in hard-to-find editions).

  9. Biblia Medieval: a parallel corpus of Medieval Spanish • Biblical translations are especially useful material for compiling a historical parallel corpus of Spanish : • they are the only texts for which there exist versions composed at every period of the language’s history . However… • biblical translations have barely been used as sources of data in the diachronic study of Spanish. • Reason: lack of medieval biblical versions readily available to philologists and linguists (many such texts have remained unpublished until recently or in hard-to-find editions).

  10. Biblia Medieval • conceived as the first parallel corpus of medieval Spanish • possiblity of consulting and comparing side to side the existing medieval Spanish versions of the Bible • freely accessible online(internet) • various levels of access to the texts (facsimile, paleographic transcription and edition with normalized spelling) • currently: paleographic transcriptions with posibility of accessing facsimile images. • The corpus, as well as other related documents and tools, can be found at www.bibliamedieval.es

  11. Contents of Biblia Medieval

  12. Contents of Biblia Medieval

  13. Contents of Biblia Medieval

  14. Corpus architecture (designed with Mark Davies) (Genesis 9:1-4, E7) E7^1^9^1^1ra^E bendixo dios a noe & a sus fi|jos & dixo les. creçed & multi|plicad & fenchid la tierra E7^1^9^2^1ra^& v<uest>ro temor | & v<uest>ro espanto sera sobre todas las ani|malias dela tierra .& sobre todas las [fol.1r-b] aues del çielo .& sobre todo lo que anda | sobre la tierra. & en todos los pescados | de lamar en v<uest>ras manos lo pongo E7^1^9^3^1rb^todo quanto se mueue sobre la t<ie>rra | que es cosa biua para vos sea para co|mer. com la verdura dela yerua vos | lo do todo E7^1^9^4^1rb^mas carrne con su anima | la ssu sangre non comades

  15. Corpus architecture (Genesis 9:1-4, E7) E7^1^9^1^1ra^E bendixo dios a noe & a sus fi|jos & dixo les. creçed & multi|plicad & fenchid la tierra E7^1^9^2^1ra^& v<uest>ro temor | & v<uest>ro espanto sera sobre todas las ani|malias dela tierra .& sobre todas las [fol.1r-b] aues del çielo .& sobre todo lo que anda | sobre la tierra. & en todos los pescados | de lamar en v<uest>ras manos lo pongo E7^1^9^3^1rb^todo quanto se mueue sobre la t<ie>rra | que es cosa biua para vos sea para co|mer. com la verdura dela yerua vos | lo do todo E7^1^9^4^1rb^mas carrne con su anima | la ssu sangre non comades

  16. Corpus architecture (Genesis 9:2, E7) E7^1^9^2^1ra^& v<uest>ro temor | & v<uest>ro espanto sera sobre todas las ani|malias dela tierra .& sobre todas las [fol.1r-b] aues del çielo .& sobre todo lo que anda | sobre la tierra. & en todos los pescados | de lamar en v<uest>ras manos lo pongo

  17. Corpus architecture (Genesis 9:2, E7) E7^1^9^2^1ra^& v<uest>ro temor | & v<uest>ro espanto sera sobre todas las ani|malias dela tierra .& sobre todas las [fol.1r-b] aues del çielo .& sobre todo lo que anda | sobre la tierra. & en todos los pescados | de lamar en v<uest>ras manos lo pongo manuscript (E7)

  18. Corpus architecture (Genesis 9:2, E7) E7^1^9^2^1ra^& v<uest>ro temor | & v<uest>ro espanto sera sobre todas las ani|malias dela tierra .& sobre todas las [fol.1r-b] aues del çielo .& sobre todo lo que anda | sobre la tierra. & en todos los pescados | de lamar en v<uest>ras manos lo pongo book: (Genesis) manuscript: (E7)

  19. Corpus architecture (Genesis 9:2, E7) E7^1^9^2^1ra^& v<uest>ro temor | & v<uest>ro espanto sera sobre todas las ani|malias dela tierra .& sobre todas las [fol.1r-b] aues del çielo .& sobre todo lo que anda | sobre la tierra. & en todos los pescados | de lamar en v<uest>ras manos lo pongo Chapter and verse: (9:2) book: (Genesis) manuscript: E7

  20. Corpus architecture (Genesis 9:2, E7) E7^1^9^2^1ra^& v<uest>ro temor | & v<uest>ro espanto sera sobre todas las ani|malias dela tierra .& sobre todas las [fol.1r-b] aues del çielo .& sobre todo lo que anda | sobre la tierra. & en todos los pescados | de lamar en v<uest>ras manos lo pongo Location in the manuscript: (folio 1 recto, column a) Chapter and verse: (9:2) book: (Genesis) manuscript: E7

  21. Corpus architecture ampersand (Genesis 9:2, E7) E7^1^9^2^1ra^& v<uest>ro temor | & v<uest>ro espanto sera sobre todas las ani|malias dela tierra .& sobre todas las [fol.1r-b] aues del çielo .& sobre todo lo que anda | sobre la tierra. & en todos los pescados | de lamar en v<uest>ras manos lo pongo

  22. Corpus architecture (Genesis 9:2, E7) E7^1^9^2^1ra^& v<uest>ro temor | & v<uest>ro espanto sera sobre todas las ani|malias dela tierra .& sobre todas las [fol.1r-b] aues del çielo .& sobre todo lo que anda | sobre la tierra. & en todos los pescados | de lamar en v<uest>ras manos lo pongo bars to indicate line breaks

  23. Corpus architecture abbreviations expanded between <> (Genesis 9:2, E7) E7^1^9^2^1ra^&v<uest>ro temor | & v<uest>ro espanto sera sobre todas las ani|malias dela tierra .& sobre todas las [fol.1r-b] aues del çielo .& sobre todo lo que anda | sobre la tierra. & en todos los pescados | de lamar en v<uest>ras manos lo pongo

  24. Corpus Biblia Medieval Interest of Spanish medieval bibles for the diachronic study of Spanish • span over a vast period of time extending: from the first written manifestations of Spanish to the end of the Middle Ages • the Bible encompasses texts of varied textual typology • constitute a corpus of considerable size(6 million words) • the division in chapters and verses facilitates the alignment of the different versions • enormous quantity of associated resources

  25. Corpus Biblia Medieval Interest of Spanish medieval bibles for the diachronic study of Spanish • span over a vast period of time extending: from the first written manifestations of Spanish to the end of the Middle Ages • the Bible encompasses texts of varied textual typology • constitute a corpus of considerable size(6 million words) • the division in chapters and verses facilitates the alignment of the different versions • enormous quantity of associated resources

  26. Corpus Biblia Medieval Interest of Spanish medieval bibles for the diachronic study of Spanish • span over a vast period of time extending: from the first written manifestations of Spanish to the end of the Middle Ages • the Bible encompasses texts of varied textual typology • constitute a corpus of considerable size(6 million words) • the division in chapters and verses facilitates the alignment of the different versions • enormous quantity of associated resources

  27. Corpus Biblia Medieval Interest of Spanish medieval bibles for the diachronic study of Spanish • span over a vast period of time extending: from the first written manifestations of Spanish to the end of the Middle Ages • the Bible encompasses texts of varied textual typology • constitute a corpus of considerable size(6 million words) • the division in chapters and verses facilitates the alignment of the different versions • enormous quantity of associated resources

  28. Corpus Biblia Medieval Interest of Spanish medieval bibles for the diachronic study of Spanish • span over a vast period of time extending: from the first written manifestations of Spanish to the end of the Middle Ages • the Bible encompasses texts of varied textual typology • constitute a corpus of considerable size(6 million words) • the division in chapters and verses facilitates the alignment of the different versions • enormous quantity of associated resources

  29. Corpus Biblia Medieval Interest of Spanish medieval bibles for the diachronic study of Spanish • span over a vast period of time extending: from the first written manifestations of Spanish to the end of the Middle Ages • the Bible encompasses texts of varied textual typology • constitute a corpus of considerable size(6 million words) • the division in chapters and verses facilitates the alignment of the different versions • enormous quantity of associated resources

  30. Translational equivalents of Latin ecce(information on linguistic resources available to translators) Deuteronomy 31:16 [Vulgate] Dixitque Dominus ad Moysen: “ecce tu dormies cum patribus tuis...” [Fazienda] E dixo a Moisén:“e tú izrás con tos parientes...” [E8] Et dixo Dios a Moisén: “evás que tú dormirás con tus padres...” [E3 / Ajuda] E dixo Dios a Muisén: “cata que tú yacerás con tus parientes...” [E4] E dixo el Señor a Moisén: “hete que dormirás con tus padres...” [E7] E dixo el Señor a Muisén: “ya tú vas a yazer con tus parientes...” [E19] E dixo Dios a Muisén: “aquí tú yacerás con tus parientes...” [Alba] Dixo el Señor a Moisén: “sepas que así como tú yoguieres con tus parientes...”

  31. Also, in the biblical corpus it is possible to observe with particular clarity historical changes that entail an increase in textual frequency : • Clitic pronouns in the ten first chapters of the book of Isaiah (5000 words approx.) in versions from different historical periods: • E6 (ca. 1250) 121 • Alba (1420-1433) 139 • Biblia del Oso (1569) 167 • Biblia de Jerusalén (1966) 201

  32. Reflexive Non reflexive Distribution of reflexive and non reflexive pronouns

  33. reflexive non reflexive Distribution of reflexive and non reflexive pronouns

  34. Percentage of article + possessive (as opposed to possessive alone) to translate Latin eius, eorum, illius, illorum according to register

  35. Percentage of article + possessive (as opposed to possessive alone) according to register

  36. Biblia Medieval Applications of Biblia Medieval Philology Linguistics * Discourse traditions * Critical editions * Corpus linguistics * Biblical sources * Historical linguistics Historical morphosyntax * Variation and change Multidisciplinary approaches * Literature, history, medieval culture and thought, religion, judaism, translation studies, etc

  37. Biblia Medieval Corpus and associated resources www.bibliamedieval.es Available in summer 2008

More Related