Digital Media Technology Week 2: XML Basics Peter Verhaar
Recapitulation • Introduction to the course and the topic: digital media technologies • Focus on text • Lifecycle of digital objects • HTML • Web browsers • Elements: <p>, <h1>, <b>, <i> • Images: <img src="[filename]"/> • Hyperlinks: <a href="[url]">[link text]</a>
Recapitulation • Cascading Style Sheets, e.g.: p { color: #660099; } • Client and server: FTP connects client and server
Booktrade Correspondence Project • Application of text encoding • Study of correspondence from the Dutch book trade in the 19th century • Primary materials: • Archive of De Erven F. Bohn • Archive of A.W. Sijthoff
History • Bohn founded in Haarlem in 1752 by Christoph Heinrich Bohn (1719-1784) from Lübeck (Germany) • After 1784: son François Bohn (1751-1819) • 1819: public auction. Name changed to De Erven F. Bohn. • 1875: no more bookselling; 1876: J.K. Tadema • Up to 1900: 55% literature, philology, music, arts, history, geography, travel; 10% school books; 35% general, children’s, theology, science and social science • After 1900: chiefly science and professional • Today: Bohn Stafleu van Loghum
Archive • Moved to Leiden University Library in 1973 • Ca. 10,000 books • Financial administration • Ca. 30,000 letters (Section C) • Contracts, Reviews, Illustrations • Correspondence section is filmed • Complete archive is described in an EAD file
Correspondence • Ca. 30,000 letters • To and from authors, publishers, illustrators • Correspondents include Nicolaas Beets, George Eliot, Ouida, Alexander Macmillan, Lawrence Alma-Tadema
Dear Sirs, I will accept / £10 for the / rights to make a / translation into / Dutch of my / novel entitled / Wanda //
Printers will / send you entire / proofs from London / instantly. Please / to send money / on receipt of this / Address Madame / Ouida. ~c. 2 words illegible~/ ~c. 1 word illegible~ Ouida / L. de la Ramée
Questions • Social network of Bohn • Which book titles are mentioned in the correspondence? • How international was the Dutch book trade in the 19th century? • Who were Bohn’s and Sijthoff’s competitors?
Example of a transcription • Gentlemen, • I reply to your letter of the 29th Ulto, offering 30 £ for an early copy of my late father's forthcoming novel Kenelm Chellengly. I beg to inform you that I have simultaneously received from another Dutch Firm, precisely the same offer, viz. 30 £ for an early copy of that work, with a view to a Dutch translation of it (…). Your obedt. Servt, • Lytton Knebworth Park Stevenage Herts
Encoded text • Gentlemen, • I reply to your letter of the <date>29th Ulto</date>, offering 30 £ for an early copy of my late father's forthcoming novel <title>Kenelm Chellengly</title>. I beg to inform you that I have simultaneously received from another Dutch Firm, precisely the same offer, viz. 30 £ for an early copy of that work, with a view to a Dutch translation of it (…). Your obedt. Servt, • <persName>Lytton</persName> Knebworth Park <placeName>Stevenage</placeName> Herts
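Once names, titles and places are tagged, research questions such as "which book titles are mentioned?" can be answered by a program. The following is a minimal sketch in Python using the standard library; the `<letter>` root element and the `texts` helper are assumptions added for illustration and are not part of the project's encoding.

```python
import xml.etree.ElementTree as ET

# The encoded letter from the slide (abridged), wrapped in a hypothetical
# <letter> root element so that the fragment is a complete XML document.
encoded = (
    "<letter>Gentlemen, I reply to your letter of the <date>29th Ulto</date>, "
    "offering 30 £ for an early copy of my late father's forthcoming novel "
    "<title>Kenelm Chellengly</title>. Your obedt. Servt, "
    "<persName>Lytton</persName> Knebworth Park "
    "<placeName>Stevenage</placeName> Herts</letter>"
)

root = ET.fromstring(encoded)


def texts(tag):
    """Return the text content of every element with the given name."""
    return [element.text for element in root.iter(tag)]


titles = texts("title")
people = texts("persName")
places = texts("placeName")

print("Titles:", titles)
print("Persons:", people)
print("Places:", places)
```

Run over all transcribed letters, the same lookup would yield lists of titles, persons and places without anyone rereading the text.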
XML • eXtensible Markup Language • XML elements situate and describe components • Example: In 1851, Nicolaas Beets published his novel <title>Camera Obscura</title> • Opening tag: <title> • Element content: Camera Obscura • Closing tag: </title>
Attributes • <name type="person">F. Bohn</name> • Property: type • Value: "person" • Example: your letter of the <date value="1873-10-29">29th Ulto</date>
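An attribute such as `value` lets software work with a normalised form while the transcription keeps the wording of the letter. The sketch below, in Python, reads both; the wrapping `<p>` element is an assumption added only to make the fragment a complete document.

```python
import datetime
import xml.etree.ElementTree as ET

# Fragment from the slide, wrapped in a hypothetical <p> root element.
fragment = '<p>your letter of the <date value="1873-10-29">29th Ulto</date></p>'
date_element = ET.fromstring(fragment).find("date")

as_written = date_element.text          # the wording found in the letter
normalised = date_element.get("value")  # the machine-readable attribute value

# The normalised value can be turned into a real date for sorting or filtering.
parsed = datetime.date.fromisoformat(normalised)

# The type attribute of the <name> element is read in the same way.
name_element = ET.fromstring('<name type="person">F. Bohn</name>')
name_type = name_element.get("type")

print(as_written, "|", normalised, "|", parsed.year, "|", name_type)
```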
Document instance: <tei> <text> <salute>Gentlemen,</salute> <body>I reply to your letter of the <date>29th Ulto</date>, offering 30 £ for an early copy of the novel (…)</body> </text> </tei> • Validation rules: DTD or XML Schema
XML • A metalanguage • XML-based languages must follow two sets of rules: • Generic rules for all XML-based languages • Specific rules for the individual markup language (found in a Schema or DTD)
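The difference between the two sets of rules can be sketched in Python. The parser enforces the generic rules; the specific rules are imitated here by a hand-written table, `ALLOWED_CHILDREN`, which is a hypothetical stand-in for a real DTD or XML Schema and not the project's actual schema.

```python
import xml.etree.ElementTree as ET

# Hypothetical language-specific rules: which child elements each element
# may contain. A real project would state these in a DTD or XML Schema.
ALLOWED_CHILDREN = {
    "tei": {"text"},
    "text": {"salute", "body"},
    "salute": set(),
    "body": {"date"},
    "date": set(),
}


def violations(xml_string):
    """List the specific rules that a well-formed document breaks."""
    root = ET.fromstring(xml_string)  # raises ParseError if not well-formed
    problems = []
    for parent in root.iter():
        allowed = ALLOWED_CHILDREN.get(parent.tag)
        if allowed is None:
            problems.append(f"unknown element <{parent.tag}>")
            continue
        for child in parent:
            if child.tag not in allowed:
                problems.append(
                    f"<{child.tag}> not allowed inside <{parent.tag}>"
                )
    return problems


valid_doc = (
    "<tei><text><salute>Gentlemen,</salute><body>I reply to your letter "
    "of the <date>29th Ulto</date></body></text></tei>"
)
# Well-formed, but <salute> appears in a place the rules do not permit.
invalid_doc = "<tei><text><body><salute>Gentlemen,</salute></body></text></tei>"

print(violations(valid_doc))
print(violations(invalid_doc))
```

Both documents pass the generic rules; only the first also satisfies the specific rules in the table.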
Well-formed XML • Each opening tag must have a matching closing tag • Elements must be nested properly • A single root element • Names of elements are case sensitive • Attribute values must be given in quotation marks • An attribute can only be used once in an opening tag
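Each of the rules above can be tried out with an XML parser, which rejects any document that is not well-formed. The following sketch uses Python's standard library; the sample strings and the `is_well_formed` helper are made up for illustration.

```python
import xml.etree.ElementTree as ET


def is_well_formed(xml_string):
    """Return True if the parser accepts the string as well-formed XML."""
    try:
        ET.fromstring(xml_string)
        return True
    except ET.ParseError:
        return False


# One correct sample, followed by one sample per rule that is broken.
samples = {
    "ok": '<name type="person">F. Bohn</name>',
    "missing closing tag": "<title>Camera Obscura",
    "improper nesting": "<p><b>text</p></b>",
    "two root elements": "<a/><b/>",
    "case mismatch": "<Title>Camera Obscura</title>",
    "unquoted attribute value": "<name type=person>F. Bohn</name>",
    "repeated attribute": '<name type="person" type="firm">F. Bohn</name>',
}

results = {label: is_well_formed(xml) for label, xml in samples.items()}

for label, outcome in results.items():
    print(f"{label}: {'well-formed' if outcome else 'not well-formed'}")
```

Web browsers tolerate many of these mistakes in HTML; an XML parser stops at the first one.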
Terminology • Elements • Attributes • Entities • ASCII • Unicode • DTD • Well-formed XML • Valid XML • Meta-language