270 likes | 374 Views
Data linking with kblog. Phillip Lord Newcastle University. The Long Tail. http://en.wikipedia.org/wiki/File:La_Palmyre_041-crop.jpg. Example Data. ID_REF VALUE 1007_s_at 2.867330709 1053_at 10.50302152 117_at 2.702517066 121_at 3.052316166 1255_g_at 2.278998026
E N D
Data linking with kblog Phillip Lord Newcastle University
The Long Tail http://en.wikipedia.org/wiki/File:La_Palmyre_041-crop.jpg
Example Data ID_REF VALUE 1007_s_at 2.867330709 1053_at 10.50302152 117_at 2.702517066 121_at 3.052316166 1255_g_at 2.278998026 1294_at 5.360226024 1316_at 5.496447322 1320_at 4.475412175 1405_i_at 2.301359647
Example Data ID_REF VALUE 1007_s_at 2.867330709 1053_at 10.50302152 117_at 2.702517066 121_at 3.052316166 1255_g_at 2.278998026 1294_at 5.360226024 1316_at 5.496447322 1320_at 4.475412175 1405_i_at 2.301359647
Example Data ID_REF VALUE 1007_s_at 2.867330709 1053_at 10.50302152 117_at 2.702517066 121_at 3.052316166 1255_g_at 2.278998026 1294_at 5.360226024 1316_at 5.496447322 1320_at 4.475412175 1405_i_at 2.301359647
Example Data ID_REF VALUE 1007_s_at 2.867330709 1053_at 10.50302152 117_at 2.702517066 121_at 3.052316166 1255_g_at 2.278998026 1294_at 5.360226024 1316_at 5.496447322 1320_at 4.475412175 1405_i_at 2.301359647
The problem? http://en.wikipedia.org/wiki/File:Clock_in_Kings_Cross.jpg
The problem? http://en.wikipedia.org/wiki/File:Clock_in_Kings_Cross.jpg • http://en.wikipedia.org/wiki/File:New_British_Coinage_2008.jpg
The problem? http://en.wikipedia.org/wiki/File:Clock_in_Kings_Cross.jpg • http://en.wikipedia.org/wiki/File:New_British_Coinage_2008.jpg
Coach Building • 250,000 articles per year • 240 million Downloads • Cost: 1.5 Billion Euro • Elsevier • 17 million articles • > 20 languages • 365 million readers • Total Cost: 10 million dollars • Wikipedia http://commons.wikimedia.org/wiki/File:Hackney-coach,_about_1680.png
Wordpress • Has one critical feature • It has an edit dialog • Word • Latex • Open Office • Asciidoc • Textile • Markdown • By email
Features • Reviewing • Metadata – coins, metatags * • Crawlability * • Multiple authors • Archiving (UKWA) • Searchability
Features • Bi-directional links • Permalinks (purls to follow) • DOIs (datacite!) • Versioning • Extensibility • Nice maths * (and mathjax) • Syntax Highlighting • Bibliographic Support (with DOIs, and incompletely CiTO) * • ePUB and PDF (!?) export
Data Linking • Bi-directional links require support at both ends • Adding this generically • Adding this for specific data sets (microarray) • Data linking into papers
Old technology • Most of this technology pre-exists • So why don’t people use it! • There is a good reason... • TECHNOLOGY IS BORING
Content • http://ontogenesis.knowledgeblog.org • Now has 15k page views (not hits!) • 25 articles, multiple authors • Seeking pubmed inclusion • Advertising: two blog articles about ontogenesis happened with 1 day of first article. • http://taverna.knowledgeblog.org • 10 articles • About scientific workflows • Supplement to myExperiment
Well... • These stats are not going to scare either Elsevier or Wikipedia • But, they are not bad either • And it allows primary scientific content of many different forms • We believe it can form part of the scientific landscape
Acknowledgements Phillip Lord (me!) Dan Swan Simon Cockell Robert Stevens (Manchester) Georgina Moulton (Manchester) Thanks also to JISC, David Shotton, BL, Datacite, and WordPress.