{"id":72,"date":"2011-04-01T06:47:27","date_gmt":"2011-04-01T06:47:27","guid":{"rendered":"http:\/\/blog.soton.ac.uk\/data\/?p=72"},"modified":"2011-04-01T16:14:18","modified_gmt":"2011-04-01T16:14:18","slug":"pdf-selected-as-interchange-format","status":"publish","type":"post","link":"https:\/\/blog.soton.ac.uk\/data\/2011\/04\/01\/pdf-selected-as-interchange-format\/","title":{"rendered":"[April 1st Gag] PDF selected as Interchange Format"},"content":{"rendered":"<div style=\"float: right; margin-left: 10px;\"><a href=\"https:\/\/twitter.com\/share\" class=\"twitter-share-button\" data-count=\"vertical\" data-url=\"https:\/\/blog.soton.ac.uk\/data\/2011\/04\/01\/pdf-selected-as-interchange-format\/\">Tweet<\/a><\/div>\n<p><img loading=\"lazy\" decoding=\"async\" class=\"alignright\" title=\"Boffin\" src=\"http:\/\/lemur.ecs.soton.ac.uk\/~cjg\/Archive\/Photos\/2011\/cjg-boffin.png\" alt=\"\" width=\"336\" height=\"297\" \/><\/p>\n<p><span style=\"color: #000000;\">The following article is our prank for April 1st. <\/span><\/p>\n<p><span style=\"color: #000000;\">Just to be clear PDF is a dreadful format to exchange data in. It was inspired, in part, by The Register wesbsite running the following picture and quote. Yes, I did say that, but I was talking about research and data communication. <\/span><\/p>\n<p><span style=\"color: #000000;\">It was fun working out how to make our site output PDF versions of the data, and we&#8217;ll leave those as available, but no longer the default. Also, I&#8217;ve now linked in the &#8220;.svg&#8221; format which is basically the same as the PDF.<\/span><\/p>\n<p>Hopefully this gave a few people a chuckle.<\/p>\n<p style=\"text-align: center;\"><span style=\"color: #000000;\">*** *** ***<br \/>\n<\/span><\/p>\n<p>We have had many complaints that RDF is complicated, unsupported and makes it difficult to control how people will reuse your data.<\/p>\n<p>With this in mind, we have taken a big decision: PDF (Portable Document Format) has been selected as our preferred format for exchanging data on the <a href=\"http:\/\/data.southampton.ac.uk\/\">data.southampton.ac.uk<\/a> site.<\/p>\n<p>Many of the data.southampton team felt we should listen to the <a href=\"http:\/\/forums.theregister.co.uk\/forum\/1\/2011\/03\/22\/southampton_linked_data_semantic_web\/\">pro-PDF comments<\/a> on the forum for the recent <a href=\"http:\/\/www.theregister.co.uk\/2011\/03\/22\/southampton_linked_data_semantic_web\/\">Register Article about Open Data in Southampton<\/a>.<\/p>\n<div>PDF is widely recognised as one of the most accessible document  formats available today, and is ideally suited to both the publication and importing of  data because of its ability to accurately maintain the layout of complex  data sets in the browser on the desktop, and via printed hard copy. The  immaturity of the Linked Data community means that there are still  considerable technical overheads involved in the publication and use of  data represented in less well supported formats, such as RDF or XML.<\/div>\n<div><a href=\"http:\/\/blog.soton.ac.uk\/data\/files\/2011\/03\/chart_1.png\"><img loading=\"lazy\" decoding=\"async\" class=\"size-full wp-image-73 alignnone\" title=\"RDF vs PDF\" src=\"http:\/\/blog.soton.ac.uk\/data\/files\/2011\/03\/chart_1.png\" alt=\"\" width=\"600\" height=\"371\" srcset=\"https:\/\/blog.soton.ac.uk\/data\/files\/2011\/03\/chart_1.png 600w, https:\/\/blog.soton.ac.uk\/data\/files\/2011\/03\/chart_1-300x185.png 300w\" sizes=\"auto, (max-width: 600px) 100vw, 600px\" \/><\/a><\/div>\n<div>When we compared the number of search results PDF has when compared with RDF the decision became far easier to justify.<\/div>\n<div>\n<p>Henceforth, the preferred method for both importing and  exporting data from the site will be PDF. We will continue to provide  other formats such as CSV &amp; XML for the time being, but with a clear  goal of removing these options as soon as is practical.<\/p>\n<p>From May 1st onward we will only accept and export data in PDF and  HTML formats. This allows us much more control and flexibility over how  our data is presented. Data providers will be able to supply the  Southampton OpenData team with data via PDF documents, or as printouts  that we can scan and convert to PDF, and we will know exactly how to  deal with it. To make things even easier, people will even be able to use the networked scanners anywhere on campus to directly upload data. Data providers at remote sites will be able to fax their data in.<\/p>\n<\/div>\n<div>As well as PDF, we are also working with owners of  very large databases on an application that will allow them to dump  their data into a view resembling a spreadsheet view; we will then  republish this data via an interface a little like Google Maps. This will allow  users to cast their eye over very large datasets and then zoom in to  data values that look particularly interesting. We hope this will particularly enthuse library staff, as it is bringing a familiar micro-fiche style user interface to the web of open data.<\/div>\n<h3 style=\"margin-top: 1em;\">Extending 4store<\/h3>\n<p>For now, we will be continuing to use 4store as our database server, but we have significantly improved on the default interface by adding a &#8220;PDF&#8221; output mode which users will find familiar.<\/p>\n<p>Examples:<\/p>\n<ul>\n<li><a href=\"http:\/\/sparql.data.southampton.ac.uk\/?query=PREFIX+rdfs%3A+&lt;http%3A%2F%2Fwww.w3.org%2F2000%2F01%2Frdf-schema%23&gt;%0D%0APREFIX+skos%3A+&lt;http%3A%2F%2Fwww.w3.org%2F2004%2F02%2Fskos%2Fcore%23&gt;%0D%0A%0D%0ASELECT+*+WHERE+{+%3Fbuilding+a+&lt;http%3A%2F%2Fvocab.deri.ie%2Frooms%23Building&gt;+%3B%0D%0A+++++++++++++++++++++++++++rdfs%3Alabel+%3Flabel+%3B%0D%0A+++++++++++++++++++++++++++skos%3Anotation+%3Fbuilding_code+}+%0D%0A&amp;output=pdf\">PDF query for a list of University Buildings<\/a><\/li>\n<li><a href=\"http:\/\/sparql.data.southampton.ac.uk\/?query=PREFIX+rdfs%3A+&lt;http%3A%2F%2Fwww.w3.org%2F2000%2F01%2Frdf-schema%23&gt;%0D%0APREFIX+foaf%3A+&lt;http%3A%2F%2Fxmlns.com%2Ffoaf%2F0.1%2F&gt;%0D%0APREFIX+soton%3A+&lt;http%3A%2F%2Fid.southampton.ac.uk%2Fns%2F&gt;%0D%0APREFIX+skos%3A+&lt;http%3A%2F%2Fwww.w3.org%2F2004%2F02%2Fskos%2Fcore%23&gt;%0D%0A%0D%0A%0D%0ASELECT+DISTINCT+%3Fprogramme+%3Ftheme+%3Fprogramme_label+%3Ftheme_label+%3Fmode+%3Fl2+%3Fl2_code+%3Fl2_label+{%0D%0A++%3Fprogramme+a+soton%3AProgramme+%3B%0D%0A++++soton%3AinAcademicSession+&lt;http%3A%2F%2Fid.southampton.ac.uk%2Facademic-session%2F2010-2011&gt;+%3B%0D%0A++++rdfs%3Alabel+%3Fprogramme_label+%3B%0D%0A++++soton%3AbannerProgrammeHasTheme+%3Ftheme+.%0D%0A++%3Ftheme+rdfs%3Alabel+%3Ftheme_label+.%0D%0A++OPTIONAL+{+%3Ftheme+soton%3AbannerModeOfStudy+%3Fmode+.+}%0D%0A++OPTIONAL+{+%0D%0A++%3Ftheme+&lt;http%3A%2F%2Fid.southampton.ac.uk%2Fns%2FbannerShortJACSCode&gt;+%3Fjacs+.%0D%0A+++%3Fl2+skos%3Anarrower+%3Fjacs+.%0D%0A+++%3Fl2+skos%3AinScheme+&lt;http%3A%2F%2Fid.southampton.ac.uk%2Fns%2FJACS-Level-2&gt;+.%0D%0A+++%3Fl2+skos%3AprefLabel+%3Fl2_label+.%0D%0A+++%3Fl2+skos%3Anotation+%3Fl2_code+.+}%0D%0A}%0D%0AORDER+BY+%3Fprogramme_label+%3Ftheme_label%0D%0A&amp;output=pdf\">PDF query for a list of Programmes taught at the University<\/a><\/li>\n<\/ul>\n<p>Our extension will be made available, on request, under an open source license.<\/p>\n<h3>PDF Descriptions of Resources<\/h3>\n<p>Many of the resources in the site will now be available to download as PDF in addition to HTML, just by changing &#8220;.html&#8221; to &#8220;.pdf&#8221;. Look out for the &#8220;Get the data!&#8221; box on many pages which will offer a link to the PDF format.<\/p>\n<ul>\n<li><a href=\"http:\/\/data.southampton.ac.uk\/module\/COMP1004.pdf\">Module described in PDF<\/a><\/li>\n<li><a href=\"http:\/\/data.southampton.ac.uk\/generic-products-and-services\/Alcohol.pdf\">Where to buy booze<\/a> (popular with some students!)<\/li>\n<\/ul>\n<p>Real-time PDF data!<\/p>\n<p>The most valuable data of all is accurate and up to date, and we are now able to do this in a way you&#8217;ve never seen before! We&#8217;ve already created an HTML page for every bus-stop in the city, but that&#8217;s only in HTML format, which is well known to be inferior to PDF.<\/p>\n<ul>\n<li><a href=\"http:\/\/data.southampton.ac.uk\/bus-stop\/SNA19777.html\">http:\/\/data.southampton.ac.uk\/bus-stop\/SNA19777.html<\/a><\/li>\n<\/ul>\n<p>Imagine you&#8217;re at a bus-stop and want to know when the next bus is, now all you need to do is download the following link into your phone and view it in the mobile PDF viewer of your choice, and hey-presto! &#8211; realtime bus data direct to you on your handset!<\/p>\n<ul>\n<li><a href=\"http:\/\/data.southampton.ac.uk\/bus-stop\/SNA19777.pdf\">http:\/\/data.southampton.ac.uk\/bus-stop\/SNA19777.pdf<\/a><\/li>\n<\/ul>\n<h3>Positive Reactions<\/h3>\n<p>So far all the feedback we have had has been massively positive. One user of data.southampton said<\/p>\n<blockquote><p>&#8220;I&#8217;m so glad they have done this, and it&#8217;s easy to switch too, all I needed to do was change a &#8220;R&#8221; to a &#8220;P&#8221; &#8211; simples!&#8221;<\/p><\/blockquote>\n<p>Professor Nigel Shadbolt and Professor Sir Tim Berners-Lee were unavailable to comment as they are currently at the WWW2011 Conference, but we are confident they will have a very strong reaction when they hear about the decision.<\/p>\n","protected":false},"excerpt":{"rendered":"<p>Tweet The following article is our prank for April 1st. Just to be clear PDF is a dreadful format to exchange data in. It was inspired, in part, by The Register wesbsite running the following picture and quote. Yes, I did say that, but I was talking about research and data communication. It was fun [&hellip;]<\/p>\n","protected":false},"author":5,"featured_media":0,"comment_status":"open","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"footnotes":""},"categories":[1],"tags":[],"class_list":["post-72","post","type-post","status-publish","format-standard","hentry","category-uncategorized"],"_links":{"self":[{"href":"https:\/\/blog.soton.ac.uk\/data\/wp-json\/wp\/v2\/posts\/72","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/blog.soton.ac.uk\/data\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/blog.soton.ac.uk\/data\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/blog.soton.ac.uk\/data\/wp-json\/wp\/v2\/users\/5"}],"replies":[{"embeddable":true,"href":"https:\/\/blog.soton.ac.uk\/data\/wp-json\/wp\/v2\/comments?post=72"}],"version-history":[{"count":11,"href":"https:\/\/blog.soton.ac.uk\/data\/wp-json\/wp\/v2\/posts\/72\/revisions"}],"predecessor-version":[{"id":78,"href":"https:\/\/blog.soton.ac.uk\/data\/wp-json\/wp\/v2\/posts\/72\/revisions\/78"}],"wp:attachment":[{"href":"https:\/\/blog.soton.ac.uk\/data\/wp-json\/wp\/v2\/media?parent=72"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/blog.soton.ac.uk\/data\/wp-json\/wp\/v2\/categories?post=72"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/blog.soton.ac.uk\/data\/wp-json\/wp\/v2\/tags?post=72"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}