{"id":258,"date":"2013-04-11T16:28:28","date_gmt":"2013-04-11T16:28:28","guid":{"rendered":"http:\/\/blog.soton.ac.uk\/data\/?p=258"},"modified":"2013-04-11T16:28:28","modified_gmt":"2013-04-11T16:28:28","slug":"vacancies-datase","status":"publish","type":"post","link":"https:\/\/blog.soton.ac.uk\/data\/2013\/04\/11\/vacancies-datase\/","title":{"rendered":"The Vacancies Dataset"},"content":{"rendered":"<div style=\"float: right; margin-left: 10px;\"><a href=\"https:\/\/twitter.com\/share\" class=\"twitter-share-button\" data-count=\"vertical\" data-url=\"https:\/\/blog.soton.ac.uk\/data\/2013\/04\/11\/vacancies-datase\/\">Tweet<\/a><\/div>\n<p>Just recently I&#8217;ve been looking for data we can publish as RDF with minimal effort, and without requiring any access to restricted services or taking up peoples&#8217; time. I came across the University&#8217;s jobs site, <a href=\"http:\/\/www.jobs.soton.ac.uk\/\">jobs.soton.ac.uk<\/a>. It uses a pretty cool system which exports all the vacancies as easily parsable RSS feeds, grouped into sensible categories. We have a feed for each campus, and a feed for each organisational unit of the University, so if a job appears in, for example, the feed for Highfield Campus as well as the feed for Finance, the job is a finance-based job on the Highfield Campus. Because of this, it&#8217;s trivial to write a script that parses all the RSS feeds on the jobs site and produces RDF. So that&#8217;s what I did, and you can see the results in our new <a href=\"http:\/\/id.southampton.ac.uk\/dataset\/vacancies\">Vacancies dataset<\/a>.<\/p>\n<p>Normally when I produce a new dataset I like to provide a clever web tool or search engine to make use of the data, but this time I haven&#8217;t, because the jobs site already does this very well. So why republish the data at all? There are two reasons. Firstly, our colleague at Oxford University, Alexander Dutton, has <a href=\"https:\/\/data.ox.ac.uk\/doc\/dataset\/vacancies\">already done this with Oxford&#8217;s vacancies<\/a>. If we do the same, using the same data format, we&#8217;ve effectively got a standard. If other organisations begin to do the same thing, suddenly the magic of linked open data can happen. The second reason is because now SPARQL queries are possible. They&#8217;re a bit advanced for the layman, but if you were looking, for example, for a job at Southampton General Hospital paying \u00a325K or higher, <a href=\"http:\/\/sparql.data.southampton.ac.uk\/?query=SELECT+%3Ftitle+%3Fpage+%3Fstarting_salary+WHERE+{%0D%0A++++%3Fjob+%3Chttp%3A%2F%2Fwww.w3.org%2F1999%2F02%2F22-rdf-syntax-ns%23type%3E+%3Chttp%3A%2F%2Fpurl.org%2Fopenorg%2Fvacancy%2FVacancy%3E+.%0D%0A++++%3Fjob+%3Chttp%3A%2F%2Fwww.w3.org%2F2000%2F01%2Frdf-schema%23label%3E+%3Ftitle+.%0D%0A++++%3Fjob+%3Chttp%3A%2F%2Fxmlns.com%2Ffoaf%2F0.1%2Fhomepage%3E+%3Fpage+.%0D%0A++++%3Fjob+%3Chttp%3A%2F%2Fxmlns.com%2Ffoaf%2F0.1%2Fbased_near%3E+%3Chttp%3A%2F%2Fid.southampton.ac.uk%2Fsite%2F18%3E+.%0D%0A++++%3Fjob+%3Chttp%3A%2F%2Fpurl.org%2Fopenorg%2Fvacancy%2Fsalary%3E+%3Fsalary+.%0D%0A++++%3Fsalary+%3Chttp%3A%2F%2Fpurl.org%2Fgoodrelations%2Fv1%23hasMinCurrencyValue%3E+%3Fstarting_salary+.%0D%0A++++FILTER+%28%3Fstarting_salary+%3E%3D+25000%29%0D%0A}+LIMIT+10&amp;output=htmltab&amp;jsonp=&amp;show_inline=1#results_table\">you can write a SPARQL query<\/a> that does all the hard work for you, and the same query will work with Oxford&#8217;s data, although obviously you&#8217;ll need to replace the location URI with one of theirs.<\/p>\n<p>Feel free to have a poke around at the data and, as always, if you manage to come up with a cool use for this data &#8211; even just an idea &#8211; then please let me know.<\/p>\n","protected":false},"excerpt":{"rendered":"<p>Tweet Just recently I&#8217;ve been looking for data we can publish as RDF with minimal effort, and without requiring any access to restricted services or taking up peoples&#8217; time. I came across the University&#8217;s jobs site, jobs.soton.ac.uk. It uses a pretty cool system which exports all the vacancies as easily parsable RSS feeds, grouped into [&hellip;]<\/p>\n","protected":false},"author":50970,"featured_media":0,"comment_status":"open","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"footnotes":""},"categories":[1],"tags":[],"class_list":["post-258","post","type-post","status-publish","format-standard","hentry","category-uncategorized"],"_links":{"self":[{"href":"https:\/\/blog.soton.ac.uk\/data\/wp-json\/wp\/v2\/posts\/258","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/blog.soton.ac.uk\/data\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/blog.soton.ac.uk\/data\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/blog.soton.ac.uk\/data\/wp-json\/wp\/v2\/users\/50970"}],"replies":[{"embeddable":true,"href":"https:\/\/blog.soton.ac.uk\/data\/wp-json\/wp\/v2\/comments?post=258"}],"version-history":[{"count":1,"href":"https:\/\/blog.soton.ac.uk\/data\/wp-json\/wp\/v2\/posts\/258\/revisions"}],"predecessor-version":[{"id":259,"href":"https:\/\/blog.soton.ac.uk\/data\/wp-json\/wp\/v2\/posts\/258\/revisions\/259"}],"wp:attachment":[{"href":"https:\/\/blog.soton.ac.uk\/data\/wp-json\/wp\/v2\/media?parent=258"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/blog.soton.ac.uk\/data\/wp-json\/wp\/v2\/categories?post=258"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/blog.soton.ac.uk\/data\/wp-json\/wp\/v2\/tags?post=258"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}