Skip to content

Southampton Data Blog
Just another ECS Blogs site

Shortlisted for a Times Higher Education Award

Some very exciting news. I’m proud to say that Southampton have been short-listed for the Times Higher Education awards for “Outstanding ICT Initiative of the year” and the submission was for our work with data.southampton.ac.uk!

This may involve me having to wear a dinner jacket, which may get a chuckle from people who know my usual, er, style.

While I’ve worked very hard on the open data service, none of it would be possible without the help of dozens of people from all around the University, so it really is an award to the whole university. That said, I’m hoping I’m the one who gets the tasty dinner!

 

Posted in Uncategorized.

Unplanned Downtime

Some of the data.southampton.ac.uk related services have been unavailable this morning due to an unplanned power cut.

Sorry about that.

As the service becomes more important to the University, it’s clear that we need to make sure it’s as robust as possible, and reduce the risk of incidents like this in future.

Posted in Problems & Fixes.

We are hiring!

This is very exciting news.

The university has created a full time postion (initially 2 year fixed term) for data.southampton! This will involve taking the system towards maturity and “business as usual”. It’ll involve working closely with myself and Patrick.

I’m hoping to get someone enthusiastic about the technology and way it can improve how we all work, but with different skills to Patrick and I. My ideal candidate is the type of person who enjoys doing all the fiddling required to build a really good software package release. Part of the goal is to make open data not only practical for other organisations but actually easy.

I’m really chuffed that our university thinks its worth investing in Linked Data as infrastructure, not just as a research area.

Location: Highfield Campus
Salary: £27,578 to £33,884
Full Time Fixed Term
Closing Date: Sunday 19 August 2012
Interview Date: To be confirmed
Reference: 146112JF

More Information

Job Description and Person Specification [Word Document] — we will use the person specification to determine who gets the job, I anticipate we may know, or even be friends with, some of the applicants, so judging everybody by the person spec. helps keep it fair.

Posted in Uncategorized.

Easting & Northing

We’ve added a new dataset which adds Ordnance Survey style Easting and Northing data to everything which currently has a latitude and longitude (but only for items for which we are authoritative – University Buildings but not Bus-stops, basically.

If you get data from, say, http://data.southampton.ac.uk/building/59.rdf it now has Easting and Northing data in. I nicked the pattern from Ordinance Survey Postcode data documents.

Maybe this is useful, let us know if it is.

See the post on the Webteam Blog for the nitty gritty about how this works.

Posted in Datasets, New Data.

Launch of new University events calendar

The new University events calendar is now live and accessible via the current link http://www.events.soton.ac.uk/

The calendar was developed in conjunction with Electronics and Computer Sciences using Open Data as its foundation. It is automatically populated with events via RSS feeds from existing University websites, so minimal maintenance is required. Where a website does not have an RSS feed, staff can upload events manually via a SharePoint form http://www.southampton.ac.uk/submitevent/ This page also contains  a list of the feeds currently used.

For queries of further information please email digital@soton.ac.ukn

Under the hood…

The data is aggregated once per hour into RDF from an assortment of RSS feeds, and a few stray websites, and a Sharepoint Calendar, then presented as a pretty javascript driven website.

The data is also uploaded every hour to the open data service, get the data.

The source code is available from github, and was paid for by the University Communications department, and mostly built by Colin Williams with some support from me.

Joining it up

The website uses the open data service data to let you filter by campus (it links building number to campus number to campus name), and filter by divisions of the university, get the name of buildings from their number and the homepages of the schools and faculties.

Thing is… the open data about university division homepages has not been maintained since we created the list a year ago. It was still mostly correct but some had moved and a many divisions had been created, or merged and so forth.

The exciting thing is that there’s now value to the comms dept. to maintain this information as it provides them value. This may sound minor, but it means that there’s an incentive to the “right people” to maintain this data, and that’s always been part of the model we’ve been striving for!

There’s still a lot of missing features; rss, ical etc. We’re working on that.

Posted in New Data.

data.ac.uk and some things to read

The really exciting news is that we’ve just registered data.ac.uk to act as a home for UK-wide data projects. If you want to contribute ideas, join the data.ac.uk mailing list.

This week is also the last chance to respond to the UK Government Consultation on Open Stanards. Large companies have reportedly been pushing their agenda in this consultation, but anybody is allowed to voice their support, objections or suggestions to the proposals.

I’ve published a couple of relevant blog posts on the webteam blog:

There’s also been two recent blog posts from peer-projects I’d like to recommend:

Posted in Uncategorized.

Study of QR Codes in Southampton Bus Stops

Last year, an MSc student called Nick Gammer worked with data.southampton to study the value of QR Codes at bus-stops (linking to real-time data). There’s a few surprises in the results,

The key findings were:

The QR code trial was successful with increasing use over the life of the project.

Unprompted respondent comments were largely positive. The time trend of daily hit counts shows the project gained popularity over the time it was in operation, especially given book-marked hits were not recorded.

QR Code use was unaffected by the day of the week

There was little variation in average daily use and a One-Way ANOVA test confirmed no significant difference in average daily hits.

The QR code system was likely to have been beneficial to both regular, familiar, commuters and irregular, unfamiliar leisure travellers

Use was very consistent between weekdays/weekends and peak/inter-peak times.  Also, survey results asking respondents whether they would be more likely to use the service on a familiar or unfamiliar route were reasonably evenly split (57% to 43% respectively). Furthermore, there was no significant difference in the change of acceptability of wait time between the two groups.

Scanning as opposed to entering the URL in a mobile browser was the preferred access method

Only 0.6% of hits recorded were through typing a URL

It is not beneficial to provide instructions on, or promotion of QR code use as there is no effect on uptake

Use of basic posters was higher in terms of absolute hits and footfall adjusted hits, however t-tests revealed not significantly greater for either data set.

It appears QR code posters placed at stops without a shelter receive higher QR code use

The average hit rate at stops without a shelter was substantially higher (28% greater usage) however due to insufficient without shelter stop numbers and footfall data this could not be formally tested.

There is substantial variation in use by area with the urban, university and interchange areas displaying much greater use that suburban areas.

This is true for absolute average hits per stop in given areas and even more pronounced when data is adjusted for footfall.

The presence of a display does not effect QR code use

Surprisingly the average footfall adjusted hit rate was not significantly higher for stops without a functioning display giving bus arrival times. This is supported by street survey data as respondents did not find wait time significantly more acceptable due to QR code use when a display was not available.

Use of the existing SMS arrival time service is low and could be redundant

None of the 67 street survey respondents used this service suggesting potential for replacement by essentially free and often real-time QR code provision.

The QR code system was easy to use

Eighty one percent of respondents found the system either very or quite easy to use.
Easy of use was not significantly influenced by smartphone ownership, however a significantly higher proportion of respondents aged over 40 found it more challenging.

Observed behavioural change was limited; the majority was in the form of utilisation of wait time.

Due to the methodology and high service frequencies the observed behavioural change was lower than previous studies. The only observed modifications were utilisation of wait time or going to a different stop. Further research is required.

System accuracy and reliability was adequate

Eighty eight percent of respondents believed the difference between their estimated and actual bus arrival times were less than the crucial 5 minutes. There were no system errors during demonstrations or from mobile survey comments and no vandalism occurred.

Arrival time provision through QR code use substantially improved the acceptability of wait time

Sixty five percent of respondents stated receiving wait times make their wait either much or a little more acceptable. There was no significant difference in the change of acceptability of wait time between respondents at stops with and without a display, frequent and infrequent travellers or passengers faced with a short and long wait.

QR code use lead to a valuable increase in feelings of safety

Thirty nine percent of respondents, mostly interviewed during daylight hours, felt safer after receiving wait times. A significantly higher proportion of female respondents exhibited positive changes in feelings of safety.

Potential patronage increases appear large although should be treated with caution

Fifty six percent of respondents stated they would be either a little or a lot more likely to use a bus as a result of QR code use. Previous studies reveal this is likely to substantially over-represent any resulting patronage change.

Knowledge of the difference between real-time and timetable arrival information is very poor and there could be substantial benefits from improving this knowledge

Only 10% of respondents knew the difference between timetable and real-time arrival times. Confidence in the system could potentially be greatly improved by informing passengers which times can be relied upon; confidence is an essential prerequisite for reaping the benefits RTI provision.

You can view the full report here: “An appraisal of QR code use to deliver bus arrival time information at bus stops in Southampton”

Posted in Data Consumers, Research.

New and Automated Datasets

We’ve got two new datasets:

University Events Diary – this is aggregated once an hour from all over the university website and various other sources. It is going to be used to build a new university events website. Until then you can see the events on the pages for Faculties, Academic Units and Buildings. It (mostly) contains events which are open to the public.

SUSU Events – this is the same format, but contains events mostly restricted to our students. It’s pulled from their facebook page using the facebook API. Let me know if you want a copy of that script.

The Organisation Structure Dataset now pulls the structure and names from the university HR database every Sunday. The buildings occupied and homepages are still being hand maintained by me.

Posted in New Data.

Data.Southampton Talk at SWIB2011

This is a video & slides from a talk I gave at SWIB2011.

Posted in Slides, Video.

Return on Investment

This is something I’ve been banging on about all year. We all tell people that open data is a Good Thing, but we don’t really provide any rewards for it. Search engines do — if you mark up your pages maybe they will do something with your vocab.org content, maybe not.

RSS feeds also provide an immediate and understandable ROI, maybe less than they once did, but people see their data getting used, and can play with it in tools from the moment it goes live.

I’ve been thinking that what makes a lot of sense is to start creating little widgets and websites that provide value on top of RDF open data. To this end I’ve created a little service which provides a building location lookup for your website, given an RDF document containing a list of your buildings. The code is available on request but is still a bit hacky. I’ve some ideas to make it really exendable and do some stuff to make it easier to customise. It’s written using ARC2, Graphite and some javascript.

I actually thing the demonstrations below are a really big deal. It’s one tool which can provide a useful service on top of the RDF from three different organisations. This is what I want the future to look like! These can be embedded in your own pages in an iframe, but it’s a bit icky loading 4 google map iframes in one page.

Also get in touch, maybe via the comments, if you would like to publish open RDF data about your organisation’s places. It’s dead easy and all you need is a UNIXish (Linux or OSX) computer, a free tool (Grinder), and my example configuration files for places in grinder. Given that you can edit your list of buildings (and sites, and rooms…) in a spreadsheet and turn it into RDF data in minutes. Cheap, easy, and with an imediate Return On Investment. This is how Linked Data should be.

I’m even thinking about making a Grinder web service, so you just publish a speadsheet, and a configuration file and we spit out RDF for you…

Posted in Data Consumers, Geo, New Tools.