<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	>

<channel>
	<title>Diary of a Repository Preservation Project &#187; 2010 &#187; September</title>
	<atom:link href="http://blog.soton.ac.uk/keepit/2010/09/feed/?withoutcomments=1" rel="self" type="application/rss+xml" />
	<link>http://blog.soton.ac.uk/keepit</link>
	<description>How digital repositories can plan for the future - by four exemplars in science, arts, research and teaching</description>
	<lastBuildDate>Fri, 08 Apr 2011 14:38:28 +0000</lastBuildDate>
	<language>en-US</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>http://wordpress.org/?v=3.4.1</generator>
	<atom:link rel='hub' href='http://blog.soton.ac.uk/keepit/?pushpress=hub'/>
<cloud domain='blog.soton.ac.uk' port='80' path='/keepit/?rsscloud=notify' registerProcedure='' protocol='http-post' />
		<item>
		<title>NECTAR and the Data Asset Framework – final thoughts</title>
		<link>http://blog.soton.ac.uk/keepit/2010/09/30/nectar-and-the-data-asset-framework-%e2%80%93-final-thoughts/</link>
		<comments>http://blog.soton.ac.uk/keepit/2010/09/30/nectar-and-the-data-asset-framework-%e2%80%93-final-thoughts/#comments</comments>
		<pubDate>Thu, 30 Sep 2010 15:20:58 +0000</pubDate>
		<dc:creator>miggie.pickton</dc:creator>
				<category><![CDATA[Uncategorized]]></category>
		<category><![CDATA[DAF]]></category>
		<category><![CDATA[data management]]></category>
		<category><![CDATA[exemplar profiles]]></category>
		<category><![CDATA[NECTAR]]></category>

		<guid isPermaLink="false">http://blog.soton.ac.uk/keepit/?p=1867</guid>
		<description><![CDATA[&#8220;Our DAF project has provided an evidence base for the development of a future research data policy and of services to support research data management.&#8221; Some months ago I blogged about our plans to undertake a DAF project at The University of Northampton. DAF is the Data Asset Framework, which enables universities to audit departmental [...]]]></description>
			<content:encoded><![CDATA[<blockquote><p><a href="http://blog.soton.ac.uk/keepit/files/2009/05/nectar-logo.png"><img class="size-full wp-image-63 alignright" src="http://blog.soton.ac.uk/keepit/files/2009/05/nectar-logo.png" alt="NECTAR logo" width="176" height="171" /></a><strong>&#8220;Our DAF project has provided an evidence base for the development of a future research data policy and of services to support research data management.&#8221;</strong></p></blockquote>
<p>Some months ago I blogged about our <a title="NECTAR and the Data Asset Framework – first thoughts, Diary, February 7, 2010" href="http://blog.soton.ac.uk/keepit/2010/02/07/nectar-and-the-data-asset-framework-first-thoughts/" target="_self">plans</a> to undertake a DAF project at The University of Northampton. <a title="Tag: DAF, Diary, various entries" href="http://blog.soton.ac.uk/keepit/tag/daf/" target="_self">DAF</a> is the Data Asset Framework, which enables universities to audit departmental data collections, awareness, policies and practice for data curation and preservation. There had not previously been any systematic audit of data management practices at the university and it was felt that the DAF methodology would allow us to gather evidence to inform not only preservation planning but also the development of new policy and services to support researchers.</p>
<p>At that stage we had won approval for the project to go ahead and were investigating how we might implement the methodology.</p>
<p>This post describes some of the key challenges and findings of the project we eventually undertook. We conducted both an online survey and a series of interviews. We found that researchers were generally confident in managing their data but held a number of misconceptions regarding the services available to support them in this.</p>
<h3><strong>Staffing</strong></h3>
<p>The biggest obstacle to starting the project was the lack of a project researcher. We had no spare capacity in Information Services but we were aware of several university and local initiatives which aimed to provide graduates and PhD students with the opportunity to gain relevant work experience and new skills. We also had contacts in Information Science departments at other universities and considered inviting a Masters student to undertake the project.</p>
<p>In the end, we were fortunate enough to get two graduate interns from the local Graduate Boost <a href="#fnt1">[1]</a> programme.</p>
<p style="padding-left: 30px"><em>Lesson #1: Making use of graduates from a local scheme was advantageous to both of us. We gained free research assistants, they gained valuable work experience. </em></p>
<p>Each intern was available for four weeks and we wanted to offer both of them the chance to be involved in a range of project activities, including design; data collection; analysis and reporting. Our implementation of DAF therefore had to be designed in two halves and completed within eight weeks.</p>
<h3>Project plan</h3>
<table border="1" cellspacing="0" cellpadding="0" width="574">
<thead>
<tr>
<td width="130" valign="top"><span style="color: #333333"> </span></td>
<td width="444" valign="top"><span style="color: #333333"><strong>Action to be completed by end of week<br />
</strong></span></td>
</tr>
</thead>
<tbody>
<tr>
<td width="130" valign="top"><span style="color: #333333"><strong>Week 1 (14/5/10)<br />
Researcher 1</strong></span></td>
<td width="444" valign="top"><span style="color: #333333"><strong>Researcher 1</strong>: induction/familiarisation with The University of Northampton (UoN).<strong><br />
Introduction to DAF (Stage 1)</strong>: What is it? Why are we doing it? What do we hope to achieve? Who has used DAF before – what can we learn from them? (Contact other institutions if appropriate).  Establish justification and need for this project.<br />
Who are the project stakeholders? What do they want from the project? (Conduct interviews with Project Board; Director of Information Services; Deputy Director (Academic Services) and Director of Knowledge Exchange).<br />
Define scope of project, aims and objectives.<br />
Write up introduction to project.</span></td>
</tr>
<tr>
<td width="130" valign="top"><span style="color: #333333"><strong>Week 2 (21/5/10)<br />
Researcher 1</strong></span></td>
<td width="444" valign="top"><span style="color: #333333"><strong>DAF Stage 2:</strong> find out about research data at UoN.<br />
Conduct interviews with research leaders.<br />
Review methodologies used by other institutions (especially content of surveys); draft survey questionnaire for UoN (based on feedback from stakeholders and experience of others); create survey using Bristol Online Surveys.<br />
Write up methodology – including justification for decisions taken.</span></td>
</tr>
<tr>
<td width="130" valign="top"><span style="color: #333333"><strong>Week 3 (28/5/10)<br />
Researcher 1</strong></span></td>
<td width="444" valign="top"><span style="color: #333333">Conduct pilot survey; evaluate pilot and amend survey as required.<br />
Present progress report to Project Board<em>.<br />
Online survey goes live</em>; promote to research community.<br />
Write up pilot.</span></td>
</tr>
<tr>
<td width="130" valign="top"><span style="color: #333333"><strong>Week 4</strong><strong> (4/6/10)<br />
Researcher 1</strong></span></td>
<td width="444" valign="top"><span style="color: #333333">Monitor survey results; produce data analysis plan; produce draft interview plan (incorporating researcher workflow and data management practices).<br />
Promote survey to research community and identify possible candidates for interview.<br />
Produce briefing sheet for Researcher 2.</span></td>
</tr>
<tr>
<td width="130" valign="top"><span style="color: #333333"><strong>Week 5</strong><strong> (11/6/10)<br />
Researcher 2</strong></span></td>
<td width="444" valign="top"><span style="color: #333333"><strong>Researcher 2</strong>: induction/familiarisation with UoN.<strong><br />
Introduction to DAF </strong>– familiarisation with DAF methodology and experience so far at Northampton (see Researcher 1’s reports).<br />
Monitor ongoing survey results; evaluate plan for data analysis and amend if appropriate; continue promoting survey to research community.<br />
Finalise interview schedule; pilot this and make amendments as necessary.<br />
Write up plan for analysing survey data.<br />
Meet with Project Board.</span></td>
</tr>
<tr>
<td width="130" valign="top"><span style="color: #333333"><strong>Week 6</strong><strong> (18/6/10)<br />
Researcher 2</strong></span></td>
<td width="444" valign="top"><span style="color: #333333"><strong>DAF Stage 3:</strong> identify candidates for interview; conduct interviews; transcribe interviews.<br />
Continue monitoring survey.<em><br />
Online survey closes.</em><br />
Write short report of key themes arising in interviews.</span></td>
</tr>
<tr>
<td width="130" valign="top"><span style="color: #333333"><strong>Week 7 </strong><strong>(25/6/10)<br />
Researcher 2</strong></span></td>
<td width="444" valign="top"><span style="color: #333333">Finish identifying candidates for interview; conducting interviews; transcribing interviews.<br />
Analyse survey results and write up.</span></td>
</tr>
<tr>
<td width="130" valign="top"><span style="color: #333333"><strong>Week 8</strong><strong> (2/7/10)<br />
Researcher 2</strong></span></td>
<td width="444" valign="top"><span style="color: #333333"><strong>DAF Stage 4:</strong> Final reporting and recommendations.<br />
Write up results from interviews; collate with survey results; make recommendations for policy and practice in research data management.<br />
Present final report to Project Board, Director of Information Services, Deputy Director (Academic Services) and Director of the Knowledge Exchange.<br />
</span></td>
</tr>
</tbody>
</table>
<p>In DAF terms, Researcher 1 focussed on Stages 1 (planning) and 2 (identification and classification of data assets) while Researcher 2 took on Stages 3 (assessment of data asset management) and 4 (reporting and recommendations). The project was overseen by a Project Board comprising myself (NECTAR repository manager), the university’s Records Manager and Information Services’ Collections and Learning Resources Manager.</p>
<p>Fortunately, the project went according to plan. The first project researcher had no difficulty completing the desk research, interviewing several research leaders, designing and launching the survey, and writing up progress thus far. The second project researcher had the harder task, since he was tasked with conducting in-depth interviews with researchers, analysing the data from both the survey and the interviews, and producing the first draft of the project report.  To his credit, he succeeded in all of this.</p>
<p style="padding-left: 30px"><em>Lesson #2: The limited availability of project staff and the tight deadlines for the project forced everyone to work quickly and effectively in order to complete the project in time.</em></p>
<h3><strong>Project findings</strong></h3>
<p>The online survey attracted 80 responses – more than we initially expected. No doubt the incentive of a £50 prize draw helped. The survey covered a wide range of issues including the types, sizes and formats of research data held; its ownership; means of storage; security arrangements; sharing and access over the short and long term; and the requirements of funders. The (sixteen) interviews enabled the project team to follow up key findings from the survey and gather additional technical information on specific data objects.</p>
<p style="padding-left: 30px"><em>Lesson #3: The timing of the project, toward the end of the summer term, worked well for us.  Researchers had time to respond to the survey and a good number volunteered for the follow-up interviews. </em></p>
<p>A number of themes emerged. Three generic types of researcher were identified, based on their different needs and behaviours with respect to research data: the research student, the independent researcher and the group researcher/collaborator.</p>
<p>Some common behaviours were identified, for example, researchers overwhelmingly use Microsoft software for creating documents and spreadsheets and so habitually create .doc and .xls file types; similarly, .jpeg is the preferred format for image files. In contrast, there is much greater variation in the file types used for databases, audio and video files. This of course has significant implications for preservation planning.</p>
<p>Data storage needs and behaviours vary throughout the research lifecycle, with different storage devices being prominent at the data collection, analysis and project completion stages. For those that need to share data, a shared server has proved to be effective, but where this is not available, email is most frequently used.</p>
<p>Relatively few Northampton researchers have applied for funding from a body that mandates open access to research data and just over half are interested in a university repository for data (either for their exclusive use or for wider access). Of those that are interested in using a repository, more are interested in storing data for a finite period (say, until the end of the project or for a specified period thereafter) than indefinitely. Although a significant minority of researchers would consider allowing academic colleagues, either at Northampton or elsewhere, to access their data, the majority do not want their research data to be publicly available.</p>
<p>The project highlighted some common problems and concerns:</p>
<ul>
<li>researchers’ data management practices are guided by intuition rather than informed by good practice;</li>
<li>data are sometimes neglected once a project is complete;</li>
<li>there is uncertainty surrounding the ownership of data;</li>
<li>in some cases data are still being collected or stored in out-dated formats;</li>
<li>the university’s shared server space is under-exploited;</li>
<li>researchers are unaware or misinformed of the full range of services available to them.</li>
</ul>
<p>It is impossible to know whether these issues are common to the research community as a whole, but they certainly provide a useful starting point for a dialogue with researchers.</p>
<h3><strong>Reflections</strong></h3>
<p>The DAF project gave us a much greater understanding of the needs and wants of researchers with respect to their research data. It has provided an evidence base for the development of a future research data policy and of services to support research data management. Project recommendations additionally include clarification of data ownership, provision of data management training and guidelines, and promotion of digital preservation.</p>
<p>Although researchers were generally not interested in storing their datasets in the university&#8217;s open access institutional repository (<a href="http://nectar.northampton.ac.uk/">NECTAR</a>), the DAF project has been helpful in confirming that the file types which are stored in NECTAR are representative (albeit as a subset) of the types generally created by researchers. Future preservation planning for the repository will therefore also inform preservation plans for research data.</p>
<p>UPDATE (18 October 2010) The full report on this work is now available: E. Alexogiannopoulos, S. McKenney, and M. Pickton, <a href="http://nectar.northampton.ac.uk/2736/" target="_self">Research Data Management Project: a DAF investigation of research data management practices at The University of Northampton</a>, <span style="font-style: italic">University of Northampton</span>, <span>September 2010</span></p>
<hr /><a name="fnt1"></a>[1] Graduate Boost is designed to provide unemployed graduates with work experience and postgraduate credits towards an MBA qualification. It is managed by 3e, a social enterprise employment agency, on behalf of the Northampton Business School, and is funded by the Higher Education Funding Council for England (HEFCE) and the European Union European Social Fund.</p>
<p>UPDATE (20 Oct. 2010) The full report on the Northampton DAF project is now available: <span>Alexogiannopoulos, E.</span>, <span>McKenney, S.</span> and <span>Pickton, M.</span> (2010) <em><a href="http://nectar.northampton.ac.uk/2736/" target="_self">Research Data Management Project: a DAF investigation of research data management practices at The University of Northampton</a>.</em> University of Northampton, September 2010</p>
]]></content:encoded>
			<wfw:commentRss>http://blog.soton.ac.uk/keepit/2010/09/30/nectar-and-the-data-asset-framework-%e2%80%93-final-thoughts/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>NECTAR and the KeepIt project – reflections</title>
		<link>http://blog.soton.ac.uk/keepit/2010/09/30/nectar-and-the-keepit-project-%e2%80%93-reflections/</link>
		<comments>http://blog.soton.ac.uk/keepit/2010/09/30/nectar-and-the-keepit-project-%e2%80%93-reflections/#comments</comments>
		<pubDate>Thu, 30 Sep 2010 15:20:05 +0000</pubDate>
		<dc:creator>miggie.pickton</dc:creator>
				<category><![CDATA[Uncategorized]]></category>
		<category><![CDATA[exemplar profiles]]></category>
		<category><![CDATA[institutional repositories]]></category>
		<category><![CDATA[KeepIt course]]></category>
		<category><![CDATA[LIFE]]></category>
		<category><![CDATA[NECTAR]]></category>

		<guid isPermaLink="false">http://blog.soton.ac.uk/keepit/?p=1946</guid>
		<description><![CDATA[&#8220;From a starting point of little knowledge and still less understanding, the KeepIt project has demonstrated that digital preservation need not be scary. The secret is to take small steps; to set preservation planning within the bigger institutional context; and to engage stakeholders.&#8221; Near the start of the KeepIt project, Steve Hitchcock, our project manager, asked [...]]]></description>
			<content:encoded><![CDATA[<blockquote><p><em><a href="http://blog.soton.ac.uk/keepit/files/2009/05/nectar-logo.png"><img class="alignright size-full wp-image-63" src="http://blog.soton.ac.uk/keepit/files/2009/05/nectar-logo.png" alt="NECTAR logo" width="176" height="171" /></a><span style="font-style: normal"><strong>&#8220;From a starting point of little knowledge and still less understanding, the KeepIt project has demonstrated that digital preservation need not be scary. The secret is to take small steps; to set preservation planning within the bigger institutional context; and to engage stakeholders.&#8221;</strong></span></em></p></blockquote>
<p>Near the start of the KeepIt project, Steve Hitchcock, our project manager, asked the exemplar repository managers to come up with preservation objectives for each of their repositories. <a title="NECTAR - repository preservation objectives, Diary, September 25, 2009" href="http://blog.soton.ac.uk/keepit/2009/09/25/nectar-repository-preservation-objectives/" target="_self">NECTAR&#8217;s objectives</a> were four-fold:</p>
<ul>
<li><strong>Objective 1</strong>: to define the preservation needs of all file types and formats held in NECTAR (now and in the foreseeable future).</li>
<li><strong>Objective 2:</strong> to have procedures and tools to support the preservation needs identified in objective 1.</li>
<li><strong>Objective 3: </strong>to have documentation to inform and support NECTAR stakeholders.</li>
<li><strong>Objective 4</strong>:<strong> </strong>repository staff and others with collection management responsibility for the IR to receive training and ongoing support.</li>
</ul>
<p>Reflecting now on what we have done in the KeepIt project &#8211;  what we have learned and the benefits we have derived from the project &#8211; it is clear that as far as NECTAR is concerned, we have gone a long way toward achieving our goals.</p>
<p>Early discussions between Steve and the project team identified several areas of common interest. This list, combined with a liberal helping of Steve’s expertise, led to the development of the <a title="Digital preservation tools for repository managers, Diary, December 18, 2009" href="http://blog.soton.ac.uk/keepit/2009/12/18/digital-preservation-training-for-repository-managers/">KeepIt course</a>. Others have written extensively about the course in this blog, so I’ll restrict myself now to the impact which it has had on myself, NECTAR and The University of Northampton.</p>
<p><strong>Module 1</strong>, on organisational issues, introduced us to the Data Asset Framework (DAF) and the AIDA toolkit. Both of these appealed as tools for auditing different aspects of the institution’s state of readiness for preservation. At Northampton, we decided to have a go with the DAF tool and I have <a title="NECTAR and the Data Asset Framework – final thoughts, Diary, September 30, 2010" href="http://blog.soton.ac.uk/keepit/2010/09/30/nectar-and-the-data-asset-framework-–-final-thoughts/" target="_self">blogged already</a> on this. In the NECTAR team we have always prided ourselves on supplying what is most wanted and most benefits our research community and we felt that the DAF methodology would not only provide us with the evidence we would need to produce an appropriate preservation plan, it would also highlight gaps in current service provision and opportunities for improving the university’s research environment. This proved to be the case.</p>
<blockquote><p><strong>&#8220;A major report from our DAF project is shortly to be presented to the university’s Research Committee and nine recommendations will be made.&#8221;</strong></p></blockquote>
<p><strong>Module 2</strong>, which dealt with the costs of preservation, covered a further two models: Keeping Research Data Safe (KRDS2) and LIFE3. With Neil Beagrie we looked at the potential benefits of keeping research data in a repository (both direct and indirect; near term and long term; private and public). In theory, these are indisputable: time saved; transparency; opportunities for re-use and re-purposing of data; increased visibility and citation&#8230; and so on. But in practice there is great resistance among researchers to the idea of putting data into a repository. Our DAF project at Northampton was clear on this. Research datasets contain sensitive information; they represent a lot of effort on the part of researchers; they aren’t designed for communal use – all of these deter researchers from providing open access to their research data.</p>
<p>My colleague, Philip Thornborow, our Collections and Learning Resources Manager, was particularly interested in the LIFE3 model. In Philip&#8217;s words (just after attending module 2):</p>
<p style="padding-left: 30px"><em>We have a lecturer who has located some content in another library that he believes it would be beneficial to digitise. Neither the lecturer nor my university have much experience in estimating the full economic cost of such a project. During the demonstration of LIFE at the [KeepIt] training programme, it became immediately obvious to me that this tool was just what we need. The experience of the British Library and the other case studies they have used provides real data which can be used in the model. </em><em>As we worked through the model it also became obvious that all the assumptions are clearly labelled, so we could vary the values in line with our local situation and gain an idea of whether our proposed bid was viable.<br />
</em></p>
<p style="padding-left: 30px"><em>We are looking forward to testing LIFE, and in particular are interested in testing Brian Hole&#8217;s (LIFE3 project manager) comment that it can be used in reverse, so to speak. In other words, if JISC gave us £x what would we be able to achieve. I am severely averse to wheel reinvention, so any tool that can allow us to stand on the shoulders of giants, as the man in the BL courtyard said, gets my vote.</em></p>
<p>The LIFE3 tool wasn’t at that stage quite ready for public use, so Philip wasn’t able to take advantage of it then. Events have now moved on and his immediate need for the tool has passed. However, should he be required to provide costs for a future digitisation project, he remains interested in using LIFE3.</p>
<p>With a change of venue (from Southampton to London) came a change of emphasis in <strong>Module 3</strong> of the KeepIt course. We started to look at file formats and to consider the significant characteristics of digital objects. Based on the PRONOM list of format risks (ubiquity; support; stability, lossiness etc), Steve asked us to evaluate pairs of file formats (for example, MS Word vs PDF) and to score a point for the winner in each risk category.  We then had to come up with a reason why we might <span style="text-decoration: underline">not</span> choose the higher scoring file format for the repository. The subsequent discussion was fascinating since it clearly demonstrated that not all risks are equal, and that there is no consensus, even among the better informed, on the best file formats to preserve. So perhaps the repository manager can be forgiven for sometimes being uncertain and a little confused?</p>
<p>For the rest of the day we looked at signifcant properties with Stephen Grace and at preservation metadata and provenance with Steve Hitchcock. In the exercises that accompanied these sessions we were once again forced to think hard about the real issues in preservation and how the theory could be put into practice.</p>
<blockquote><p><strong>&#8220;The combination of EPrints and PLATO tools for the first time puts real preservation action within the grasp of repository managers.&#8221;</strong></p></blockquote>
<p><strong>Module 4</strong>, the only two day module, promised to be the one where we got down to the <a title="Getting down to the nitty-gritty: preservation workflow tools, Diary, May 14, 2010" href="http://blog.soton.ac.uk/keepit/2010/05/14/getting-down-to-the-nitty-gritty-preservation-workflow-tools/" target="_self">nitty gritty of practising preservation</a>. Covering the new EPrints storage and preservation apps, and their interaction with the PLATO preservation planning tool, we had two days to learn how to develop a specific preservation plan for our own repositories. The combination of EPrints and PLATO tools for the first time puts real preservation action within the grasp of inexpert repository managers. In a test environment we were able to conduct risk analysis within the repository itself, create and test preservation plans based on our own requirements, and import those plans back into EPrints.</p>
<p>Since module 4, the good folk from EPrints Services have upgraded NECTAR to version 3.2.4 of EPrints and Dave Tarrant has installed the preservation plugins. We now know that nearly all of our files are versions of PDF and MS Word formats, with a sprinkling of XML, plain text and others. None are currently deemed to be at risk. We will now need to monitor this, to be sure that NECTAR content does not fall into a high risk category. At that stage we will find out whether the training provided in module 4 translates into effective action in the repository.</p>
<blockquote><p><strong>&#8220;We now have the knowledge and confidence to implement other preservation tools as needed.&#8221;</strong></p></blockquote>
<p>The fifth and <strong>final module</strong> considered aspects of trust in data management. We looked at tools for assessing trustworthy repositories, specifically TRAC and DRAMBORA. A couple of the course attendees (<a title="Digital Collections Risk Assessment at LSE: Using DRAMBORA, Diary, July 19, 2010 " href="http://blog.soton.ac.uk/keepit/2010/07/19/digital-collections-risk-assessment-at-lse-using-drambora/" target="_self">LSE</a> and <a title="Digital Preservation, Risk Management, and UAL Research Online, Diary, June 11, 2010" href="http://blog.soton.ac.uk/keepit/2010/06/11/digital-preservation-risk-management-and-ual-research-online/" target="_self">UAL</a>) have since adopted DRAMBORA to good effect, and it certainly seems to do the business with regard to identifying, assessing, managing and mitigating risks to the repository. I can see that the use of DRAMBORA would provide more than just a risk assessment for the repository, it would produce considerable evidence to inform other institutional policies and procedures. The drawback is that we&#8217;re told it takes approximately 40 hours of work to complete the tool. Perhaps, like the two institutions above, here at Northampton we can find a way to use the tool more selectively.</p>
<p>Of course the KeepIt course hasn’t provided the only learning opportunity in the project.  The regular meetings between Steve, Dave and the other exemplars have given us the chance to share experiences and to learn from each other. The repository community as a whole has always been mutually supportive – a culture fostered by the huge amount of funding and support from the JISC – so the good relationships between project members and course participants comes as no surprise.</p>
<blockquote><p><strong>&#8220;There is a risk that new knowledge is vested in one person. By involving colleagues I believe that we have broadened both interest in digital preservation and the impact of the project.&#8221;</strong></p></blockquote>
<p>So to return to our progress against the NECTAR objectives&#8230;</p>
<ul>
<li><strong>Objective 1</strong>: <strong>to define the preservation needs of all file types and formats held in NECTAR (now and in the foreseeable future).<br />
</strong>We have identified all file formats in NECTAR and established that none are currently at risk. We have undertaken a university-wide research data project using the DAF methodology. This has not only highlighted the data management practices of researchers, but also supplied us with valuable information about new file types that might be deposited in NECTAR in the future.</li>
<li><strong>Objective 2:</strong> <strong>to have procedures and tools to support the preservation needs identified in objective 1.<br />
</strong>NECTAR has been upgraded to version 3.2.4 and the EPrints preservation plugins has been installed. We now have the knowledge and confidence to implement other preservation tools as needed.</li>
<li><strong>Objective 3: </strong><strong>to have documentation to inform and support NECTAR stakeholders.<br />
</strong>A major report from the DAF project is shortly to be presented to the university’s Research Committee and nine recommendations will be made. These include the creation of a data management policy for the university, clarification of the university’s position on data ownership, implementation of a programme of training on data management, and provision of online advice and guidance. It is hoped that this will raise the profile of records management in the institution and therefore have a benefit well beyond the relatively narrow field of research data.</li>
<li><strong>Objective 4</strong>:<strong> </strong><strong>repository staff and others with collection management responsibility for the IR to receive training and ongoing support.<br />
</strong>The University of Northampton was fortunate in that not only was I, as repository manager, able to attend all five modules of the KeepIt course, but also I was able to bring along colleagues with a professional interest in each area. Thus our Collections and Learning Resources manager attended modules 1 and 2; our metadata specialist participated in module 3 and our NECTAR technical specialist attended module 4. There is a risk in a project such as this that new knowledge is vested in the one person with commitment to the project. By involving colleagues in the course I believe that we have broadened both interest in digital preservation and the impact of the project.<strong><br />
</strong><strong></strong></li>
</ul>
]]></content:encoded>
			<wfw:commentRss>http://blog.soton.ac.uk/keepit/2010/09/30/nectar-and-the-keepit-project-%e2%80%93-reflections/feed/</wfw:commentRss>
		<slash:comments>2</slash:comments>
		</item>
		<item>
		<title>KeepIt course: revision, conclusion and evaluation</title>
		<link>http://blog.soton.ac.uk/keepit/2010/09/30/keepit-course-revision-conclusion-and-evaluation/</link>
		<comments>http://blog.soton.ac.uk/keepit/2010/09/30/keepit-course-revision-conclusion-and-evaluation/#comments</comments>
		<pubDate>Thu, 30 Sep 2010 10:53:40 +0000</pubDate>
		<dc:creator>Steve Hitchcock</dc:creator>
				<category><![CDATA[Uncategorized]]></category>
		<category><![CDATA[KeepIt course]]></category>

		<guid isPermaLink="false">http://blog.soton.ac.uk/keepit/?p=2023</guid>
		<description><![CDATA[KeepIt course module 5, Northampton, 30 March 2010 Tags Find out more about the full KeepIt course Presentations and tutorial exercises course 5 (source files) Using copies of selected slides from the course, the following revision session summarises which tools were used and what we did with them, including a reminder of the practical exercises. Each [...]]]></description>
			<content:encoded><![CDATA[<p><em>KeepIt course module 5, Northampton, 30 March 2010</em><strong><br />
</strong> <strong>Tags</strong> Find out more about the full <a href="http://blog.soton.ac.uk/keepit/tag/keepit-course/" target="_self">KeepIt course</a><strong><br />
</strong> <strong>Presentations and tutorial exercises</strong> <a href="http://eprints.ecs.soton.ac.uk/21031/">course 5</a> (source files)</p>
<p>Using copies of selected slides from the course, the following revision session summarises which tools were used and what we did with them, including a reminder of the practical exercises. Each module and tool is covered using this same basic structure.</p>
<p><object type="application/x-shockwave-flash" data="http://static.slideshare.net/swf/ssplayer2.swf?doc=id=3665042&amp;doc=keepit-course5-revision-100408064656-phpapp02" width="425" height="348"><param name="movie" value="http://static.slideshare.net/swf/ssplayer2.swf?doc=id=3665042&amp;doc=keepit-course5-revision-100408064656-phpapp02" ></object></p>
<p>Our final presentation from the KeepIt course sets out the criteria and context for evaluating the course. Participants attending this last course module were handed evaluation forms after this presentation and allowed quiet time before we enjoyed end-of-course food and drink. The results of this evaluation were summarised in a <a title="Hitchcock, et al., Transforming repositories: from repository managers to institutional data managers, ECS EPrints, 5 May 2010" href="http://eprints.ecs.soton.ac.uk/20986/" target="_self">presentation</a> at the <em><a title="8th European Conference on Digital Archiving, Geneva, 28-30 April 2010" href="http://www.bar.admin.ch/eca2010/index.html?lang=en" target="_self">European Conference on Digital Archiving</a></em> (April 2010).</p>
<p><object type="application/x-shockwave-flash" data="http://static.slideshare.net/swf/ssplayer2.swf?doc=id=3665193&amp;doc=keepit-course5-conclusion-100408071350-phpapp01" width="425" height="348"><param name="movie" value="http://static.slideshare.net/swf/ssplayer2.swf?doc=id=3665193&amp;doc=keepit-course5-conclusion-100408071350-phpapp01" ></object></p>
<p>As the course ends, what can we say about the status of preservation for digital repositories? First we have to recognise that this might depend on which type of repository, or which type of software, is being considered. By Googling a series of questions for each type of repository, but not including the term &#8216;preservation&#8217; in the query, we looked for highly ranked statements by repositories that made reference to preservation.</p>
<p>Based on this simple analysis, we end the course with the following thought: preservation is important for repositories, but needs to be connected with the other roles and activities of a repository if it is to find clear articulation and obtain proper resources.</p>
<p>Many thanks to all our course presenters: Sarah Jones, Harry Gibbs, Ed Pinsent, Neil Beagrie, Brian Hole, Stephen Grace, Gareth Knight, Andreas Rauber, Hannes Kulovits, Dave Tarrant, Adam Field, and Martin Donnelly.</p>
<p>Thanks too to all the participants who joined the course and stuck with it, and those who have followed the course on this blog. You deserve a reward &#8211; treat yourself.</p>
<p>That&#8217;s it; the end.</p>
]]></content:encoded>
			<wfw:commentRss>http://blog.soton.ac.uk/keepit/2010/09/30/keepit-course-revision-conclusion-and-evaluation/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>KeepIt Course 5: DRAMBORA: Risk and Trust and Data Management</title>
		<link>http://blog.soton.ac.uk/keepit/2010/09/29/keepit-course-5-drambora-risk-and-trust-and-data-management/</link>
		<comments>http://blog.soton.ac.uk/keepit/2010/09/29/keepit-course-5-drambora-risk-and-trust-and-data-management/#comments</comments>
		<pubDate>Wed, 29 Sep 2010 19:28:49 +0000</pubDate>
		<dc:creator>Steve Hitchcock</dc:creator>
				<category><![CDATA[Uncategorized]]></category>
		<category><![CDATA[DRAMBORA]]></category>
		<category><![CDATA[KeepIt course]]></category>
		<category><![CDATA[KeepIt course 5]]></category>

		<guid isPermaLink="false">http://blog.soton.ac.uk/keepit/?p=1841</guid>
		<description><![CDATA[KeepIt course module 5, Northampton, 30 March 2010 Tools this module: TRAC, DRAMBORA Tags Find out more about: this module KeepIt course 5, the full KeepIt course Presentations and tutorial exercises course 5 (source files) &#8220;The Digital Repository Audit Method Based On Risk Assessment (DRAMBORA) was developed by the Digital Curation Centre (DCC) and DigitalPreservationEurope (DPE) to [...]]]></description>
			<content:encoded><![CDATA[<p><em>KeepIt course module 5, Northampton, 30 March 2010</em><br />
<strong> Tools</strong> this module: <a href="http://blog.soton.ac.uk/keepit/tag/trac/" target="_self">TRAC</a>, <a href="http://blog.soton.ac.uk/keepit/tag/drambora/" target="_self">DRAMBORA</a><br />
<strong>Tags</strong> Find out more about: this module <a href="http://blog.soton.ac.uk/keepit/tag/keepit-course-5/" target="_self">KeepIt course 5</a>, the full <a href="http://blog.soton.ac.uk/keepit/tag/keepit-course/" target="_self">KeepIt course</a><strong><br />
</strong> <strong>Presentations and tutorial exercises</strong> <a href="http://eprints.ecs.soton.ac.uk/21031/">course 5</a> (source files)</p>
<p>&#8220;The Digital Repository Audit Method Based On Risk Assessment (DRAMBORA) was developed by the Digital Curation Centre (DCC) and DigitalPreservationEurope (DPE) to assist repository management and staff to identify, assess, manage, and mitigate risks.&#8221;</p>
<p>Our expert leader for this session was Martin Donnelly from DCC, who joined us at just a couple of days notice, flying from Edinburgh and hopping into a hire car to reach Northampton before opening the presentation, all in a morning.</p>
<p>Here Martin shows how to apply the DRAMBORA methodology with a risk management exercise, and how to use the online interactive version. The presentation concludes with a view of how DRAMBORA complements DAF, effectively bringing the KeepIt course full circle, because DAF was the first tool we encountered back in course 1. First, we consider the relation between risk and trust.</p>
<p><object type="application/x-shockwave-flash" data="http://static.slideshare.net/swf/ssplayer2.swf?doc=id=3664701&amp;doc=keepit5-dramboranorthampton-given-100408053617-phpapp01" width="425" height="348"><param name="movie" value="http://static.slideshare.net/swf/ssplayer2.swf?doc=id=3664701&amp;doc=keepit5-dramboranorthampton-given-100408053617-phpapp01" ></object></p>
<p>I first came across DRAMBORA following its launch announcement and I registered for a guest login to try it. My initial impression was of a complex and awkward tool, and that wasn&#8217;t just the login process. It seemed to be engineered for perfect archiving scenarios, and expectations of comprehensive use seemed unlikely to be realised by typical repositories, given the estimated 28-40 hours to complete a full audit. It probably didn&#8217;t help that I wasn&#8217;t anticipating or trying to fulfil a real task when exploring the new tool.</p>
<p>I next encountered DRAMBORA in one of the DCC&#8217;s 101 lite courses (like <a title="Digital Curation 101 Lite: How to Manage Research Data, June 16, 2010, Oxford" href="http://www.dcc.ac.uk/training/digital-curation-101/digital-curation-101-lite-oxford" target="_self">this one</a>). It wasn&#8217;t referred to as DRAMBORA, but I recognised the fingerprints. We were given a small group exercise to assess a single risk. Suddenly it became clearer. It is possible to apply the elements of DRAMBORA and build the bigger picture as needed. In other words, fit the tool to the task, not the other way around. DRAMBORA is well structured for this, and the <a title="Ed Fay, Digital Collections Risk Assessment at LSE: Using DRAMBORA, Diary, July 19, 2010 " href="http://blog.soton.ac.uk/keepit/2010/07/19/digital-collections-risk-assessment-at-lse-using-drambora/" target="_self">benefits of this approach have been realised for the institutional repository of the LSE</a>.</p>
<p>What these risk scenarios and exercises reveal is the degree of team building necessary to manage risk associated with digital content produced across large institutions.</p>
<div id="attachment_1923" class="wp-caption alignleft" style="width: 264px"><a href="http://blog.soton.ac.uk/keepit/files/2010/09/drambora-method.png"><img class="size-full wp-image-1923 " src="http://blog.soton.ac.uk/keepit/files/2010/09/drambora-method.png" alt="DRAMBORA method" width="254" height="587" /></a><p class="wp-caption-text">DRAMBORA method</p></div>
<p>The DRAMBORA method is based on discrete phases of (self-)assessment (see illustration left). In comparison with TRAC, we saw that self-assessment is a key feature that differentiates DRAMBORA. This method requires institutional and contextual profiles, and a detailed understanding of a repository&#8217;s activities and &#8216;assets&#8217;, an approach that recalls the work done earlier in the course with <a title="Tag: AIDA, Diary, various entries" href="http://blog.soton.ac.uk/keepit/tag/aida/" target="_self">AIDA</a>. At this point it becomes possible to identify the risks that might impact the repository. These risks then need to be itemised and assessed individually. The intended outcome is a a strategy for managing the risks</p>
<p>So, what is risk, and how is it handled by DRAMBORA?</p>
<ul>
<li>Definition: risks describe challenges or threats that impede the achievement of repository objectives, obstruct activities, and prejudice the continued availability of essential assets.</li>
<li>In DRAMBORA, risks have several attributes: probability, impact, severity (a derived value, p*i), owner(s), and management strategies.</li>
</ul>
<p>What does a risk look like, and how can we elaborate one? A large set of sample risks is provided in the Appendix to the original <a title="DRAMBORA Toolkit Main Text (English Language Version) (Introductory Text Plus Self-audit Toolkit), v1.0 (draft), 28 February 2007" href="http://repositoryaudit.eu/download/" target="_self">paper-based DRAMBORA methodology</a> (registration required).</p>
<p>Slides 42-43 tabulate the anatomy of a risk. To try this out we have an exercise (slides 45-47), designed to take 1h. First, identify one risk (based on your own experiences wherever possible), and complete the <a href="http://eprints.ecs.soton.ac.uk/21031/5/dramboraWorksheet.rtf" target="_self">DRAMBORA worksheet</a> (.rtf format). In part 2, you will identify what steps your repository might take to manage and mitigate the identified risk over time. Or you can learn from <a title="s.meece, Digital Preservation, Risk Management, and UAL Research Online, Diary, June 11, 2010 " href="http://blog.soton.ac.uk/keepit/2010/06/11/digital-preservation-risk-management-and-ual-research-online/" target="_self">one user&#8217;s experience of trying this exercise</a>.</p>
<p>As an alternative to the paper-based process, try DRAMBORA Interactive, which guides users smoothly through the audit process, with built in features for reporting. The presentation provides a step-by-step guide to DRAMBORA Interactive (slides 52-90), or treat this as a revision for the first half of the presentation.</p>
<p>One of the more intriguing features of the Interactive version is snapshot view (slides 88-89). This records the state of a repository at a given moment in time, and can be compared with other audit snapshots to track changes.</p>
<p>DRAMBORA has interesting connections with <a title="Tag: DAF, Diary, various entries" href="http://blog.soton.ac.uk/keepit/tag/daf/" target="_self">DAF, the Data Asset Framework</a>. Both were developed at the DCC and are &#8220;self-management tools to assess the effectiveness of approach to data management or preservation&#8221;. Where DAF emphasises the researcher-data axis, DRAMBORA is more repository-process focussed (slide 94).</p>
<p>Coming soon is the data management tool, a combination of DAF, AIDA, <a title="Tag; LIFE, Diary, various entries" href="http://blog.soton.ac.uk/keepit/tag/life/" target="_self">LIFE</a> and DRAMBORA. The KeepIt course is indebted to them all for framing the course from start to finish, and for providing a rich set of tools for repository managers to tackle the challenge of digital preservation.</p>
<p>DRAMBORA is the final tool to be covered in this KeepIt course. All that remains to complete the course is <a title="KeepIt course: revision, conclusion and evaluation, Diary, September 30, 2010" href="http://blog.soton.ac.uk/keepit/2010/09/30/keepit-course-revision-conclusion-and-evaluation/" target="_self">revision and evaluation</a>.</p>
]]></content:encoded>
			<wfw:commentRss>http://blog.soton.ac.uk/keepit/2010/09/29/keepit-course-5-drambora-risk-and-trust-and-data-management/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>KeepIt course 5: Tools for Assessing Trustworthy Repositories</title>
		<link>http://blog.soton.ac.uk/keepit/2010/09/29/keepit-course-5-tools-for-assessing-trustworthy-repositories/</link>
		<comments>http://blog.soton.ac.uk/keepit/2010/09/29/keepit-course-5-tools-for-assessing-trustworthy-repositories/#comments</comments>
		<pubDate>Wed, 29 Sep 2010 19:11:37 +0000</pubDate>
		<dc:creator>Steve Hitchcock</dc:creator>
				<category><![CDATA[Uncategorized]]></category>
		<category><![CDATA[KeepIt course]]></category>
		<category><![CDATA[KeepIt course 5]]></category>
		<category><![CDATA[TRAC]]></category>
		<category><![CDATA[Trust]]></category>

		<guid isPermaLink="false">http://blog.soton.ac.uk/keepit/?p=1836</guid>
		<description><![CDATA[KeepIt course module 5, Northampton, 30 March 2010 Tools this module: TRAC, DRAMBORA Tags Find out more about: this module KeepIt course 5, the full KeepIt course Presentations and tutorial exercises course 5 (source files) Our primary tools in KeepIt course 5 are TRAC and DRAMBORA. We&#8217;ve seen passing references to these tools in earlier [...]]]></description>
			<content:encoded><![CDATA[<p><em>KeepIt course module 5, Northampton, 30 March 2010</em><br />
<strong> Tools</strong> this module: <a href="http://blog.soton.ac.uk/keepit/tag/trac/" target="_self">TRAC</a>, <a href="http://blog.soton.ac.uk/keepit/tag/drambora/" target="_self">DRAMBORA</a><br />
<strong>Tags</strong> Find out more about: this module <a href="http://blog.soton.ac.uk/keepit/tag/keepit-course-5/" target="_self">KeepIt course 5</a>, the full <a href="http://blog.soton.ac.uk/keepit/tag/keepit-course/" target="_self">KeepIt course</a><strong><br />
</strong> <strong>Presentations and tutorial exercises</strong> <a href="http://eprints.ecs.soton.ac.uk/21031/">course 5</a> (source files)</p>
<p>Our primary tools in KeepIt course 5 are TRAC and DRAMBORA. We&#8217;ve seen passing references to these tools in earlier modules: in course 1 (slides 2-3), course 2 (or maybe not, slide 5; yes we did, slide 6), and course 4 (slide 7). This gives a sense of how integral these tools are to a structured approach to digital preservation. In this module we&#8217;ll find out why, beginning here with TRAC, Trustworthy Repositories Audit and Certification.</p>
<p><object type="application/x-shockwave-flash" data="http://static.slideshare.net/swf/ssplayer2.swf?doc=id=3664564&amp;doc=keepit-course5-trust-100408050615-phpapp02" width="425" height="348"><param name="movie" value="http://static.slideshare.net/swf/ssplayer2.swf?doc=id=3664564&amp;doc=keepit-course5-trust-100408050615-phpapp02" ></object></p>
<p>TRAC was designed by committee (slide 8). Its approach is not unique, and there are similar tools such as the nestor Criteria Catalogue. Since these tools have not fully aligned none stands yet as a full global standard for measuring trust of digital repositories.</p>
<p>As the TRAC name suggests, there are two parts to the process: audit, that is, assessing conformance against a set of criteria; and certification, demonstrating conformance to an independent agent.</p>
<p>As with the <a title="Tag: PREMIS, Diary, various entries" href="http://blog.soton.ac.uk/keepit/tag/premis/" target="_self">PREMIS</a> preservation metadata dictionary covered in course 3, TRAC exhibits a clear and logical structure that quickly becomes apparent on inspection, both in the structure of the 84 individual entities (slide 13) and the structure of the full checklist (slide 14), which is designed to assess three primary areas within TRAC:</p>
<ol>
<li>Organizational Infrastructure</li>
<li>Digital Object Management</li>
<li>Technologies, Technical Infrastructure, Security</li>
</ol>
<p>To get a feel for this structure and apply the approach to some TRAC entries, we devised a short exercise that anyone can try. Using the procedure shown in slide 15, decide whether you can certify your repository, or not, for some given entries. To avoid self-selecting entries designed to show your repository in a glowing light, download and use the <a title="Digital Preservation Tools for Repository Managers 5: Trust, Supplemental Material Microsoft Excel (Randomisable spreadsheet of TRAC criteria)" href="http://eprints.ecs.soton.ac.uk/21031/3/trac%2Dchecklist%2Drandom.xls" target="_self">randomisable spreadsheet</a> which lists all the TRAC entries in order. Notice that column A consists of randomly generated numbers, so if you sort the spreadsheet on this column it will randomise the entries. Before you do this, decide how many entries you wish to apply in the exercise and pre-select this many numbers, from 1-84. These will be the line numbers corresponding to the randomised entries you will use in the exercise.</p>
<p>Not all repositories will need to be formally certified for trustworthiness, which can be an intense and detailed procedure, as illustrated by one of the few published case studies, in this case a major e-journal archive, Portico (slides 18-21). Credit to Portico for revealing this example, which shows an exhaustive process such as TRAC will raise issues even among the best prepared and managed archives today.</p>
<p>Closer to home, one of our KeepIt exemplar repositories, <a title="Tag: eCrystals, Diary, various entries" href="http://blog.soton.ac.uk/keepit/tag/ecrystals/" target="_self">eCrystals</a>, has applied TRAC and reported its initial recommendations:</p>
<ul>
<li>TRAC is open-ended and exploratory, and therefore more suited to repositories with an established long-term archival and preservation mandate.</li>
<li>At the current stage of development of the eCrystals data repository we recommend self-assessment using the DRAMBORA toolkit as an instrument.</li>
<li>The audit process in many ways is more important than actual certification, since it allows repositories to analyse and respond to their archives&#8217; strengths and weaknesses in a systematic fashion.</li>
</ul>
<p>We can test these recommendations immediately, because the <a title="KeepIt Course 5: DRAMBORA: Risk and Trust and Data Management, Diary, September 29, 2010" href="http://blog.soton.ac.uk/keepit/2010/09/29/keepit-course-5-drambora-risk-and-trust-and-data-management/" target="_self">next session</a> will involve an extensive exercise with DRAMBORA.</p>
]]></content:encoded>
			<wfw:commentRss>http://blog.soton.ac.uk/keepit/2010/09/29/keepit-course-5-tools-for-assessing-trustworthy-repositories/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>KeepIt course 5: Trust</title>
		<link>http://blog.soton.ac.uk/keepit/2010/09/29/keepit-course-5-trust/</link>
		<comments>http://blog.soton.ac.uk/keepit/2010/09/29/keepit-course-5-trust/#comments</comments>
		<pubDate>Wed, 29 Sep 2010 18:57:01 +0000</pubDate>
		<dc:creator>Steve Hitchcock</dc:creator>
				<category><![CDATA[Uncategorized]]></category>
		<category><![CDATA[KeepIt course]]></category>
		<category><![CDATA[KeepIt course 5]]></category>
		<category><![CDATA[Trust]]></category>

		<guid isPermaLink="false">http://blog.soton.ac.uk/keepit/?p=1830</guid>
		<description><![CDATA[KeepIt course module 5, Northampton, 30 March 2010 Tools this module: TRAC, DRAMBORA Tags Find out more about: this module KeepIt course 5, the full KeepIt course Presentations and tutorial exercises course 5 (source files) &#8220;Say what you do. Do what you say. Show that you do what you say. &#8220;All of this leads to [...]]]></description>
			<content:encoded><![CDATA[<p><em>KeepIt course module 5, Northampton, 30 March 2010</em><br />
<strong> Tools</strong> this module: <a href="http://blog.soton.ac.uk/keepit/tag/trac/" target="_self">TRAC</a>, <a href="http://blog.soton.ac.uk/keepit/tag/drambora/" target="_self">DRAMBORA</a><br />
<strong>Tags</strong> Find out more about: this module <a href="http://blog.soton.ac.uk/keepit/tag/keepit-course-5/" target="_self">KeepIt course 5</a>, the full <a href="http://blog.soton.ac.uk/keepit/tag/keepit-course/" target="_self">KeepIt course</a><strong><br />
</strong> <strong>Presentations and tutorial exercises</strong> <a href="http://eprints.ecs.soton.ac.uk/21031/">course 5</a> (source files)</p>
<blockquote><p><em>&#8220;Say what you do. Do what you say. Show that you do what you say.</em></p>
<p><em>&#8220;All of this leads to trust.&#8221;</em></p>
<p>Colati and Shreeves, Digital Repository Management Uncovered, Webwise 2010</p></blockquote>
<p>We have reached the final lap of our recap of the KeepIt course on Digital Preservation Tools for Repository Managers. The aim of this recap is to pull together the different resources employed for the course to make it suitable for independent study and use.</p>
<p>As usual, this module introduces readily-available tools through presentations and practicals, and our primary presenter is a one of the developers of the main tool we shall be covering.</p>
<p><object type="application/x-shockwave-flash" data="http://static.slideshare.net/swf/ssplayer2.swf?doc=id=3664501&amp;doc=keepit-course5-intro-100408044938-phpapp01" width="425" height="348"><param name="movie" value="http://static.slideshare.net/swf/ssplayer2.swf?doc=id=3664501&amp;doc=keepit-course5-intro-100408044938-phpapp01" ></object></p>
<p>The topic for this session is trust, and in the context of the course this is last by design, but certainly not least. Colati and Shreeves reveal very succinctly why trust matters.</p>
<p>Most people want to be trusted, but trust takes time to establish. The same applies for repositories, and there is a cost attached. For new repositories, therefore, trust can be an over-emphasised virtue, at least initially. But with maturity comes responsibility, and trust becomes a feature a repository will wish to measure and demonstrate, especially in the context of digital preservation. Any repository manager who has completed and applied this course will want to demonstrate the results and benefits &#8211; they will want to be seen to be trusted.</p>
<p>Trust is also a two-way issue. A repository may want to be trusted by its users; it also want to be able to trust the tools and services it uses.</p>
<p>KeepIt course 5 covers two tools to manage and measure trust, and its counterpart, risk. TRAC, Trusted Repository Audit and Certification, is a checklist of criteria for assessing the degree of trust attained by a repository, and is <a title="KeepIt course 5: Tools for Assessing Trustworthy Repositories, Diary, September 29, 2010" href="http://blog.soton.ac.uk/keepit/2010/09/29/keepit-course-5-tools-for-assessing-trustworthy-repositories/" target="_self">introduced</a> with a short group exercise. Our main tool in this module, DRAMBORA, is <a title="KeepIt Course 5: DRAMBORA: Risk and Trust and Data Management, Diary, September 29, 2010" href="http://blog.soton.ac.uk/keepit/2010/09/29/keepit-course-5-drambora-risk-and-trust-and-data-management/" target="_self">covered</a> more extensively, again, with practical work to the fore.</p>
<p>This module, and the course, completes with a full-course revision session.</p>
]]></content:encoded>
			<wfw:commentRss>http://blog.soton.ac.uk/keepit/2010/09/29/keepit-course-5-trust/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>EdShare – Lessons from KeepIt</title>
		<link>http://blog.soton.ac.uk/keepit/2010/09/26/edshare-%e2%80%93-work-for-keepit/</link>
		<comments>http://blog.soton.ac.uk/keepit/2010/09/26/edshare-%e2%80%93-work-for-keepit/#comments</comments>
		<pubDate>Sun, 26 Sep 2010 14:26:06 +0000</pubDate>
		<dc:creator>d.morris</dc:creator>
				<category><![CDATA[Uncategorized]]></category>
		<category><![CDATA[EdShare]]></category>
		<category><![CDATA[exemplar profiles]]></category>
		<category><![CDATA[KeepIt course]]></category>
		<category><![CDATA[Learning and teaching repositories]]></category>

		<guid isPermaLink="false">http://blog.soton.ac.uk/keepit/?p=1854</guid>
		<description><![CDATA[&#8220;Participation in the KeepIt Project has provided us with an excellent practical grasp of the realities of preservation and brought an apparently enormous and daunting area into the realms of approachable and practical possibilities.&#8221; We have developed an approach which will enable others to make a start and to develop their own routes through preservation [...]]]></description>
			<content:encoded><![CDATA[<blockquote><p><a href="http://www.edshare.soton.ac.uk/"><img class="alignright size-full wp-image-24" src="http://blog.soton.ac.uk/keepit/files/2009/05/edshare_logo.png" alt="EdShare logo" width="200" height="66" /></a>&#8220;Participation in the KeepIt Project has provided us with an excellent practical grasp of the realities of preservation and brought an apparently enormous and daunting area into the realms of approachable and practical possibilities.&#8221;</p></blockquote>
<p>We have developed an approach which will enable others to make a start and to develop their own routes through preservation in their own ways.</p>
<p><a title="Welcome to EdShare" href="http://www.edshare.soton.ac.uk/" target="_self">EdShare</a>, as the institutional learning and teaching repository of the University of Southampton, has grown from an engagement process across the institution, intended both to support educational excellence and a cultural change in approach within the institution.  Our developmental work in the early stages of the JISC-funde<a title="Welcome to the EdSpace project website" href="http://www.edspace.ecs.soton.ac.uk/" target="_self">d EdSpace Project</a> had its roots in early work for elearning led by senior academics and change agents across the University.</p>
<p>At the beginning of the EdSpace Project, we were already in a position to draw on processes similar both to the Data Asset Framework (DAF) approach (an outcome of the School exemplars we had identified and engaged with for our HEAcademy elearning benchmarking work in 2007/08) and the philosophy of Assessing Institutional Digital Assets (AIDA), itself similar in approach to the eMM toolkit approach we had utilised in our work for the whole elearning benchmarking activity.</p>
<p>This work provided additional support for working with both of these tools in module 1 of the innovative and highly successful <a title="Tag: KeepIt course, Diary, various entries" href="http://blog.soton.ac.uk/keepit/tag/keepit-course/" target="_self">KeepIt Course on Digital Preservation Tools for Repository Managers</a>.  This course has constituted a really significant aspect of the work of this project.</p>
<p>Module 2 covered the Keeping Repository Data Safe (KRDS) method and I <a title="Keeping data safe in an educational repository: cost-benefits analysis, Diary, February 22, 2010 " href="http://blog.soton.ac.uk/keepit/2010/02/22/1006/" target="_self">blogged</a> back in February providing an educational resources perspective on this.</p>
<p>Module 3 was a Primer on Preservation Workflow, Formats and Characterisation – the most technical of the training sessions we had covered so far.  Steve Hitchcock’s introductory session made the day both accessible, engaging and interesting.  In this technical area I could begin fully to appreciate both the complexity and scale of the task ahead, as well as the benefit of identifying strong technical specialists available in a large, research intensive university.  Indeed, one of the great advantages of working in such a large institution is the relatively easy access we have here to advice (at the very least, and sometimes a great deal more!) from technical experts at the forefront of their fields.</p>
<blockquote><p><strong>&#8220;One solution is the provision of integrated services and tools linked to specific repository systems. When EdShare upgrades to version 3.2 of EPrints we will be in a position to take advantage of the newly-developed EPrints preservation apps&#8221;</strong></p></blockquote>
<p>Developing a good awareness and understanding of the scope of the file formats, characterisation and associated preservation issues is a key aspect of the repository manager&#8217;s role, but the most cost-effective and sustainable support for these areas must lie with the technical experts and software developers.  Where possible, managers should develop good relationships with these people in their own organisations.  For smaller institutions, or communities of subject-based educational repositories, an alternative solution may lie in collaborating on sourcing technical advice and support as well as benefiting from the future provision of integrated services and tools linked to specific repository systems.  Thus, when EdShare upgrades to version 3.2 of EPrints later in September 2010, we will be in a position to take advantage of the newly-developed EPrints preservation apps to support our ongoing work in this area.  In this way, good practice will be accessible to more people and a wider group in the community will be able to benefit.</p>
<blockquote><p><strong>&#8220;multiple format packages providing visual, audio and written presentations – these are not necessarily the everyday, simple learning resource filetypes we had originally anticipated hosting&#8221;</strong></p></blockquote>
<p>The significant characteristics work for module 3 was also a highly relevant element for EdShare:  The motivation for many educators in using a specific software application or package when creating learning resources is precisely to take advantage of the functionality offered by that specific resource e.g. Camtasia, Adobe Captivate or Adobe Presenter.  Such applications produce sophisticated, multiple format packages specifically to provide clear visual, audio and written presentations for a specific audience – they are not necessarily the everyday, simple learning resource filetypes we had originally anticipated hosting when we launched EdShare.</p>
<div id="attachment_2055" class="wp-caption aligncenter" style="width: 383px"><a href="http://blog.soton.ac.uk/keepit/files/2010/09/edshare-format-example-bazaar.jpg"><img class="size-full wp-image-2055 " src="http://blog.soton.ac.uk/keepit/files/2010/09/edshare-format-example-bazaar.jpg" alt="EdShare handles a wide array of complex file format types, even within single items" width="373" height="450" /></a><p class="wp-caption-text">EdShare handles a wide array of complex file format types, even within single items</p></div>
<p>Nor are these resources straightforward filetype subjects for preservation actions.  In terms of offering distinctive approaches to preservation, EdShare&#8217;s case was that for reasons of the diversity, richness  and variability of filetypes involved, educational repositories (still very much the “overlooked” form of repository compared to research repositories) we provided an excellent exemplar for KeepIt.</p>
<p>In addition, we were interested to explore other aspects of preservation of concern to everyday educators – are they aware of preservation as an issue at all?  If so, at what stage in the cycle of repository content, ingest and curation should these issues be raised?  Are there concerns for preservation in other areas of the repository landscape:  are institutional managers/legal services/senior policy makers concerned about preservation for other reasons?</p>
<blockquote><p><strong>&#8220;The modules neither required nor assumed sophisticated, technical knowledge nor experience. It would be quite unrealistic, particularly in the present climate of financial constraint and staff reductions, to increase the existing and preservational contribution of the manager’s responsibilities&#8221;</strong></p></blockquote>
<p>Module 4 looked at an aspect of these questions by offering an opportunity to develop a preservation plan for the repository.  This, in turn, led to the identification of two additional pieces of work for EdShare:</p>
<ol>
<li>To investigate the typical or atypical nature of the filetype content in EdShare itself.  (I report on this work and its findings in my next blog post.)</li>
<li>To support the upgrade of EdShare’s underlying software to version 3.2 of EPrints; thereby offering the integration of the new EPrints preservation tools.</li>
</ol>
<p>As an early example of an institutional educational repository, which has been developed according to the principles of co-design and collaboration with the local user community, a couple of areas of work which EdShare would have been interested in exploring are:</p>
<ol>
<li>Policy development for preservation, particularly, for educational repositories</li>
<li>Identification of the specific role of the repository manager, educational developers, teachers and other stakeholders (established and emergent) in the preservation process.</li>
</ol>
<blockquote><p><strong>&#8220;By the end of the KeepIt course I had achieved a much higher level of confidence with respect to preservation&#8221;</strong></p></blockquote>
<p>Without a doubt, the KeepIt course modules provide an excellent awareness raising and priming opportunity for the educational repository manager.  The modules neither required nor assumed sophisticated, technical knowledge nor experience. Considerable emphasis was given to the advantages of adopting a team approach whereby either local or host technical expertise is drawn upon to provide the appropriate levels and extent of technical expertise required for reliable, preservation processes.  In practical terms, this was enormously welcome: providing support for the educational repository is only one role in an increasing range of roles I have as a librarian in the University.  It would be quite unrealistic, particularly in the present climate of financial constraint and staff reductions, to increase the existing and preservational contribution of the manager’s responsibilities with a more technical (or any other) role.  There are sufficient aspects in which managers can contribute to preservation work in repositories without having to emphasise the technical element of this work.</p>
<p>By the end of the KeepIt course I had achieved a much higher level of confidence with respect to preservation &#8211; reassured that the technical capabilities of our software would be enormously enhanced with a software upgrade; confident that the affordances of technology could then support the additional aspirations we were developing in terms of supporting an improved awareness of preservation on the part of teachers and policy makers across the University.  Aware that much of what we were already doing as a repository supported an approach which promoted preservation practices and respected the expectations of our content producers that their resources would be cared for and available over the long term for them to access, share and re-mix.</p>
<p>A JISC-sponsored report by Emmerson &#8211; <a title="JISC IE Repository, 18 Feb 2008" href="http://ie-repository.jisc.ac.uk/149/" target="_self">Retention of Learning Materials: A Survey of Institutional Policies and Practice</a> &#8211; had been an early resource we had drawn on in the EdSpace Project &#8211; reflecting both the trend across UK HE of low-awareness levels on the part of institutions to develop and enforce policies and practices, as well as alerting institutions to the potential wastefulness and vulnerability they exposed themselves to in continuing in this way.  This area of work has given rise to a stronger focus on one of the additional areas of work for EdShare as identified above:    to explore and understand the specific institutional concerns of the University of Southampton in the preservation of resources for learning and teaching.  Indeed, this work will align very well with significant ongoing work to develop the “Southampton Learning Environment” – a framework for supporting, delivering and enhancing learning and teaching across the whole University community.  We will need to address policy, practice as well as technical and storage implications for this work as we develop more robust institutional arrangements for this important area of work.</p>
<p>To summarise the impact of KeepIt on EdShare:</p>
<p><strong>What we did</strong></p>
<ol>
<li>Worked as an exemplar among a community of repository managers and explored common areas of interest;</li>
<li>Worked as part of the JISC/HE Academy OER Project Programme – EdShare is developing  instantiations to host the OER Collections of HumBox and SWAPBox</li>
</ol>
<p><strong>What we learned</strong></p>
<ol>
<li>Digital preservation can be simplified by integrating tools and services with existing digital repository interfaces;</li>
<li>Preservation is the responsibility of content creators, teachers and others contributing to the educational process and not just the repository manager;</li>
<li>All the small actions contribute to the process, not just the large scale, ambitious and expensive ones</li>
</ol>
<p><strong>What others can learn</strong></p>
<ol>
<li>Having in place reporting processes for identification of filetypes in a repository is a good starting point for preservation activities.  It seems that many existing educational repositories do not have these processes in place.</li>
<li>Maintaining an awareness of the rapid developments in this field will enable others (not necessarily EPrints users nor experts in preservation) to take advantage of the developments made in the KeepIt project and by the emerging community of practice around preservation work.</li>
</ol>
<p><strong>What we are doing next</strong></p>
<ol>
<li>Upgrading EdShare to version 3.2 of the EPrints software puts us in a good position to take advantage of the work done by technical experts to provide apps which support automated preservation activities.</li>
<li>From this work, we will implement policies and processes to support decisions on format types for preservation actions.</li>
<li>Preservation of learning resources in an institution is only one aspect of the work required for the delivery of retention policies and practices required in a 21st Century University.  We will work to understand more about this area and to put our own institution in a good position to support both its aspirations and obligations.</li>
</ol>
]]></content:encoded>
			<wfw:commentRss>http://blog.soton.ac.uk/keepit/2010/09/26/edshare-%e2%80%93-work-for-keepit/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Connecting Plato with digital repository interfaces: #ipres2010 twitterstream</title>
		<link>http://blog.soton.ac.uk/keepit/2010/09/21/connecting-plato-with-digital-repository-interfaces-ipres2010-twitterstream/</link>
		<comments>http://blog.soton.ac.uk/keepit/2010/09/21/connecting-plato-with-digital-repository-interfaces-ipres2010-twitterstream/#comments</comments>
		<pubDate>Tue, 21 Sep 2010 14:44:23 +0000</pubDate>
		<dc:creator>Steve Hitchcock</dc:creator>
				<category><![CDATA[Uncategorized]]></category>
		<category><![CDATA[EPrints]]></category>
		<category><![CDATA[Plato]]></category>
		<category><![CDATA[preservation planning]]></category>

		<guid isPermaLink="false">http://blog.soton.ac.uk/keepit/?p=1629</guid>
		<description><![CDATA[This is a copy of the live Twitter record (edited to remove duplicates and retweets) for the presentation: Dave Tarrant, Connecting preservation planning and Plato with digital repository interfaces, iPres 2010, Vienna, Tuesday, September 21, 2010 (Day 2) Session 5a Preservation Planning and Evaluation (10:30-11:50) See also the EPrints record for this presentation including the full [...]]]></description>
			<content:encoded><![CDATA[<p><a href="http://blog.soton.ac.uk/keepit/files/2010/09/ipres10-logo.jpeg"><img class="size-full wp-image-1652 alignright" src="http://blog.soton.ac.uk/keepit/files/2010/09/ipres10-logo.jpeg" alt="ipres10-logo" width="107" height="34" /></a>This is a copy of the live Twitter record (edited to remove duplicates and retweets) for the presentation:</p>
<p>Dave Tarrant, <strong>Connecting preservation planning and Plato with digital repository interfaces</strong>, iPres 2010, Vienna, Tuesday, September 21, 2010 (Day 2)<a href="http://www.ifs.tuwien.ac.at/dp/ipres2010/schedule.html#session5A" target="_self"> Session 5a Preservation Planning and Evaluation</a> (10:30-11:50)</p>
<p>See also the <a title="Tarrant, D., Hitchcock, S., Carr, L., Kulovits, H. and Rauber, A. (2010) Connecting preservation planning and Plato with digital repository interfaces, ECS EPrints, U. Southampton " href="http://eprints.ecs.soton.ac.uk/21289/" target="_self">EPrints record for this presentation</a> including the full paper on which this presentation is based and the slides.</p>
<blockquote>
<h3>&#8220;impressive piece of work – excellent talk too&#8221;</h3>
</blockquote>
<p><strong><a href="http://blog.soton.ac.uk/keepit/files/2009/11/polarbear-plato.jpg"><img class="alignleft size-thumbnail wp-image-646" src="http://blog.soton.ac.uk/keepit/files/2009/11/polarbear-plato-150x150.jpg" alt="Plato polar bear logo" width="150" height="150" /></a>timgollins</strong> #Ipres2010 #5a &#8211; @davetaz about to open the session with &#8220;Connecting preservation planning and Plato with digital repository interfaces&#8221;</p>
<p><strong>mariekeguy</strong> #ipres2010 #5a @davetaz (David Tarrant) talking about Connecting preservation planning and Plato with digital repository interfaces</p>
<p><strong>cardcc</strong> @timgollins well you won&#8217;t have time for tweeting if @davetaz sticks with his usual style! #Ipres2010 #5a</p>
<p><strong>euanc</strong> #ipres2010: Session 5a- Preservation planning and evaluation starts. no tweets from @davetaz for the next while</p>
<p><strong>timgollins</strong> #Ipres2010 #5a &#8211; @davetaz &#8211; Biggest Challenge &#8211; using and controlling the plethora of tools 2 Retweets</p>
<p><strong>mariekeguy</strong> #ipres2010 #5a Tarrant: Lots of repository tools out there but repository managers unsure how to use then, need workflow</p>
<p><strong>timgollins</strong> #Ipres2010 #5a &#8211; @davetaz Bit preservation is the start, then Identification, Characterisation, &#8230;. Point is useability 2 Retweets</p>
<p><strong>timgollins</strong> #Ipres2010 #5a &#8211; @davetaz &#8211; gives tweet quote but I cat type fast enough !!</p>
<p><strong>euanc</strong> #ipres2010 #5a @davetaz is telling us to tweet things 140 characters long and then not giving time to type them out, typical!</p>
<p><strong>mariekeguy</strong> #ipres2010 #5a Tarrant: Need to lower the barriers to use of repository tools &#8211; make it fit 140 characters <img src='http://blog.soton.ac.uk/keepit/wp-includes/images/smilies/icon_wink.gif' alt=';-)' class='wp-smiley' /> </p>
<p><strong>timgollins</strong> #Ipres2010 #5a &#8211; @davetaz Talking about e-prints &#8211; already at Torrent speed and speeding up <img src='http://blog.soton.ac.uk/keepit/wp-includes/images/smilies/icon_smile.gif' alt=':)' class='wp-smiley' /> </p>
<p><strong>timgollins</strong> #Ipres2010 #5a &#8211; @davetaz Droid profile of a publication repository &#8211; thanks for the Plug Dave <img src='http://blog.soton.ac.uk/keepit/wp-includes/images/smilies/icon_smile.gif' alt=':)' class='wp-smiley' /> </p>
<p><strong>mariekeguy</strong> Talks faster than me! RT @timgollins: #Ipres2010 #5a &#8211; @davetaz Talking about e-prints &#8211; already at Torrent speed and speeding up <img src='http://blog.soton.ac.uk/keepit/wp-includes/images/smilies/icon_smile.gif' alt=':)' class='wp-smiley' />  1 Retweet</p>
<p><strong>euanc</strong> #ipres2010 #5a @davetaz speaking too fast for my slow brain this morning. impressed by @mariekeguy &amp; @timgollins tweet speet</p>
<p><strong>timgollins</strong> #Ipres2010 #5a &#8211; @davetaz E-prints &#8211; already missed the point about Characterization using JHOVE through Plato</p>
<blockquote>
<h3 style="font-size: 1.17em">&#8220;Plato action plan XML read by Eprints for ‘action’. V.cool.&#8221;</h3>
</blockquote>
<p><strong>euanc</strong> #ipres2010 @davetaz : only get factual information in registries, not institutionally subjective information</p>
<p><strong>timgollins</strong> #Ipres2010 #5a &#8211; @davetaz Now on Risk assessment &#8211; risk is local &#8211; do the risk analysis &#8211; central registries nly have global risk factors</p>
<p><strong>euanc</strong> #ipres2010 we need a tool that identifies intellectual entities that are made up of multiple computer files via indicators to be defined</p>
<p><strong>timgollins</strong> #Ipres2010 #5a &#8211; @davetaz e-prints talking about sampling for characterization and then using Plato for Risk assessment 2 Retweets</p>
<p><strong>euanc</strong> how do you test results of bulk migration? how do you confirm you have preserved whatever you were trying to preserve? #ipres2010</p>
<p><strong>timgollins</strong> #Ipres2010 #5a &#8211; @davetaz &#8211; talking about executable preservation action plans that use linked data registries to identify tools 1 Retweet</p>
<p><strong>timgollins</strong> #Ipres2010 #5a &#8211; @davetaz Plato &#8211; executable action plan contains decisions taken in creating the plan &#8211; preserve the plan in the repository 2 Retweets</p>
<p><strong>timgollins</strong> #Ipres2010 #5a &#8211; @davetaz Trust, authenticity &#8211; declare the plan used to migrate the objects &#8211; transparency/provenance delivers trust ! 2 Retweets</p>
<blockquote>
<h3>&#8220;Good to see KeepIt project enabling Repo managers to really use tools&#8221;</h3>
</blockquote>
<p><strong>euanc </strong>#ipres2010 #5a @davetaz &#8211; quotes from KeepIT exemplars &#8220;Much more time-consuming and complicated&#8221; &#8220;less confident but now knw what it means&#8221; 1 Retweet</p>
<p><strong>mariekeguy</strong> #ipres2010 #5a Tarrant: KeepIt examples &#8211; repository managers realise dig pres not just for the techies &#8211; also part of their role 1 Retweet</p>
<p><strong><a href="http://blog.soton.ac.uk/keepit/files/2009/06/jisccolour15.png"><img class="alignleft size-full wp-image-115" src="http://blog.soton.ac.uk/keepit/files/2009/06/jisccolour15.png" alt="JISC logo" width="99" height="52" /></a>neilgrindley</strong> @davetaz talking about integrating preservation tools. Good to see KeepIt project enabling Repo managers to really use tools.#ipres2010 1 Retweet</p>
<p><strong>mariekeguy</strong> #ipres2010 #5a KeepIt project <a title="KeepIt: Kultur, eCrystals, EdShare (and NECTAR) - Preserve It!" href="http://bit.ly/92Qhf1" target="_self">http://bit.ly/92Qhf1</a> 1 Retweet</p>
<p><strong>timgollins</strong> #Ipres2010 #5a &#8211; @davetaz Presenting the e-prints users response &#8211; impressive piece of work &#8211; excellent talk too <img src='http://blog.soton.ac.uk/keepit/wp-includes/images/smilies/icon_smile.gif' alt=':)' class='wp-smiley' />  2 Retweets</p>
<p><strong><a href="http://www.eprints.org/"><img class="size-thumbnail wp-image-1657 alignright" src="http://blog.soton.ac.uk/keepit/files/2010/09/eprintslogo-150x57.gif" alt="EPrints repository software logo" width="150" height="57" /></a>pjvangarderen</strong> #ipres2010 Tarrant: Plato action plan XML read by Eprints for &#8216;action&#8217;. V.cool. Need to see if this will work for Archivematica. 2 Retweets</p>
<p><strong>jisckeepit</strong> Thanks @timgollins @euanc @mariekeguy for excellent commentary and summary on @davetaz Plato and repositories #ipres2010 #5a</p>
<h3>Questions?</h3>
<p>The short question and answer session that followed the presentation is summarised by the presenter, Dave Tarrant:</p>
<p><strong>Q1</strong>: How did you organise the training courses in KeepIt and what materials did you use.<br />
<strong> A1</strong>: We utilised connections with projects to invite key people to give presentations which were focused on repository managers, we collected the materials and these are available online (url tweeted <a title="KeepIt - repository preservation, School of ECS, University of Southampton" href="http://www.ecs.soton.ac.uk/research/projects/640" target="_self">http://www.ecs.soton.ac.uk/research/projects/640</a>, source materials). Ed. Or try <a title="Tag: KeepIt course, Diary, various entries" href="http://blog.soton.ac.uk/keepit/tag/keepit-course/" target="_self">course blogs</a>, <a title="JISC KeepIt project presentations - the full set" href="http://www.slideshare.net/SteveHitchcock/presentations">course presentations</a> (Slideshare)</p>
<p><strong>Q2</strong>: Is there any format you can&#8217;t handle?<br />
<strong> A2</strong>: In EPrints, no, in the scope of the tools, yes, as the tools do not cover all formats. However in the case of eCrystals they have started this process by <a title="Adding chemistry to a file format registry, Diary, September 16, 2010 " href="http://blog.soton.ac.uk/keepit/2010/09/16/adding-chemistry-to-a-file-format-registry/" target="_self">providing the identification of their files to PRONOM-DROID</a>. You can then follow the workflow presented to look at extracting characteristics and feeding these to a tool or developing a new characterisation tool.</p>
<p><strong>Q3</strong>: I notice in the paper you give the plan an ID (ID/41). Is this a persistent ID?<br />
<strong> A3</strong>: Yes! An eprints persistent ID is assigned to almost everything in the system. I simply removed the first part of the URI to fit it in the 2-column format. EPrints provenance (which uses the Open Provenance Model, OPM) wouldn&#8217;t work without it.</p>
<h3>Summary</h3>
<p>So what can we divine from this? The presentation was fast and furious, in true Tarrant style, but thanks to the tweeters we can see the main points were mostly effectively conveyed and the story well received and complete. An amazing response.</p>
<p><strong>lescarr</strong> Following #ipres2010 where @davetaz presents the EPrints/DROID/Plato integration so quickly that tweets are too long &amp; cumbersome to keep up</p>
]]></content:encoded>
			<wfw:commentRss>http://blog.soton.ac.uk/keepit/2010/09/21/connecting-plato-with-digital-repository-interfaces-ipres2010-twitterstream/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>KeepIt course 4: putting a preservation plan into EPrints</title>
		<link>http://blog.soton.ac.uk/keepit/2010/09/21/keepit-course-4-putting-a-preservation-plan-into-eprints/</link>
		<comments>http://blog.soton.ac.uk/keepit/2010/09/21/keepit-course-4-putting-a-preservation-plan-into-eprints/#comments</comments>
		<pubDate>Tue, 21 Sep 2010 14:28:14 +0000</pubDate>
		<dc:creator>Steve Hitchcock</dc:creator>
				<category><![CDATA[Uncategorized]]></category>
		<category><![CDATA[EPrints]]></category>
		<category><![CDATA[EPrints preservation apps]]></category>
		<category><![CDATA[KeepIt course]]></category>
		<category><![CDATA[KeepIt course 4]]></category>
		<category><![CDATA[Plato]]></category>
		<category><![CDATA[preservation planning]]></category>

		<guid isPermaLink="false">http://blog.soton.ac.uk/keepit/?p=1579</guid>
		<description><![CDATA[KeepIt course module 4, Southampton, 18-19 March 2010 Tools this module: Plato, EPrints preservation apps Tags Find out more about: this module KeepIt course 4, the full KeepIt course Presentations and tutorial exercises course 4 (source files) So far in the practical work in KeepIt course 4 we have managed some storage services from our [...]]]></description>
			<content:encoded><![CDATA[<p><em>KeepIt course module 4, Southampton, 18-19 March 2010</em><strong><br />
</strong> <strong> Tools</strong> this module: <a title="Tag: Plato, Diary, various entries" href="http://blog.soton.ac.uk/keepit/tag/Plato/" target="_self">Plato</a>, <a title="Tag: EPrints preservation apps, Diary, various entries" href="http://blog.soton.ac.uk/keepit/tag/EPrints-preservation-apps/" target="_self">EPrints preservation apps</a><br />
<strong>Tags</strong> Find out more about: this module <a title="Tag: KeepIt course 4, Diary, various entries" href="http://blog.soton.ac.uk/keepit/tag/KeepIt-course-4" target="_self">KeepIt course 4</a>, the full <a title="Tag: KeepIt course, Diary, various entries" href="http://blog.soton.ac.uk/keepit/tag/KeepIt-course" target="_self">KeepIt course</a><strong><br />
</strong> <strong>Presentations and tutorial exercises</strong> <a title="Digital Preservation Tools for Repository Managers 4: Putting storage, format management and preservation planning in the repository, ECS EPrints, 07 May 2010" href="http://eprints.ecs.soton.ac.uk/21029/" target="_self">course 4</a> (source files)</p>
<p>So far in the practical work in KeepIt course 4 we have managed some storage services from our EPrints repository, then deposited some image files in the repository and performed preliminary format identification and risk analysis. We exported those files and used Plato to produce a preservation plan for that format.</p>
<p>These tools were only trivially connected by the export and use of common files. In other words, you could have done these courses on EPrints and Plato independently. Further, since the stated aim of this course module was to put preservation in the repository interface, you might ask how this is achieved when all we have done so far is copy content from the repository and instead used Plato.</p>
<h3><strong>Here is the clever part &#8230;</strong></h3>
<p>and it&#8217;s simple compared with the previous session. Recall the repository interface when we exported our GIF files. There are no presentation slides for this final session in course 4, just <a title="EPrints preservation-preservation action (pdf)" href="http://eprints.ecs.soton.ac.uk/21029/23/Preservation_Plan.pdf" target="_self">instruction sheets</a> for our remaining practical work, so an illustration of this interface is reproduced here.</p>
<p style="text-align: center"><a href="http://blog.soton.ac.uk/keepit/files/2010/09/eprints-high-risk-objects.png"><img class="aligncenter size-full wp-image-1580" src="http://blog.soton.ac.uk/keepit/files/2010/09/eprints-high-risk-objects.png" alt="eprints-high-risk-objects" width="522" height="354" /></a></p>
<p>Under the preservation actions button we used to download and export our GIF files to Plato is another button, to upload a preservation plan. This is the plan we have just produced and saved from Plato, as an XML file. So select that file and with one click we are back in the repository, with a preservation plan. In this case our simple plan chooses to migrate GIF images, considered high risk (hypothetically) to PNGs, our choice of low-risk (hypothetically) image formats.</p>
<p>We can now follow the instruction sheets for this session to complete our work for this module.</p>
<ul>
<li><strong>Actions</strong>: find the original GIFs, upload the plan, view the result.</li>
</ul>
<p>Each image record now contains a migrated version, the PNG, as well as the original GIF.</p>
<p>We can now update our earlier format risk screen, which shows the new PNGs as low risk objects and includes our GIFs in the same category, because they have migrated versions, but with a red bar to indicate they were originally considered to be high risk. Below this screen is shown the list of preservation plans relating to these objects.</p>
<p>A further short exercise demonstrates that when we deposit new objects in this format, we don&#8217;t have to reload the preservation plan. The appropriate plan is recognised and an Enact Plan button appears for our high-risk objects.</p>
<p>The exercise completes with a short demonstration of provenance. As we learned in KeepIt course 3, the <a title="KeepIt course 3: describing significant characteristics and recording provenance, Diary, August 23, 2010" href="http://blog.soton.ac.uk/keepit/2010/08/23/keepit-course-3-describing-significant-characteristics-and-recording-provenance/" target="_self">provenance</a> of an object is a verified record of its past history. Since our image files have been converted we have a provenance record for them. In EPrints this relational information is stored as Linked Data, and in this exercise we learn how to view these relations.</p>
<h3>EPrints-Plato KeepIt course 4 summary</h3>
<p>We said at the outset of KeepIt course 4 that we would put preservation in the repository interface, and we have done this for storage, risk analysis and preservation planning. We believe this is the first time this range of preservation support has been available in a major repository platform.</p>
<p>We also said that while you could do courses on EPrints and Plato separately, this is the only course to combine the two. We have shown here how the value multiplies by combining the different tools.</p>
<p>Versions of this EPrints-Plato course have been given in <a title="Digital Preservation: Logical and bit-stream preservation using Plato, EPrints and the Cloud, ECS EPrints, 27 Sep 2009" href="http://eprints.ecs.soton.ac.uk/17962/" target="_self">Corfu (ECDL, Sept. 2009)</a> and <a title="Conditions for digital preservation, Diary, July 22, 2010" href="http://blog.soton.ac.uk/keepit/2010/07/22/conditions-for-digital-preservation/" target="_self">Madrid (OR, July 2010)</a>, as well as this one in Southampton as part of the KeepIt course, and was just presented for the final time in <a title="iPres 2010, Tutorial 3, overview, Logical and bit-stream preservation: integrated digital preservation using Plato and EPrints" href="http://www.ifs.tuwien.ac.at/dp/ipres2010/tutorials.html#T3" target="_self">Vienna (iPres, Sept. 2010)</a>.</p>
<p>It remains to me to thank our outstanding presenters for these courses: Andreas Rauber and Hannes Kulovits from PLANETS and Vienna University of Technology, and Dave Tarrant and Adam Field from the EPrints team at the University of Southampton.</p>
]]></content:encoded>
			<wfw:commentRss>http://blog.soton.ac.uk/keepit/2010/09/21/keepit-course-4-putting-a-preservation-plan-into-eprints/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>KeepIt course 4: Preservation planning with Plato</title>
		<link>http://blog.soton.ac.uk/keepit/2010/09/21/keepit-course-4-preservation-planning-with-plato/</link>
		<comments>http://blog.soton.ac.uk/keepit/2010/09/21/keepit-course-4-preservation-planning-with-plato/#comments</comments>
		<pubDate>Tue, 21 Sep 2010 14:14:53 +0000</pubDate>
		<dc:creator>Steve Hitchcock</dc:creator>
				<category><![CDATA[Uncategorized]]></category>
		<category><![CDATA[KeepIt course]]></category>
		<category><![CDATA[KeepIt course 4]]></category>
		<category><![CDATA[Plato]]></category>
		<category><![CDATA[preservation planning]]></category>

		<guid isPermaLink="false">http://blog.soton.ac.uk/keepit/?p=1520</guid>
		<description><![CDATA[KeepIt course module 4, Southampton, 18-19 March 2010 Tools this module: Plato, EPrints preservation apps Tags Find out more about: this module KeepIt course 4, the full KeepIt course Presentations and tutorial exercises course 4 (source files) Preservation planning provides a workflow leading to a preservation plan. One of the problems with some approaches to [...]]]></description>
			<content:encoded><![CDATA[<p><em>KeepIt course module 4, Southampton, 18-19 March 2010</em><strong><br />
</strong> <strong> Tools</strong> this module: <a title="Tag: Plato, Diary, various entries" href="http://blog.soton.ac.uk/keepit/tag/Plato/" target="_self">Plato</a>, <a title="Tag: EPrints preservation apps, Diary, various entries" href="http://blog.soton.ac.uk/keepit/tag/EPrints-preservation-apps/" target="_self">EPrints preservation apps</a><br />
<strong>Tags</strong> Find out more about: this module <a title="Tag: KeepIt course 4, Diary, various entries" href="http://blog.soton.ac.uk/keepit/tag/KeepIt-course-4" target="_self">KeepIt course 4</a>, the full <a title="Tag: KeepIt course, Diary, various entries" href="http://blog.soton.ac.uk/keepit/tag/KeepIt-course" target="_self">KeepIt course</a><strong><br />
</strong> <strong>Presentations and tutorial exercises</strong> <a title="Digital Preservation Tools for Repository Managers 4: Putting storage, format management and preservation planning in the repository, ECS EPrints, 07 May 2010" href="http://eprints.ecs.soton.ac.uk/21029/" target="_self">course 4</a> (source files)</p>
<p>Preservation planning provides a workflow leading to a preservation plan. One of the problems with some approaches to digital preservation is they are too proactive and reactive where file formats are concerned. For example, take a file format, decide the consensus is against it (such as Microsoft Office formats), and migrate. There is very little formally to justify this process, just that it is possible and not very difficult. The longer-term consequences of this action are unknown. What was done with good intent may turn out to be detrimental.</p>
<p>Preservation planning seeks to give a more formal basis to such decisions, and in the process will help to automate and record the consequent actions.</p>
<p>For this session in KeepIt course 4 we welcome back Andreas Rauber and Hannes Kulovits from the Vienna University of Technology to provide an extensive presentation on preservation planning using a tool, Plato, which they developed as part of the PLANETS project. This will be followed by further practical work.</p>
<p><object type="application/x-shockwave-flash" data="http://static.slideshare.net/swf/ssplayer2.swf?doc=id=3561816&amp;doc=plato-preservation-planning-keepit-100326064543-phpapp01" width="425" height="348"><param name="movie" value="http://static.slideshare.net/swf/ssplayer2.swf?doc=id=3561816&amp;doc=plato-preservation-planning-keepit-100326064543-phpapp01" ></object></p>
<p>As a formal process, preservation planning requires, unsurprisingly but perhaps disconcertingly, a lot of preparation as it takes account of preservation policies, legal obligations, organisational and technical constraints, and user requirements (slide 8). Fortunately, participants in this course are well positioned to do this, since we have covered many of these issues already in KeepIt courses 1-3.</p>
<p>The preservation planning approach supported by Plato can be overlaid on the OAIS reference model (slide 10), and is shown in more detail in slide 12. Preservation planning with Plato involves four stages:</p>
<ul>
<li>Define requirements (slides 12-33)</li>
<li>Evaluate alternatives (slides 34-45)</li>
<li>Consider results (slides 46-58)</li>
<li>Build preservation plan (an exercise)</li>
</ul>
<p>The reader can explore the Plato workflow using the slides. Here we will highlight some of the critical stages.</p>
<p>In the KeepIt course, and indeed throughout the KeepIt project, we have begun our preservation approach with format identification, but now we go further. We have to relate our identification and other information about our digital objects to our requirements for those objects. This is where our understanding of the significant characteristics of digital objects from KeepIt course 3 becomes useful. Slide 25 shows a mindmap of the sort familiar, again, from course 3, and in slide 26 we encounter the Plato interface, the tree editor, for the first time. Flipping between mindmap and Plato, the following slides show how the requirements are elaborated and values added, illustrating how this information might be mapped to the Plato editor.</p>
<p>Once we have described our objects we want to know what we might do to preserve them. There is typically more than one choice, not just in terms of a preservation action (e.g. format migration) but also in how that action might be performed, and these alternatives need to be evaluated to serve our requirements. If you recall, at the end of the previous session in this course module, we deposited some GIF files in a test repository and then downloaded those files in readiness for this session. In slide 37 we see these files appearing in Plato for the first time. Plato now shows us what alternative actions are available for these files.</p>
<p>Now there is a decision to be made: go/no-go/deferred-go. To make an informed decision we need to run some experiments, that is, to run the alternatives and compare the results before we commit to any plan. Plato helps us to run and evaluate the experiments, in this case on our image files.</p>
<p>Having begun to get some results we could perhaps begin to think we have done the hard work, but there is still a tricky stage to negotiate. Before we can analyse the results we have to transform and weight the measured values from the experiments, that is, to normalise the values so different experiments are measuring the same thing (slide 48-49), and to set the level of importance for each of the factors in our requirements tree (slides 51-52).</p>
<p>Finally, Plato presents our results (slides 55, 57) and we can see the benefit of using this tool.</p>
<h3>Summary and exercise</h3>
<p>Before we start the exercise, slides 60-64 summarise the presentation so far. If there is one conclusion that I would highlight above the others, it is that</p>
<ul>
<li>preservation planning is <em>a basis for well-informed, accountable decisions</em> (slide 64)</li>
</ul>
<p>It is no longer necessary or acceptable to make ad hoc preservation action decisions. This has been a detailed and involved process, but the benefit for a large repository is that the resulting plans can be used across all content in the analysed formats, now and in the future.</p>
<p>Two exercises are set out in slides 66-70. Again, we use our imported GIF files. My impression from observing participants on the course was that this may have been the hardest exercise in the whole KeepIt course, especially exercise 1 which confronts the requirements, and the first encounter with the tools, including the Freemind mindmap tool. There is nothing like a steep learning curve to get the best out of people, and by the end of this session you could hear the sound of pennies dropping.</p>
<p>We now have a preservation plan, and in the next session we will <a title="KeepIt course 4: putting a preservation plan into EPrints, Diary, September 21, 2010" href="http://blog.soton.ac.uk/keepit/2010/09/21/keepit-course-4-putting-a-preservation-plan-into-eprints/" target="_self">put that plan to work in a repository</a>.</p>
]]></content:encoded>
			<wfw:commentRss>http://blog.soton.ac.uk/keepit/2010/09/21/keepit-course-4-preservation-planning-with-plato/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
	</channel>
</rss>
