British Library will preserve the net for posterity
By Steve Ranger
Published: 8 June 2006 09:00 GMT
A new national library that will preserve UK web content forever is being developed by the British Library.
The new National Digital Library will store everything from digitised versions of centuries-old manuscripts to digital journals and web archives and will hoover up 300 terabytes of data in the next five years.
The British Library has a collection of 150 million items such as books and manuscripts, and is already collecting digital content. And this is going to gather pace when the Legal Deposit Libraries Act 2003 comes into force, probably by 2008.
As Roderic Parker, communications officer for the British Library's Digital Object Management Programme explained: "When that comes into effect the publishers of electronic journals and books and theses published in the UK will be obliged to deposit copies with us - the British Library has the right to receive everything. That's the way it will work with the digital stuff as well."
The library has already been running a voluntary deposit scheme where UK publishers of digital content can hand over their content, and while some people might question the value of some of the content on the web, Parker said even the most lowbrow publication is important.
He said: "Some of it is important from the point of cultural history, just as much as a pamphlet from an 18th century election; it tells you how they thought at the time. It's not our job as a library to say that things are ephemeral."
And while paper can rot, digital materials have different problems - such as obsolescence of the hardware or software used to access them.
Parker said: "One of the things that we have to do is make sure that [digital materials] can be kept in the long term. There are huge problems with keeping things technologically available. We've got printed materials that are five- to six-hundred-years-old and they are in pretty good condition, whereas digital stuff can be unworkable in 20 years."
The British Library is using cryptographic time-stamping technology to protect the integrity of the electronic documents in its new archive.
It is using nCipher's DSE200 document sealing engine to time-stamp and digitally sign every item to prove that documents are authentic and have not been modified.
He said: "If we supply you with a book you can see if someone has torn out a page or if someone has tried to insert something but you can't do that with an electronic file so we are trying to guarantee you can do that."
The library is building the new archive system by starting with the storage tech, designing a system which has to be very scalable, fault tolerant, reliable and resilient. The system will also have to ensure future users can view the material with contemporary applications but still experience the original look-and-feel.
As Parker points out: "If we are serious we have to make sure this is available for centuries. People have to come here in 2100 or 2200 and find what was on the web in 2006. We don't want to be in the position of saying 'we had it but it was damaged and now we can't retrieve it.'"
Information handling – Working with the resource group manager as a librarian for all documents and maintaining document configuration library ...
Learn to implement simple bits of JavaScript, HTML and Flash components from our File Library. Understands how to implement bits of JavaScript, HTML ...
A key output of this task will the creation of a knowledge database based on regulation constraints, technologies, feedback from suppliers, library ...
Agenda Setters 2009
Welcome to the ninth annual Agenda Setters poll – silicon.com's list of the top 50 most influential individuals in the technology and IT industries, from techies and CIOs to entrepreneurs and business leaders. Find out more in our latest special report.
Stories from the web...
Copyright © 2008 CBS Interactive Limited. All rights reserved. Top of page
Nick Heath
Let's shine a light into the public sector IT money pit
With £16bn being spent, why is productivity still falling?
Tim Ferguson
BBC is taking tech seriously, so give it a break!
Auntie is the envy of the world but doesn't get the credit it deserves at home...
Peter Cochrane
Peter Cochrane's Blog: Open info for all?
Government stonewalling citizens
Nick Heath
Home Office CIO on taming tech and why ID cards are good news
Interview: Annette Vernon, Home Office CIO
Nick Heath
NHS records, Google and Microsoft: Where do you want your data?
Politicians: Heal thyself
Alan Hunt
NHS network: Time to get secure
Patient data in need of a check up