NEWS AND ANALYSIS


IS CLOUD STORAGE THE ANSWER TO PRESERVATION?


As the costs of long-term digital preservation climb, cloud storage doesn’t look set to bring them down, reports Rebecca Pool


Read recent blog posts from David Rosenthal, and you could be left feeling quite uncomfortable, if not somewhat sick. The LOCKSS pioneer asserts that no-one has enough money to preserve even a fraction of the content worthy of preservation – and cloud storage, perceived as a cheap way out, isn’t. ‘People have this casual assumption that if you keep something for a few years, you can afford to keep it forever,’ he told Research Information. ‘This is not a safe assumption; hard drive costs have decreased rapidly but now this drop is slowing.’


And, Rosenthal explained, the cost of cloud storage has barely changed since its inception. ‘[Cloud storage providers] are coining money out of storage services,’ he quipped.


So what’s changed in the storage industry? In the last 30 years, the cost of disk storage has dropped around 30 to 40 per cent every year, according to Kryder’s Law – analogous to Moore’s Law, but for hard disk drives. However, thanks to delays in the roll-out of heat-assisted magnetic recording, the hard disk drive industry dropped off the Kryder curve by mid-2011. And, as Rosenthal said, that was before the 2011 Thailand floods destroyed a huge chunk of the world’s hard disk drive manufacturing capacity, raising prices overnight. The market has bounced back, but the prices have not. According to USA-based IHS iSuppli, hard disk drives remain the cheapest storage medium around, but prices won’t dip below the pre-flood range until 2014. Time to visit the new wave of service providers, promising cheap cloud storage?
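To get a feel for what a 30 to 40 per cent annual drop means once it compounds, here is a minimal Python sketch. The function name `relative_price` and the chosen time horizons are illustrative assumptions, not anything taken from Rosenthal’s own model.

```python
def relative_price(annual_drop: float, years: int) -> float:
    """Price of a unit of storage after `years`, relative to 1.0 today.

    Assumes (for illustration only) that the price falls by the same
    fraction `annual_drop` every year, compounding annually.
    """
    return (1.0 - annual_drop) ** years


# The 30 and 40 per cent figures are the range quoted in the article;
# the horizons below are arbitrary examples.
for drop in (0.30, 0.40):
    for years in (1, 5, 10, 30):
        price = relative_price(drop, years)
        print(f"{drop:.0%} annual drop, {years:2d} years: {price:.2e} of today's price")
```

At 30 per cent a year, a unit of storage costs roughly 3 per cent of today’s price after a decade; at 40 per cent it is well under 1 per cent. That steep curve is what makes ‘keep it forever’ look affordable, and it is why falling off the curve matters so much.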


Glen Robinson, solutions architect at Amazon Web Services, believes businesses typically over-pay for data archiving, making expensive upfront payments for archives: ‘Since [storage providers] have to estimate capacity requirements, they understandably over-provision to make sure they have enough capacity for data redundancy and unexpected growth. This results in under-utilised capacity.’ So now, several cloud storage businesses offer services that promise customers only pay for what they use. ‘This changes the game for data archiving and back-up. Customers pay nothing up front, pay a very low price for storage and can scale usage up and down as required,’ he said.


Given the promises, Rosenthal decided to run a LOCKSS box in Amazon Web Services’ cloud, using Amazon’s S3 storage service. He recorded detailed costings and compared these with the costs of local disk storage. ‘Current cloud storage services are just not cost-competitive with local hardware for long-term storage, including LOCKSS boxes,’ Rosenthal concluded. ‘Over three years, running a median-sized LOCKSS box at Amazon would cost between six and 12 times the cost of buying the hardware. Yes, there are other costs such as power, cooling and storage but a factor of six to 12 is still a lot.’


What’s more, after looking at price drops from several providers, he noted that the organisations had only reduced prices by up to three per cent every year, a fraction of the 30 to 40 per cent annual price drop seen in raw disk prices over the past 30 years. ‘It’s clear that the benefits of the decrease in raw storage prices are not going to cloud storage customers,’ asserted Rosenthal. ‘Amazon and its competitors should be riding the Kryder’s Law curve like everyone else, but pricing strategies have been to price products attractively, capture the market and then not reduce the price.’


Still, Amazon’s Robinson claimed that Amazon Web Services is ‘relentless’ about driving efficiencies and passing along the cost savings to the customer. ‘We’ve lowered our prices 24 times since launching our first service, with no competitive pressure to do so,’ he said. What’s more, the multi-national company recently introduced ‘Glacier’, a low-cost cloud storage service for the digital preservation market. ‘Use this if low-cost storage is paramount, your data is rarely retrieved and data retrieval times of several hours are acceptable,’ explained Robinson.


But while Rosenthal acknowledged that this stripped-down service has some excellent features – in return for accepting access latency the user pays only $0.01/GB/month – he asserted that the service is designed as a back-up for something you are storing elsewhere. The data may be very cheap to store, but in Rosenthal’s words: ‘It can be very expensive to get at.’ ‘The long-term competitiveness of any cloud storage services still depends on how closely the pricing tracks Kryder’s Law, not on the initial pricing,’ he added. So, given that the disk industry is failing to maintain its historical price decreases and cloud storage costs don’t look set to come down soon, does an economic business model even exist for long-term digital data preservation? The economics of digital preservation are very difficult full-stop; most institutions are funded on a yearly budget cycle, making long-term investment in storage problematic. ‘Even government institutions such as the National Archives and Records Administration, and the national libraries are struggling with the costs,’ Rosenthal said. ‘Right now, I’m building a model so we can understand this... it turned out to be a much bigger problem than I thought it would be.’
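Rosenthal’s point that long-term competitiveness depends on how closely pricing tracks Kryder’s Law, rather than on the initial price, can be illustrated with a rough Python sketch. It is not his model. The 1,000 GB archive, the once-a-year price cut and the function name `cumulative_cost` are hypothetical assumptions; only the $0.01/GB/month price, the three per cent figure and the 30 to 40 per cent range come from the article. Retrieval charges, which Rosenthal warns can be high, are deliberately left out.

```python
def cumulative_cost(monthly_price_per_gb: float, gigabytes: float,
                    years: int, annual_drop: float) -> float:
    """Total storage bill over `years` for a fixed-size archive.

    Assumes (hypothetically) that the provider cuts its price once a
    year by the fraction `annual_drop`. Retrieval fees are not modelled.
    """
    total = 0.0
    price = monthly_price_per_gb
    for _ in range(years):
        total += price * gigabytes * 12   # twelve monthly bills at this year's price
        price *= (1.0 - annual_drop)      # next year's price after the cut
    return total


ARCHIVE_GB = 1000        # hypothetical archive size
START_PRICE = 0.01       # $/GB/month, the Glacier price quoted in the article

# 3% is the cloud price drop cited in the article; 35% is simply the
# midpoint of the 30 to 40 per cent range quoted for raw disk.
for label, drop in (("cloud-like, 3% a year", 0.03), ("Kryder-like, 35% a year", 0.35)):
    for years in (3, 10, 30):
        cost = cumulative_cost(START_PRICE, ARCHIVE_GB, years, drop)
        print(f"{label}: {years:2d} years costs ${cost:,.2f}")
```

Under these assumptions the yearly bills form a geometric series, so the cost of keeping the data indefinitely approaches the first year’s bill divided by the annual drop: about 33 times the first-year bill at three per cent, against about 2.9 times at 35 per cent. The starting price is identical in both cases; only the rate of decline differs.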


APR/MAY 2013 Research Information 5



