Everyday Virtualization

Blog archive

Don't Call It De-Duplication

This week I had a chance to peek into the world of storage for virtualized infrastructure during HDS Geek Day, a series of briefings from Hitachi Data Systems. (Disclosure: HDS Geek Day was an on-site blogger event in San Jose, Calif. Read my full blogger disclosure here.)

As a blogger, I came in with an open mind and no preconceived notion about HDS storage products. What I do know, however, is that my short list of top-notch storage bloggers knows a lot about HDS and generally speak well about them. This includes Devang Panchigar, Chris Evans and Nigel Poulton; all of whom were at the event.

One of the sessions in day one that discussed some future technologies around HDS's vision for virtualization caught my attention, and it's not available in the form of a product yet. The idea was to leverage storage technologies for storage efficiency for multiple virtual machines versus a technology like linked clones or primary de-duplication. This session was presented by Miki Sandorfi, chief strategist for file and content services at HDS.

What caught my attention was to have integration with the virtualization stack to use storage technologies to provide disk usage efficiencies that would work with all virtualization products. This integration would use storage snapshots and virtual disk versioning to provide storage savings that resemble de-duplication on disk.

This object store versioning technique would use snapshots and versioning on the storage system to provide disk usage savings for virtual machines. The obvious example is a VDI implementation where the source disk is very similar. There would not be delta VMDKs or differential VHD files, only versions of the file system object and shapshots.

Do you want your storage managing the differential data sets of virtual machines (desktops or servers), or would you want the virtualization engine to manage the task? Share your comments here.

Posted by Rick Vanover on 06/17/2010 at 11:30 AM


Reader Comments:

Fri, Jun 18, 2010 Rick Vanover http://rickatron.us

Miki on twitter said:

@mikisandorfi ... If you never duplicate the data, there's nothing to dedupe!

That's a good way of explaining it.

Thu, Jun 17, 2010 Rick Vanover http://rickatron.us

Phoenix airport really kills feeds to power plugs? Not cool.

I was going to equate it somehow to the immigration law thing, but #1 rule of social media is don't talk politics or religion unless that's what your talking :)

Thu, Jun 17, 2010 Rick Vanover http://rickatron.us

It is de-duplication but not by traditional dedupe at block or file.

If that makes sense :/

I'm purposely bating some confusion here, as there will be more to it.

Thu, Jun 17, 2010 W. Curtis Preston PHX airport where they turn off all the power plugs

So how is that not deduplication? I can understand how Miki doesn't want to call it that, because dedupe has negative connotations when it comes to primary storage, but you've got multiple "files" (e.g. vmdks) that are pointing to the same block. That's dedupe in my book.

Add Your Comment:

Your Name:(optional)
Your Email:(optional)
Your Location:(optional)
Comment:
Please type the letters/numbers you see above