<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	>

<channel>
	<title>Storage-Brain</title>
	<atom:link href="http://storage-brain.com/feed/" rel="self" type="application/rss+xml" />
	<link>http://storage-brain.com</link>
	<description>The Storage Evolution Meeting Place</description>
	<lastBuildDate>Fri, 22 Jul 2011 14:38:51 +0000</lastBuildDate>
	<language>en</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
			<item>
		<title>Deceit, Theft, Espionage, and Murder in the Storage Industry</title>
		<link>http://storage-brain.com/2011/07/deceit-theft-espionage-and-murder-in-the-storage-industry/?utm_source=rss&#038;utm_medium=rss&#038;utm_campaign=deceit-theft-espionage-and-murder-in-the-storage-industry</link>
		<comments>http://storage-brain.com/2011/07/deceit-theft-espionage-and-murder-in-the-storage-industry/#comments</comments>
		<pubDate>Fri, 22 Jul 2011 14:38:51 +0000</pubDate>
		<dc:creator>deastty</dc:creator>
				<category><![CDATA[Uncategorized]]></category>

		<guid isPermaLink="false">http://storage-brain.com/?p=570</guid>
		<description><![CDATA[<p>A few years back I wrote a series of Blog posts describing some little known facts about our always interesting Storage industry.  Since Whitey Bulger (see part 4) is back in the news I thought I would dig these out for some fun summer reading and re-post them here.</p> <p>Part 1:  http://communities.netapp.com/community/netapp-blogs/drdedupe/blog/2009/07/16/deceit-theft-espionage-and-murder-in-the-storage-industry&#8211;a-4-part-blog</p> <p>Part 2:  [...]]]></description>
			<content:encoded><![CDATA[<p>A few years back I wrote a series of Blog posts describing some little known facts about our always interesting Storage industry.  Since Whitey Bulger (see part 4) is back in the news I thought I would dig these out for some fun summer reading and re-post them here.</p>
<p>Part 1:  <a title="Part 1" href="http://communities.netapp.com/community/netapp-blogs/drdedupe/blog/2009/07/16/deceit-theft-espionage-and-murder-in-the-storage-industry--a-4-part-blog" target="_blank">http://communities.netapp.com/community/netapp-blogs/drdedupe/blog/2009/07/16/deceit-theft-espionage-and-murder-in-the-storage-industry&#8211;a-4-part-blog</a></p>
<p>Part 2:  <a title="Part 2" href="http://communities.netapp.com/community/netapp-blogs/drdedupe/blog/2009/07/22/deceit-theft-espionage-and-murder-in-the-storage-industry--part-2 " target="_blank">http://communities.netapp.com/community/netapp-blogs/drdedupe/blog/2009/07/22/deceit-theft-espionage-and-murder-in-the-storage-industry&#8211;part-2 </a></p>
<p>Part 3:  <a title="Part 3" href="http://communities.netapp.com/community/netapp-blogs/drdedupe/blog/2009/07/24/deceit-theft-espionage-and-murder-in-the-storage-industry--part-3" target="_blank">http://communities.netapp.com/community/netapp-blogs/drdedupe/blog/2009/07/24/deceit-theft-espionage-and-murder-in-the-storage-industry&#8211;part-3</a></p>
<p>Part 4: <a title="Part 4" href="http://communities.netapp.com/community/netapp-blogs/drdedupe/blog/2009/08/14/deceit-theft-espionage-and-murder-in-the-storage-industry--part-4 " target="_blank">http://communities.netapp.com/community/netapp-blogs/drdedupe/blog/2009/08/14/deceit-theft-espionage-and-murder-in-the-storage-industry&#8211;part-4 </a></p>
<p>Enjoy!</p>
]]></content:encoded>
			<wfw:commentRss>http://storage-brain.com/2011/07/deceit-theft-espionage-and-murder-in-the-storage-industry/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>What Is Policy-Based Storage Management?</title>
		<link>http://storage-brain.com/2011/05/what-is-policy-based-storage-management-2/?utm_source=rss&#038;utm_medium=rss&#038;utm_campaign=what-is-policy-based-storage-management-2</link>
		<comments>http://storage-brain.com/2011/05/what-is-policy-based-storage-management-2/#comments</comments>
		<pubDate>Fri, 20 May 2011 22:32:11 +0000</pubDate>
		<dc:creator>larryf</dc:creator>
				<category><![CDATA[Uncategorized]]></category>

		<guid isPermaLink="false">http://storage-brain.com/?p=549</guid>
		<description><![CDATA[<p>In prior blogs I’ve stressed the fact that cloud-enabled storage requires policies to make the best use of your storage resources.  In this blog I’ll dig a little deeper into what I mean by this.</p> <p>First, let’s define some storage attributes that could be used to create your policies.</p> <p>1) Growth - When I [...]]]></description>
			<content:encoded><![CDATA[<p>In prior blogs I’ve stressed the fact that cloud-enabled storage requires policies to make the best use of your storage resources.  In this blog I’ll dig a little deeper into what I mean by this.</p>
<p>First, let’s define some storage attributes that could be used to create your policies.</p>
<p>1) <strong>Growth -</strong> When I provision storage for this application, will it have slow growth, or fast growth?</p>
<p>2) <strong>Efficiency -</strong> Do I want to pack all my data as tightly as possible, or am I concerned this might affect performance?</p>
<p>3) <strong>Speed -</strong> Does this application need the fastest pipes, or will any old pipe do?</p>
<p>4) <strong>Reliability -</strong> Will the world stop spinning (well at least <em>my</em> world) if I lose access to this data?  Or could I live without it for a few hours?</p>
<p>5) <strong>Security</strong> &#8211; Am I worried about anyone else seeing my data?  Or, like my daughter’s Facebook page, the more the merrier?</p>
<p>Using the 5 above attributes, we could build a simple policy structure that looks something like this:</p>
<p><a href="http://storage-brain.com/wp-content/uploads/2011/05/Policy-1.jpg"><img class="aligncenter size-medium wp-image-552" title="Policy-1" src="http://storage-brain.com/wp-content/uploads/2011/05/Policy-1-300x118.jpg" alt="" width="300" height="118" /></a></p>
<p>We’d just define 3 tiers of storage and plop data into the storage system in the appropriate tier.  Easy huh?  Well, yes, and that’s pretty much how storage has been provisioned for the past few decades.  Buy a fleet of storage arrays, categorize them as either tier 1, 2, or 3, and assign your applications accordingly.</p>
<p>But, unfortunately, simple methods aren’t always the most efficient.  What if, for instance, you have an application with high Security needs but low Speed and Growth requirements?  In the tiering model above, any attribute with a “high” value would override everything else in the matrix, causing inefficiencies in a simple silo’d tiered storage model.</p>
<p>To solve this problem, let’s look at a more advanced model:</p>
<p><a href="http://storage-brain.com/wp-content/uploads/2011/05/Policy-2.jpg"><img class="aligncenter size-medium wp-image-554" title="Policy-2" src="http://storage-brain.com/wp-content/uploads/2011/05/Policy-2-300x126.jpg" alt="" width="300" height="126" /></a></p>
<p>In this model, each characteristic of the 5 attributes can be specified independently.  Now, instead of 3 rigid tiers, 243 different profiles can be created by moving the check marks back and forth for each attribute.  A little more complicated, but certainly more efficient than the first model.  With virtualized storage and a shared IT resource model (i.e. the cloud), the concept of rigid tiers goes away and is replaced by a virtual storage pool containing storage systems of all shapes and sizes.  In order to provision storage, applications are assigned somewhere within the virtual pool based on the criteria defined in a profile such as the one shown.  In a simple policy model like the one above, provisioning could be done manually by humans, but is much better left to machines because things can get complicated very quickly.</p>
<p>To illustrate, let’s see what happens if a few sub-critera are defined for each attribute:</p>
<p><a href="http://storage-brain.com/wp-content/uploads/2011/05/Policy-3.jpg"><img class="aligncenter size-medium wp-image-559" title="Policy-3" src="http://storage-brain.com/wp-content/uploads/2011/05/Policy-3-254x300.jpg" alt="" width="254" height="300" /></a></p>
<p>Yikes, we now have over 14 million possible profiles based on only 15 attributes with 3 choices for each.  This is why automation quickly becomes a requirement in policy-based storage management.  With a policy engine, the desired profile can be automatically mapped to the most appropriate storage resource, with the appropriate software functionality automatically enabled, and a monitoring system in place to alert of any out-of-policy conditions.</p>
<p>What’s left for us humans to do?  Nothing really – other than listening to the machines hum, and waiting for some sort of alert that tells us something needs fixin&#8217;.  Sort of like an airline pilot listening to the jet engines hum and watching the cockpit gauges as autopilot actually flies the plane.  I’ll talk more about the pilot’s, er, storage architect’s role in policy-based storage management in an upcoming blog…</p>
]]></content:encoded>
			<wfw:commentRss>http://storage-brain.com/2011/05/what-is-policy-based-storage-management-2/feed/</wfw:commentRss>
		<slash:comments>2</slash:comments>
		</item>
		<item>
		<title>Too Much Information</title>
		<link>http://storage-brain.com/2011/05/too-much-information/?utm_source=rss&#038;utm_medium=rss&#038;utm_campaign=too-much-information</link>
		<comments>http://storage-brain.com/2011/05/too-much-information/#comments</comments>
		<pubDate>Thu, 05 May 2011 21:58:16 +0000</pubDate>
		<dc:creator>larryf</dc:creator>
				<category><![CDATA[Uncategorized]]></category>

		<guid isPermaLink="false">http://storage-brain.com/?p=533</guid>
		<description><![CDATA[<p>Over the past 30 years, online storage capacity grew from about 10GB to well over 100TB for the average IT shop, 10,000X growth during that period.  Expanding this math out over the next 30 years, We can expect to see the average data center storing 1 million TB’s, or 1 exabyte by 2040.</p> <p>Will we be able [...]]]></description>
			<content:encoded><![CDATA[<p>Over the past 30 years, online storage capacity grew from about 10GB to well over 100TB for the average IT shop, 10,000X growth during that period.  Expanding this math out over the next 30 years, We can expect to see the average data center storing 1 million TB’s, or 1 exabyte by 2040.</p>
<p>Will we be able to store all that data?  My guess is yes, easily.  Individual storage devices will have grown to 100’s of TB’s by that time, probably storing data bits at the atomic level. </p>
<p>Will we be able to access all that data quickly?  Yes, I believe so, since by 2040 data will not be housed on slow spinning platters but rather on silicon or some other exotic material and will travel close to the speed of light (or maybe faster than the speed of light?)</p>
<p>Will we be able to manage all that data? Well, now, that’s the tricky part, isn’t it?  If you think that the management of a petabyte of data is tough today, multiply that toughness by 1,000 – and that’s what the largest data centers will be dealing with someday sooner than you think.  If the average data center will have an exabyte in 2040, what will the largest data center have?  A zettabyte?  A yottabyte?  We are going to run out of names to describe the magnitude of data we’ll have…</p>
<p>Because of the continual growth of data, I believe that data management will quickly become the biggest problem facing IT shops.  Racking up a petabyte of storage isn’t any easier than it sounds, but provisioning this storage, protecting it, and serving it up with any reasonable speed is darn near impossible.</p>
<p>For the answer to this looming problem, I think we need to look back – back to the mainframe.  You see, the mainframe folks realized a long time ago that lots of data usually meant lots of headaches for administrators, and they came up with the idea that machines could manage these datasets much better than humans could.  A notable development in this area was IBM’s DFSMS, described below:</p>
<p><em>“Data Facility Storage Management Subsystem (DFSMS™) is a software suite that automatically manages data from creation to expiration. DFSMS provides allocation control for availability and performance, backup/recovery and disaster recovery services, space management, tape management, and reporting and simulation for performance and configuration tuning.”</em></p>
<p>By classifying storage devices and data, policy-based storage was possible.  IBM’s data classifications included 5 areas: data class, storage class, management class, storage group, and aggregate group.</p>
<p>Here is how IBM describes what they had created:</p>
<p><em>“The concept of policy-based storage management involves defining policies that allow the system to take over many storage management tasks that were previously performed manually.</em><em> </em></p>
<p><em>DFSMS separates the logical view of data from the physical view of data. The logical view of data is concerned with what the data look like and what services the data require. The physical view is concerned with where the data actually reside. The policy types that specify the logical view of data are: data class, storage class, and management class. Storage group is the single policy type for specifying physical storage. An aggregate-group policy specifies a grouping of data for purposes of backup and recovery in case of a disaster.”</em></p>
<p>So if IBM developed the idea of policy-based storage decades ago, why isn’t everyone using it today?  There are a couple reasons.  First of all, the mainframe has a big advantage over server/client environments.  With a mainframe, you know who the boss is.  This is a boss who micro-manages everything.  One operating system, one set of peripheral devices, and nobody better get out of line <em>or else</em>.  A server/client architecture is much more autonomous and a largely peer-based structure.  Disk storage systems are allowed free reign over whatever it is they do, and no one complains unless they can’t store or retrieve their data.  It’s much more difficult to impose an overarching authority in this environment.</p>
<p>Another reason policy-based storage isn’t so popular today is that is hasn’t really been needed.  Individual storage systems could scale large enough to support individual applications, resulting in storage silos that were highly functional albeit inefficient.</p>
<p>Enter 2011, the era of convergence between growing data and shrinking budgets.  The recession we experienced during the latter part of the 2000s still stings today.  Budgets are tight and inefficiency has left the building.  Meanwhile, data keeps growing like the pails of water that the Sorcerer’s Apprentice couldn’t control.  The constant flow of data can’t be allowed to overflow, it <strong>must</strong> be stored somewhere.</p>
<p>Storage silos are yesterday’s news.  Virtual storage pools with automated provisioning and protection policies are the future.  Expect to see storage vendors put more and more emphasis on the design of these features, and for users suffering under the weight of their data asking for them.</p>
<p>Larry</p>
]]></content:encoded>
			<wfw:commentRss>http://storage-brain.com/2011/05/too-much-information/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Infosmack Prognostications</title>
		<link>http://storage-brain.com/2011/04/my-infosmack-prognostications/?utm_source=rss&#038;utm_medium=rss&#038;utm_campaign=my-infosmack-prognostications</link>
		<comments>http://storage-brain.com/2011/04/my-infosmack-prognostications/#comments</comments>
		<pubDate>Wed, 27 Apr 2011 19:34:09 +0000</pubDate>
		<dc:creator>larryf</dc:creator>
				<category><![CDATA[Beyond 2010]]></category>
		<category><![CDATA[Cloud-enabled Storage]]></category>
		<category><![CDATA[Storage Communication]]></category>
		<category><![CDATA[Storage Devices]]></category>

		<guid isPermaLink="false">http://storage-brain.com/?p=527</guid>
		<description><![CDATA[<p>I recently appeared as a guest on an Infosmack weekly podcast.  Greg Knieriemen hosted the cast, and joining me were fellow data storage authors Marc Farley and W. Curtis Preston.  After the usual banter, we got down to the business of making some storage industry predictions.  I (of course) took the opportunity to get on [...]]]></description>
			<content:encoded><![CDATA[<p>I recently appeared as a guest on an <a href="http://www.theregister.co.uk/2011/04/26/infosmack_enterprise_tech_podcast_episode_96/">Infosmack </a>weekly podcast.  Greg Knieriemen hosted the cast, and joining me were fellow data storage authors Marc Farley and W. Curtis Preston.  After the usual banter, we got down to the business of making some storage industry predictions.  I (of course) took the opportunity to get on my Hybrid Disk Drive soapbox, proclaiming that this <em><strong>will </strong></em>happen within 5 years, and Curtis showed that he was passionate about the inevitability of self-encrypting disk drives (we had a gentlemen&#8217;s disagreement on that topic.)  Marc mentioned that when he wrote his book about SAN&#8217;s, he predicted that the ethernet SAN would someday become a reality (he was right.)</p>
<p>When the topic turned to cloud storage, I think we all agreed that automation would be key.  I made the comment that we humans are all fundamentally lazy and anytime we have a choice between man or machine to do mundane/difficult work we always opt for the machine.  In the cloud, this means that policy-based storage provisioning and data protection will become paramount.  Greg asked whether or not I thought this was the right direction to be going.  I responded that we really had no choice, the sheer volume of data we will soon be asked to manage will simply be impossible for humans to manage, so we&#8217;ll be handing control over to machines, like it or not.</p>
<p>We covered many other topics, such as examples of great successes and failures in the history of the data storage industry, and I gave a sneak peek into my next planned book &#8220;<em>Building the Virtual Mainframe.</em>&#8220;  All and all I thoroughly enjoyed being part of this podcast (yes even the banter) and I hope to participate in more Infosmack&#8217;s in the future&#8230;</p>
<p>In the meantime, here is a complete list of all the books authored by the people on this podcast:</p>
<p><a href="http://www.amazon.com/Building-Storage-Networks-Marc-Farley/dp/0072130725/ref=sr_1_2?s=books&amp;ie=UTF8&amp;qid=1303933386&amp;sr=1-2">Building Storage Networks</a> Marc Farley</p>
<p><a href="http://www.amazon.com/Storage-Networking-Fundamentals-Introduction-Applications/dp/1587051621/ref=sr_1_1?s=books&amp;ie=UTF8&amp;qid=1303933386&amp;sr=1-1">Storage Networking Fundmentals</a> Marc Farley</p>
<p><a href="http://www.amazon.com/Evolution-Storage-Brain-transformative-storage/dp/1451577648/ref=sr_1_1?s=books&amp;ie=UTF8&amp;qid=1303933013&amp;sr=1-1">Evolution of the Storage Brain</a> Larry Freeman</p>
<p><a href="http://www.amazon.com/Insiders-Guide-Data-Deduplication-compilation/dp/1456309455/ref=sr_1_5?s=books&amp;ie=UTF8&amp;qid=1288807731&amp;sr=1-5">The Insider&#8217;s Guide to Data Deduplication</a> Larry Freeman</p>
<p><a href="http://www.amazon.com/Backup-Recovery-Inexpensive-Solutions-Systems/dp/0596102461/ref=sr_1_1?ie=UTF8&amp;s=books&amp;qid=1303933212&amp;sr=1-1">Backup and Recovery</a> W. Curtis Preston</p>
<p><a href="http://www.amazon.com/Using-SANs-NAS-Curtis-Preston/dp/0596001533/ref=sr_1_3?ie=UTF8&amp;s=books&amp;qid=1303933212&amp;sr=1-3">Using SAN and NAS</a> W. Curis Preston</p>
<p><a href="http://www.amazon.com/Unix-Backup-Recovery-Curtis-Preston/dp/1565926420/ref=sr_1_4?ie=UTF8&amp;s=books&amp;qid=1303933212&amp;sr=1-4">Unix Backup and Recovery</a> W. Curtis Preston</p>
<p>Enjoy,</p>
<p>Larry</p>
]]></content:encoded>
			<wfw:commentRss>http://storage-brain.com/2011/04/my-infosmack-prognostications/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Who Cares About Metering and Chargeback?</title>
		<link>http://storage-brain.com/2011/03/who-cares-about-metering-and-chargeback/?utm_source=rss&#038;utm_medium=rss&#038;utm_campaign=who-cares-about-metering-and-chargeback</link>
		<comments>http://storage-brain.com/2011/03/who-cares-about-metering-and-chargeback/#comments</comments>
		<pubDate>Tue, 22 Mar 2011 16:59:42 +0000</pubDate>
		<dc:creator>larryf</dc:creator>
				<category><![CDATA[Cloud-enabled Storage]]></category>

		<guid isPermaLink="false">http://storage-brain.com/?p=487</guid>
		<description><![CDATA[<p>In my last post, I talked about 5 building blocks of the cloud, taken from the book “Cloud Storage for Dummies.”  As a reminder, these were:</p> Elastic, quickly adapting underlying infrastructure to changing subscriber demands SLA-driven, automated and integrated to provide swift response times Policy-based, with deep levels of automation to move data as [...]]]></description>
			<content:encoded><![CDATA[<p>In my last post, I talked about 5 building blocks of the cloud, taken from the book “Cloud Storage for Dummies.”  As a reminder, these were:</p>
<ol>
<li>Elastic, quickly adapting underlying infrastructure to changing subscriber demands</li>
<li>SLA-driven, automated and integrated to provide swift response times</li>
<li>Policy-based, with deep levels of automation to move data as required</li>
<li>Secure and reliable</li>
<li>Able to control geographically dispersed data</li>
</ol>
<p>Now let’s take the points above and simplify them a little more to describe cloud-enabled storage:</p>
<ol>
<li>Cloud-enabled storage is <strong><em>virtual</em></strong> – data is not bound by physical arrays or geography</li>
<li>Cloud-enabled storage is <strong><em>elastic</em></strong> – storage resources can be assigned transparently</li>
<li>Cloud-enabled storage is <strong><em>automated</em></strong> – pre-defined policies dictate the behavior of data</li>
<li>Cloud-enabled storage is <strong><em>secure</em></strong> – tenants share each other’s resources but not their data </li>
</ol>
<p>I’ll come back to those 4 points in later blogs, but in this blog I want to discuss a problem that emerges when you have a virtual, elastic, automated, and secure IT infrastructure.  That problem: how do you figure out who pays for the storage services you are providing?</p>
<p>In the old days, a project manager would go IT and said “I need a couple servers and 10TBs of storage for my new application.”  You’d pull out your vendor pricelists and add up all the costs (including network switches, backup software/hardware and of course your valuable time) and tell this manager, “OK I can have that ready for you in a couple months and I’ll need to charge your department around 100 thousand dollars.”</p>
<p>Once you both agreed to this amount, you’d go off and build out this project and when the vendor bills arrived, you’d charge the costs to the appropriate department.</p>
<p>In the era of cloud computing, however, this model changed.  Instead of buying servers, you simply create a couple of Virtual Machines.  Instead of buying a storage array, you carve some storage capacity out of your internal cloud and assign a backup policy.  Viola! While your friendly project manager is still standing there with his/her mouth agape, you say “OK that should do it, anything else?”</p>
<p>That’s the beauty of cloud computing and cloud-enabled storage – you can quickly and efficiently serve the IT needs of your organization.  The problem is; how to you charge for what you just did?  That’s where metering and chargeback come into play.  Since the cloud is designed as a “pay-per-use” activity, you need a way to figure out what was used and what your Users should pay.</p>
<p>From the storage perspective, this is easy, right?  Well yes it could be – you could simply total all your storage costs and divide this by all your storage capacity and charged a fixed cost per TB.  Using the example above, if you’ve invested $1,000,000 in a 100TB storage infrastructure, your cost per TB is $10,000 – and you could bill your friend $100,000 for their 10TB of storage.</p>
<p>The problem is that in this model, you are assuming the same requirements for each User, when in reality we all know that there is fast storage, slow storage, big storage, small&#8230;well you get the picture.  This kind of level-funding model doesn’t really charge Users for what they are getting.</p>
<p>To try and match costs more exactly, what we usually see IT shops do is divide their storage into 3 or 4 tiers based on loosely defined performance levels, price out each tier, and bill Users according.  This is a step in the right direction, but still far from perfect.  What about automatic storage tiering, for instance.  If an application sees high activity for a few weeks and moves to a higher performance tier, how do account for that?  What if your cloud provides automatic migration of data to reduce latency, how do you account for that?  The answer is automatic metering and a chargeback based on actual usage – something most people haven’t given much thought to, unless of course your business depends on the profits from providing Storage as a Service (STaaS.)</p>
<p>Since the typical IT shop doesn&#8217;t need to make a profit, chargeback is used as a means of covering costs, and metering is a way to charging Users fairly .  Metering and chargeback utilities are provided by nearly all major storage providers today – and provide a way to charge Users based on actual usage.  If you are using a chargeback system today (or would like to) and you are moving to a cloud infrastructure, this would be a good time to analyze your methodology.  Here are few questions you should ask:</p>
<ul>
<li>Does your chargeback have a sliding scale based on storage tiers?</li>
<li>How does your  chargeback account for automated tiering?</li>
<li>Is your chargeback pricing based on logical storage capacity or physical capacity?</li>
<li>Does your chargeback pricing take high performance IOPS into account?</li>
</ul>
<p>Metering and chargeback is an area that is sure to see more interest as we move more deeply into cloud and cloud-enabled storage.</p>
<p>For more on this topic, here are  a few storage-brain recommended resources, including User Guides for a couple of popular chargeback utilites:</p>
<p><a href="http://storage-brain.com/wp-content/uploads/papers/IBM_definingframeworkforcloud.pdf#search">http://storage-brain.com/wp-content/uploads/papers/IBM_definingframeworkforcloud.pdf#search</a>=</p>
<p><a href="http://storage-brain.com/wp-content/uploads/papers/Storage%20Chargeback%20User%20Guide.pdf">http://storage-brain.com/wp-content/uploads/papers/Storage%20Chargeback%20User%20Guide.pdf</a></p>
<p><a href="http://storage-brain.com/wp-content/uploads/papers/vCenterChargeback_v10_Users_Guide.pdf">http://storage-brain.com/wp-content/uploads/papers/vCenterChargeback_v10_Users_Guide.pdf</a></p>
]]></content:encoded>
			<wfw:commentRss>http://storage-brain.com/2011/03/who-cares-about-metering-and-chargeback/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Cloud Storage For [Advanced] Dummies</title>
		<link>http://storage-brain.com/2011/02/cloud-storage-for-advanced-dummies/?utm_source=rss&#038;utm_medium=rss&#038;utm_campaign=cloud-storage-for-advanced-dummies</link>
		<comments>http://storage-brain.com/2011/02/cloud-storage-for-advanced-dummies/#comments</comments>
		<pubDate>Fri, 04 Feb 2011 20:49:37 +0000</pubDate>
		<dc:creator>larryf</dc:creator>
				<category><![CDATA[Cloud-enabled Storage]]></category>

		<guid isPermaLink="false">http://storage-brain.com/?p=480</guid>
		<description><![CDATA[<p>As cloud computing gains traction, so does cloud-enabled storage.  But if you are like most people, you still don’t quite have a handle on what makes cloud-enabled storage different from regular old storage.  In this blog, I hope to shed some light on this topic.</p> <p>While I was doing my research for this blog, [...]]]></description>
			<content:encoded><![CDATA[<p>As cloud computing gains traction, so does cloud-enabled storage.  But if you are like most people, you still don’t quite have a handle on what makes cloud-enabled storage different from regular old storage.  In this blog, I hope to shed some light on this topic.</p>
<p>While I was doing my research for this blog, I came across <a href="http://lto.libredigital.com/?HitachiCloudStorageForDummies">a book</a> from Hitachi Data Systems, <em>Cloud Storage For Dummies</em>.  Not that HDS is the only vendor providing storage for the cloud, but I’m familiar with the Dummies series from Wiley, and I like the format they use, so I gave the book a quick read.  Sure enough, this book is packed with useful information for anyone that is a little storage saavy but wants to learn more about the cloud.  I guess I’d call this an <em>advanced</em> Dummies book, and I recommend it.</p>
<p>In the book, the authors offer up their 5 basic tenets of cloud computing:</p>
<ol>
<li>The ability to rapidly provision or deprovision a service</li>
<li>A consumption model where users pay for what they use</li>
<li>The agility to flexibly scale (‘flex up’ or ‘flex down’) the service without extensive pre-planning</li>
<li>A secure, direct connection to cloud without having to recode applications</li>
<li>Multi-tenancy capabilities that segregate and protect the data</li>
</ol>
<p>No argument here.  But how does the storage world tie into this model?  The authors spell this out for us too:</p>
<p><strong>To fully realize these benefits, cloud storage needs to be:</strong><strong> </strong></p>
<ol>
<li>Elastic, quickly adapting underlying infrastructure to changing subscriber demands</li>
<li>SLA-driven, automated and integrated to provide swift response times</li>
<li>Policy-based, with deep levels of automation to move data as required</li>
<li>Secure and reliable</li>
<li>Able to control geographically dispersed data</li>
</ol>
<p>Aha!  There you have it!  The 5 building blocks for a cloud-enabled storage infrastructure!  I largely agree with the authors here (that’s why I recommended the book!)  Let’s take a closer look at these 5 steps; one-by one.</p>
<p><strong>1.       </strong><strong>Elastic, quickly adapting underlying infrastructure to changing subscriber demands</strong></p>
<p>To me, this is all about virtualized and clustered storage.  It’s easy to have one storage array that’s quick and adaptable, but what if you have ten, or one hundred storage arrays?  If you can take all those independent arrays, combine them into one big virtual array, now we are talking cloud.  Virtual storage pools therefore are a <em>must-have</em> for cloud-enabled storage.</p>
<p><strong>2.       </strong><strong>SLA-driven, automated and integrated to provide swift response times</strong></p>
<p><strong>3.       </strong><strong>Policy-based, with deep levels of automation to move data as required</strong></p>
<p>I’ve combined #2 and #3 because they really address the same issue.  Clouds are big, too big for any human to manage – therefore they have to manage themselves.  Policy-based provisioning is the answer.  This also goes back to #1 and virtual storage pools.  The pools have to be smart enough to create sub-pools, and automatically place data in the appropriate sub-pool based on performance, protection, and security policies.</p>
<p><strong>4.       </strong><strong>Secure and reliable</strong></p>
<p>Is there any data that doesn’t need to be secure and reliable?  Well, in the cloud there are a few challenges.  Since you don’t know exactly where your data went once you threw it into the cloud, it had better be secure.  If your data just happened to land right next to your competitor’s data, in the same data center on the same floor in the same storage array, you want to be darned sure that it doesn’t somehow get mixed in with their data – we call this secure multi tenancy.  If you have sensitive data entering the cloud you’d better make sure it’s secure from accidental discovery.</p>
<p><strong>5.       </strong><strong>Able to control geographically dispersed data</strong></p>
<p>This is becoming more important as we disperse our people, data, and storage arrays around the globe.  If a large application is housed in the U.S., but most of the people using it are in China – there will be a lot of network latency and bandwidth costs to incur.  Wouldn’t it make more sense to move the data closer to the people using it?  Of course it would – and this will be a challenge for clouds of the future.</p>
<p>There, that’s all there is to it.  To make it even simpler, I’ll reduce the five steps to cloud-enabled storage down to two:</p>
<ol>
<li>Create intelligent storage pools with automated provisioning, performance, and protection policies</li>
<li>Defend the data in the pools with secure multi tenancy</li>
</ol>
<p>Any dummy should be able to understand that, right?</p>
<p>Larry</p>
]]></content:encoded>
			<wfw:commentRss>http://storage-brain.com/2011/02/cloud-storage-for-advanced-dummies/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>The Evolution of Flash Memory</title>
		<link>http://storage-brain.com/2010/12/the-evolution-of-flash-memory/?utm_source=rss&#038;utm_medium=rss&#038;utm_campaign=the-evolution-of-flash-memory</link>
		<comments>http://storage-brain.com/2010/12/the-evolution-of-flash-memory/#comments</comments>
		<pubDate>Wed, 22 Dec 2010 23:41:47 +0000</pubDate>
		<dc:creator>larryf</dc:creator>
				<category><![CDATA[Uncategorized]]></category>

		<guid isPermaLink="false">http://storage-brain.com/?p=464</guid>
		<description><![CDATA[<p>Every once in a while, a technology comes along that revolutionizes the way we store data.  Flash Memory has the potential to be one of those technologies.  Over time, Flash may well creep through storage arrays until it replaces all rotating magnetic storage devices.  But where did Flash come from, and what will it [...]]]></description>
			<content:encoded><![CDATA[<p>Every once in a while, a technology comes along that revolutionizes the way we store data.  Flash Memory has the potential to be one of those technologies.  Over time, Flash may well creep through storage arrays until it replaces all rotating magnetic storage devices.  But where did Flash come from, and what will it take for Flash to replace a well-understood, 50 year-old technology?</p>
<p>The roots of Flash are in ROM, or Read Only Memory.  In the early days of integrated circuits (this was during the 1970s for our younger readers,) computer microcode was permanently embedded in a set of ROM memory chips.  As the name implies, ROM chips always held the same data, and could not be erased or rewritten.  ROM chipsets worked great &#8211; until a code upgrade was needed, in which case the chipset, or the board that held them had to be replaced.</p>
<p>This lead to a breakthrough – EPROM, or Erasable Programmable Read Only Memory.  These chips had a clear window over the silicon and a little sticker over the window, usually denoting the rev of the microcode on the chip.  When it was time to upgrade the code, you pulled the sticker off and put the chip into a little easy-bake oven-looking contraption that had an ultraviolet light in it.  After about an hour under the light, the chip was erased and could be re-coded using a special EPROM programmer.  This was usually done at the factory, but hobbyists (i.e. hackers) could also perform this task with PROM erasers and programmers bought easily through their Computer Shopper magazine and plugged into any PC.</p>
<p>EPROMs allowed you to recycle expensive memory chips, but were still cumbersome to use.  Another breakthrough occurred when the EEPROM became available.  EEPROM stood for Electrically Erasable Programmable Read Only Memory.  The breakthrough here was that the memory chips could be erased by simply putting an electrical charge across the memory banks inside the chip.  In fact, EEPROMS could be erased and reprogrammed without removing them at all.</p>
<p>Along the way, someone figured out that EEPROMs were becoming more than just a way to store microcode, they were becoming solid state data storage devices, like their cousins Random Access Memory (RAM).  Because of the ability to store, erase, and re-store data, EEPROMs are also defined as a form of NV-RAM (Non Volatile Random Access Memory.)</p>
<p>EEPROMs took a huge leap in the 1990s as consumer electronic products started to take hold.  Digital music players, cameras and video recorders all needed substantial storage capacity, but the only viable storage product at the time were 1 inch micro disk drives, which were proving unreliable and expensive.  In response, in 1994, Sandisk announced its Compact Flash (CF) module, based on earlier work by Toshiba and Intel.  CF brought substantial innovations, including a standard packaged module that was a plug-and-play replacement for micro disk-drives and the ability to erase and re-record single memory cells, unlike earlier EEPROMs that needed to be completely erased.</p>
<p>The rest, they say, is history.  Consumer demand took off, and Flash prices plummeted to the point where enterprise storage array vendors took notice.  Could Flash memory modules replace traditional hard drives?  The answer, as you will see, is a definite maybe.</p>
<p><strong>Flash to the Future</strong></p>
<p>For all its greatness, Flash has two major weaknesses.  First of all, like all forms of EEPROM, those little transistors inside the memory module don’t like being flashed.  Eventually, after multitudes of electrical erasures, the transistor gates start to break down and will eventually fail.  Secondly, since you always need to erase, then write, in some cases Flash isn’t very fast.  In fact Flash-based Solid State Drives (SSDs) can be slower than high performance hard disk drives (HDDs) under some conditions.  In other conditions, however, SSD’s are incredibly faster than HDD’s.</p>
<p>What we are seeing today is aggressive innovation in the area of Flash technology that <em>should</em> put it in position to someday altogether replace HDD’s.  The industry is very competitive today with <a href="http://www.oempcworld.com/support/SLC_vs_MLC.htm">Single Level Cell (SLC) and Multi Level Cell (MLC) Flash designs</a>, as well as <a href="http://www.maltiel-consulting.com/NAND_vs_NOR_Flash_Memory_Technology_Overview_Read_Write_Erase_speed_for_SLC_MLC_semiconductor_consulting_expert.pdf">NAND and NOR gate wiring</a> in the quest to obtain the right balance of speed, density, reliability, and cost-effectiveness in Flash designs.  Flash software innovations include Wear Leveling and Bad Block Management (BBM) to offset the nasty habit that Flash memory has of wearing out after heavy us.</p>
<p>Are we there yet?  No, not by a long shot.  But surely more innovation is in store.  Stay tuned to these pages as the evolution of Flash continues…</p>
<p>Larry</p>
]]></content:encoded>
			<wfw:commentRss>http://storage-brain.com/2010/12/the-evolution-of-flash-memory/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>In Data Storage, Cache is King</title>
		<link>http://storage-brain.com/2010/12/in-data-storage-cache-is-king/?utm_source=rss&#038;utm_medium=rss&#038;utm_campaign=in-data-storage-cache-is-king</link>
		<comments>http://storage-brain.com/2010/12/in-data-storage-cache-is-king/#comments</comments>
		<pubDate>Wed, 15 Dec 2010 22:07:11 +0000</pubDate>
		<dc:creator>larryf</dc:creator>
				<category><![CDATA[Beyond 2010]]></category>
		<category><![CDATA[Storage Memory]]></category>

		<guid isPermaLink="false">http://storage-brain.com/?p=447</guid>
		<description><![CDATA[<p>In the realm of computers, solid-state memory is used for temporary data storage &#8211; and it’s fast. I am old enough to remember computers that used magnetic core memory, and as arcane as that sounds, core memory was fast too.</p> <p>Memory comes in many flavors. Wikipedia takes us on this walk down memory lane:</p> [...]]]></description>
			<content:encoded><![CDATA[<p>In the realm of computers, solid-state memory is used for temporary data storage &#8211; and it’s fast. I am old enough to remember computers that used magnetic core memory, and as arcane as that sounds, core memory was fast too.</p>
<p>Memory comes in many flavors. <a href="http://en.wikipedia.org/wiki/Computer_memory">Wikipedia </a>takes us on this walk down memory lane:</p>
<table>
<tbody>
<tr>
<td align="center"><strong><strong>Computer memory</strong> types</strong></td>
</tr>
<tr>
<td align="center"><strong><a title="Volatile memory" href="http://en.wikipedia.org/wiki/Volatile_memory">Volatile</a></strong></td>
</tr>
<tr>
<td>
<ul>
<li><a title="Dynamic random access memory" href="http://en.wikipedia.org/wiki/Dynamic_random_access_memory">DRAM</a>, e.g. <a title="DDR SDRAM" href="http://en.wikipedia.org/wiki/DDR_SDRAM">DDR SDRAM</a></li>
<li><a title="Static random access memory" href="http://en.wikipedia.org/wiki/Static_random_access_memory">SRAM</a></li>
<li>Upcoming
<ul>
<li><a title="T-RAM" href="http://en.wikipedia.org/wiki/T-RAM">T-RAM</a></li>
<li><a title="Z-RAM" href="http://en.wikipedia.org/wiki/Z-RAM">Z-RAM</a></li>
<li><a title="Twin Transistor RAM" href="http://en.wikipedia.org/wiki/Twin_Transistor_RAM">TTRAM</a></li>
</ul>
</li>
</ul>
</td>
</tr>
<tr>
<td align="center"><strong><a title="Non-volatile memory" href="http://en.wikipedia.org/wiki/Non-volatile_memory">Non-volatile</a></strong></td>
</tr>
<tr>
<td>
<ul>
<li><a title="Read-only memory" href="http://en.wikipedia.org/wiki/Read-only_memory">ROM</a>
<ul>
<li><a title="Programmable read-only memory" href="http://en.wikipedia.org/wiki/Programmable_read-only_memory">PROM</a></li>
<li><a title="EPROM" href="http://en.wikipedia.org/wiki/EPROM">EPROM</a></li>
<li><a title="EEPROM" href="http://en.wikipedia.org/wiki/EEPROM">EEPROM</a></li>
</ul>
</li>
<li><a title="Flash memory" href="http://en.wikipedia.org/wiki/Flash_memory">Flash memory</a></li>
<li><a title="Ferroelectric RAM" href="http://en.wikipedia.org/wiki/Ferroelectric_RAM">FeRAM</a></li>
<li>MRAM</li>
<li><a title="Phase-change memory" href="http://en.wikipedia.org/wiki/Phase-change_memory">PRAM</a></li>
<li>Upcoming
<ul>
<li><a title="Programmable metallization cell" href="http://en.wikipedia.org/wiki/Programmable_metallization_cell">CBRAM</a></li>
<li><a title="SONOS" href="http://en.wikipedia.org/wiki/SONOS">SONOS</a></li>
<li><a title="Resistive random-access memory" href="http://en.wikipedia.org/wiki/Resistive_random-access_memory">RRAM</a></li>
<li><a title="Racetrack memory" href="http://en.wikipedia.org/wiki/Racetrack_memory">Racetrack memory</a></li>
<li><a title="Nano-RAM" href="http://en.wikipedia.org/wiki/Nano-RAM">NRAM</a></li>
<li><a href="http://en.wikipedia.org/wiki/IBM_Millipede">Millipede</a></li>
</ul>
</li>
</ul>
</td>
</tr>
</tbody>
</table>
<p>While memory is fast, it’s not all that smart. Most random-access memory is simply used as a scratch pad for performing quick compute calculations, or is based on some sort of FIFO logic, meaning “I’ll only keep you in memory until some newer piece of data comes in”</p>
<p>Caching uses memory in a more sophisticated way. A clue to how caching memory works comes from the root of the word – “cacher” a French word meaning “to conceal.” With caching memory, some sort of logic is introduced to decide what data should be held in cache, and which can be ejected. And as the name implies, this is done secretly so that any caching is unknown to the requester of the data.</p>
<p>Caching significantly improves the access time to data when compared to spinning disk or, yes, even SSD.  In a perfect world, all data would be cached but of course costs would be prohibative to allow for this. As an alternative, and because Flash memory is becoming much more affordable, some advances are taking place in the adoption of caching memory.</p>
<p>In an article posted last September, Chris Mellor of The Register talked about <a href="http://www.theregister.co.uk/2009/09/25/six_flash_locations/">six ways to implement flash cache</a>, and also mentioned the vendors who were leading these efforts:</p>
<ul>
<li>In server motherboards (Intel)</li>
<li>In the server PCI bus (Fusion-io, Violin)</li>
<li> In the I/O adapter (Adaptec)</li>
<li> In the array controller (NetApp)</li>
<li> In SSD’s (STEC, Intel)</li>
<li> As a replacement for storage arrays (TMS, Sun, Violin)</li>
</ul>
<p> Three days after the original article, Mellor posted a 2nd piece, and proclaimed a <a href="http://www.theregister.co.uk/2009/09/28/dataram/">7th use for flash cache:</a></p>
<ul>
<li>In the Fibre Channel Fabric (Dataram)</li>
</ul>
<p>It&#8217;s clear to see that the use of Flash cache is spreading quickly.  As I mention in my book:</p>
<div><span style="font-size: x-small; font-family: ArialNarrow;"><span style="font-size: x-small; font-family: ArialNarrow;"><em><strong>&#8220;Memory (read cache, write cache, hybrid disk drives and solid state disk) will grow faster than any other storage component in the immediate future. This will be one of the more interesting aspects of storage evolution for the next five years.&#8221;</strong></em></span></span></div>
<div><span style="font-size: x-small; font-family: ArialNarrow;"><span style="font-size: x-small; font-family: ArialNarrow;"> </span></span><span style="font-size: x-small; font-family: ArialNarrow;"><span style="font-size: x-small; font-family: ArialNarrow;"></span></span></div>
<p>Enjoy the ride!</p>
<p>Larry</p>
]]></content:encoded>
			<wfw:commentRss>http://storage-brain.com/2010/12/in-data-storage-cache-is-king/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>What Is A Storage Array Controller?</title>
		<link>http://storage-brain.com/2010/12/what-is-a-storage-array-controller/?utm_source=rss&#038;utm_medium=rss&#038;utm_campaign=what-is-a-storage-array-controller</link>
		<comments>http://storage-brain.com/2010/12/what-is-a-storage-array-controller/#comments</comments>
		<pubDate>Tue, 07 Dec 2010 22:52:50 +0000</pubDate>
		<dc:creator>larryf</dc:creator>
				<category><![CDATA[Uncategorized]]></category>

		<guid isPermaLink="false">http://storage-brain.com/?p=435</guid>
		<description><![CDATA[<p>Early in my data storage career, 1979 to be exact, I attended a field engineering training class for IBM mainframe-attached data storage controllers. I distinctly remember that the class was 4 weeks long (the disk drive class was only 2 weeks.) Most of the class was spent pouring over schematics that followed the logical [...]]]></description>
			<content:encoded><![CDATA[<p>Early in my data storage career, 1979 to be exact, I attended a field engineering training class for IBM mainframe-attached data storage controllers. I distinctly remember that the class was 4 weeks long (the disk drive class was only 2 weeks.) Most of the class was spent pouring over schematics that followed the logical data flow through the controller &#8211; from input registers, through memory buffers, and into the output registers. Along the way there were diagnostic routines, error checking logic and power conditioners. In those days, a storage controller weighed several hundred pounds, had over a hundred circuit cards, and was about the size of a refrigerator.</p>
<p>Thanks to microprocessor technology and component miniaturization, storage controllers eventually shrunk down to a single circuit board that plugged into the CPU card cage &#8211; first for mainframes, then minicomputers, and finally open systems Unix and Windows servers. As computers got smaller, so did the boards, until the storage controller board was reduced to something you could hold in the palm of your hand. Instead of taking 4 weeks to understand and troubleshoot storage control logic and data flow, it took about 4 minutes to teach someone to replace the storage controller card with a new one. Such was the evolution of computer hardware in those days.</p>
<p>Throughout this period of miniaturization, not much changed as far the role of the storage controller. It simply processed commands sent by the computer system, and stored data on its connected disk drives. Each drive was a separate entity and each controller could manage a small bank of drives, usually up to 16 drives per controller. Until about 1990, this is the way that all disk drives communicated with computers &#8211; dumb controllers talking to dumb devices.</p>
<p>Two breakthrough events transformed the dumb storage controller into an intelligent storage array controller. The first was the advent of RAID (Redundant Array of Independent Disks), postulated in the landmark 1988 Berkeley paper. RAID architecture was an amalgamation of many disk drives into a single “logical” drive that processed data quickly and appeared to never fail. This transformation was made possible by the RAID controller – a souped-up storage controller that suddenly had some very useful intelligence. Using RAID, rack-mounted storage arrays with data striped across disk drives could store and retrieve data faster than ever and could continue running even when individual drives within the array stopped working – all done intelligently, without the knowledge or assistance of the computer system.</p>
<p>The second breakthrough occurred with storage virtualization. As far as I can recall, this technology was first made available on the Xiotech Magnitude in the early 2000’s. Before the Magnitude, if you wanted to add disk drives to an array, you had to power the whole thing down, install the drives, and then reconstruct the entire data set, either from another array or from tape backups. In other words, if you had made any plans for the weekend, you might as well cancel them.</p>
<p>Virtual storage made all that work unnecessary by mapping physical drives to logical containers. Want to add capacity? Simply add a couple disk drives to the array (leaving the power on) and map them to the existing drives. Viola! Your new storage capacity magically appeared, and the data was automatically spread out onto the new drives. Suddenly a whole new world of possibilities arose. How about multiple logical partitions within a single array? Yeah, we can do that too!</p>
<p>Today, the storage array controller has grown into a sophisticated, self sustaining entity. The typical array controller manages hundreds of physical disk drives, simultaneously handling RAID configurations, fault recovery and data storage/retrieval functions while optimizing storage capacity with thin provisioning, data deduplication, and snapshot copies. Advanced features such as multi-protocol support, data compression, and self-healing diagnostics are on the verge of becoming mandatory. In their 30+ years of evolution, storage controllers have come a long, long way.</p>
<p>The future looks bright for storage array controllers. Multi-core CPUs provide new processing capability that will allow even more intelligence to be designed throughout. Predictive failure analysis, automated tiering, and intelligent drive spin-down will bring array controllers to new heights. As I mention in my book <em>Evolution of the Storage Brain</em>: “By 2025, it will not be unusual to see a pair of multi-core storage controllers easily managing several petabytes of storage in a single, 19-inch rack.”</p>
<p>If anyone is interested in learning more about the development of storage virtualization techniques, I’d recommend reading Tom Clark’s book “Storage Virtualization: Technologies for Simplifying Data Storage and Management”</p>
<p><a href="http://www.amazon.com/Storage-Virtualization-Technologies-Simplifying-Management/dp/0321262514">http://www.amazon.com/Storage-Virtualization-Technologies-Simplifying-Management/dp/0321262514</a></p>
<p>Larry</p>
]]></content:encoded>
			<wfw:commentRss>http://storage-brain.com/2010/12/what-is-a-storage-array-controller/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Is Backup Software Still Needed?</title>
		<link>http://storage-brain.com/2010/11/is-backup-software-still-needed/?utm_source=rss&#038;utm_medium=rss&#038;utm_campaign=is-backup-software-still-needed</link>
		<comments>http://storage-brain.com/2010/11/is-backup-software-still-needed/#comments</comments>
		<pubDate>Wed, 24 Nov 2010 23:12:58 +0000</pubDate>
		<dc:creator>larryf</dc:creator>
				<category><![CDATA[Uncategorized]]></category>

		<guid isPermaLink="false">http://storage-brain.com/?p=427</guid>
		<description><![CDATA[<p>For most of my career I’ve been wholly engrossed in the world of data backups.  Since the advent of Unix and Windows server-based computing, data backups have seen an interesting evolution.  To begin with, the operating systems on these open servers had their own set of tools to let you do backups for free.  [...]]]></description>
			<content:encoded><![CDATA[<p>For most of my career I’ve been wholly engrossed in the world of data backups.  Since the advent of Unix and Windows server-based computing, data backups have seen an interesting evolution.  To begin with, the operating systems on these open servers had their own set of tools to let you do backups for free.  These utilities had funny names like tar, dd, cpio, and of course dump/restore and ntbackup. </p>
<p>Throughout the 80s, people would write a quick little cron script and backups worked just fine, until the number of servers and the amount of data got a little too big to handle.  Legato was the first independent company to recognize this and offered their <em>Networker</em> backup product as a commercial software application, followed quickly by several dozen competing companies.  These products offered more management than simple scripts to help with backups, things like automatic staggered scheduling, restarting of failed backups, and reports that would tell you if your backup jobs ran OK.</p>
<p>For a while, selling backup software to IT shops was big business.  I suppose it still is today, but instead of dozens of product offerings, the number of backup software vendors has dwindled down to about a half-dozen.  The question I keep asking myself is  how much longer will we need this sort of backup application at all?  Here&#8217;s why:</p>
<p>When backup software was king, there were two big things if offered:</p>
<ol>
<li>Tape media management.  Medium-sized IT shops would rotate hundreds of tapes in and out of their data centers.  Large shops would rotate thousands of tapes.  Backup software generally did a nice job of keeping track of which backups were on which tapes, usually with the aid of OCR barcode labels attached to the tape cartridges.</li>
<li>File history catalogs.  The backup software would keep a catalog, or database, of all the files it backed up.  Not sure which tape <em>mybigpreso.ppt</em> was backed up to?  You could do a quick search from the backup application and get a list of all the tapes holding your important file, then just click a button to restore it</li>
</ol>
<p>Backup software hit its peak in the 1990s then started to cool off in the 2000s, but today things have really changed.  First of all, there is much less reliance on tape as a backup medium.  Since 2000, the trend has been towards local disk-based backups and more recently outsourced cloud-based backups.  In either case, the need to track tape media has been reduced or eliminated.  Along with this trend is the growing intelligence of storage arrays.  This new intelligence includes features like the copying of files between storage arrays without server intervention and policy-based data protection.</p>
<p>For over 20 years, backup software applications have been notorious for being finicky and in need of constant attention, believe me I know this from first-hand experience.  They are also known as being very expensive to purchase and maintain.  I believe a new trend is coming soon, and it will revolve around self-backups by intelligent storage arrays. As <a href="http://www.dciginc.com/2010/09/as-2011-nears-a-transformation-in-backup-awai.html">Jerome Wendt recently blogged</a>, “Server-based backup is dying.”  Although he has a slightly different perspective on the reasons why (based on the fact that virtual servers cannot efficiently back themselves up), he and I are both reaching the same inevitable conclusion.</p>
<p>Larry</p>
]]></content:encoded>
			<wfw:commentRss>http://storage-brain.com/2010/11/is-backup-software-still-needed/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
	</channel>
</rss>

