<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	xmlns:georss="http://www.georss.org/georss" xmlns:geo="http://www.w3.org/2003/01/geo/wgs84_pos#" xmlns:media="http://search.yahoo.com/mrss/"
	>

<channel>
	<title>OStor - Data deduplication in the cloud</title>
	<atom:link href="http://dedup.wordpress.com/feed/" rel="self" type="application/rss+xml" />
	<link>http://dedup.wordpress.com</link>
	<description>Just another WordPress.com weblog</description>
	<lastBuildDate>Sun, 15 Nov 2009 06:33:12 +0000</lastBuildDate>
	<language>en</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>http://wordpress.com/</generator>
<cloud domain='dedup.wordpress.com' port='80' path='/?rsscloud=notify' registerProcedure='' protocol='http-post' />
<image>
		<url>http://s2.wp.com/i/buttonw-com.png</url>
		<title>OStor - Data deduplication in the cloud</title>
		<link>http://dedup.wordpress.com</link>
	</image>
	<atom:link rel="search" type="application/opensearchdescription+xml" href="http://dedup.wordpress.com/osd.xml" title="OStor - Data deduplication in the cloud" />
	<atom:link rel='hub' href='http://dedup.wordpress.com/?pushpress=hub'/>
		<item>
		<title>OStor &#8211; data deduplication in the cloud &#8211; HowTo</title>
		<link>http://dedup.wordpress.com/2009/11/01/ostor-data-deduplication-in-the-cloud-howto/</link>
		<comments>http://dedup.wordpress.com/2009/11/01/ostor-data-deduplication-in-the-cloud-howto/#comments</comments>
		<pubDate>Sun, 01 Nov 2009 06:25:22 +0000</pubDate>
		<dc:creator>ppraveen</dc:creator>
				<category><![CDATA[Uncategorized]]></category>
		<category><![CDATA[data deduplication dedup open source ostor cloud howto]]></category>

		<guid isPermaLink="false">http://dedup.wordpress.com/?p=18</guid>
		<description><![CDATA[Quick Implementation Note OStor is implemented using Java which allows for quick prototyping, platform independence and a language of choice in this generation. The core idea is based on this paper &#8211; Low-bandwidth file system. Those principles have been applied for both Wide-area acceleration as well as Data deduplication. A future blog post will include [...]<img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=dedup.wordpress.com&amp;blog=10217908&amp;post=18&amp;subd=dedup&amp;ref=&amp;feed=1" width="1" height="1" />]]></description>
			<content:encoded><![CDATA[<p><strong>Quick Implementation Note</strong></p>
<p><a href="http://code.google.com/p/ostor/">OStor</a> is implemented using Java which allows for quick prototyping, platform independence and a language of choice in this generation. The core idea is based on this paper &#8211; <a href="http://pdos.csail.mit.edu/lbfs/">Low-bandwidth file system</a>. Those principles have been applied for both <a href="http://en.wikipedia.org/wiki/WAN_optimization">Wide-area acceleration</a> as well as <a href="http://en.wikipedia.org/wiki/Data_deduplication">Data deduplication</a>. A future blog post will include detailed implementation notes.</p>
<p><strong>Pre-requisites</strong></p>
<ul>
<li>Java 1.6 (Havn&#8217;t tested with Java 1.5 and previous versions)</li>
<li>SVN to check out code.</li>
<li>External libraries &#8211; hadoop, log4j and commons-codec. These are including the SVN repository.</li>
</ul>
<p>I have installed it out of box on my Mac OSX 10.5, 10.6 as well as Kubuntu 7 and later.</p>
<p><strong>HowTo</strong></p>
<p>Check out the code from <a href="http://code.google.com/p/ostor/">http://code.google.com/p/ostor/</a></p>
<ul>
<li>svn checkout <strong><em>http</em></strong>://ostor.googlecode.com/svn/trunk/ ostor</li>
</ul>
<p>Read the README file &#8211; ostor/README</p>
<ul>
<li>make install &#8211; find ostor.jar</li>
</ul>
<p><em>Run ostor in interactive mode </em></p>
<ul>
<li>java -cp *:*:jars/* com.ostor.dedup.core.DedupStorCli .dstor</li>
<li>type help to look at syntax</li>
<li><em>add file data/emacs.html &#8211; </em>adds a file to the repository</li>
<li><em>show object all </em>- dump all objects in the repository</li>
<li>INFO [main] (DedupObjectStor.java:170) &#8211; Dump object stor, number of object &#8211; 1<br />
INFO [main] (DedupObject.java:312) &#8211; [Object - data/emacs.html] length &#8211; 3170551 num segs &#8211; 143 unique:: segs &#8211; (143/143) size &#8211; (100%)</li>
<li><em>show segment all </em>- dump all segments in the repository</li>
<li>INFO [main] (DedupSegment.java:212) &#8211; Dump segment &#8211; Id &#8211; SEGMENT-%D6%CBk%9D%EC%E6%88m%1F%1A%AC%E5%1E%23%FC*%81%F6e%60, len &#8211; 30720, num refs &#8211; 1, hash &#8211; 1strnezmiG0fGqzlHiP8KoH2ZWA=<br />
INFO [main] (DedupSegment.java:212) &#8211; Dump segment &#8211; Id &#8211; SEGMENT-Mfa%EDD%D7%7B%CF%C9%D0%3B%3D%FCk%1E%85cP%84%11, len &#8211; 30720, num refs &#8211; 1, hash &#8211; TWZh7UTXe8/J0Ds9/GsehWNQhBE=<br />
INFO [main] (DedupSegment.java:212) &#8211; Dump segment &#8211; Id &#8211; SEGMENT-%CB%B4%9F%7E%AA%9E%96%0EZ%EB8%B7%C0%D02%5DO%87%E8%95, len &#8211; 30720, num refs &#8211; 1, hash &#8211; y7Sffqqelg5a6zi3wNAyXU+H6JU=</li>
<li>&#8230;. and so on</li>
</ul>
<p>In the next blog spot, I will describe how to run in standalone mode and then in Hadoop mode.</p>
<br />  <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gocomments/dedup.wordpress.com/18/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/comments/dedup.wordpress.com/18/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godelicious/dedup.wordpress.com/18/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/delicious/dedup.wordpress.com/18/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gofacebook/dedup.wordpress.com/18/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/facebook/dedup.wordpress.com/18/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gotwitter/dedup.wordpress.com/18/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/twitter/dedup.wordpress.com/18/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gostumble/dedup.wordpress.com/18/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/stumble/dedup.wordpress.com/18/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godigg/dedup.wordpress.com/18/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/digg/dedup.wordpress.com/18/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/goreddit/dedup.wordpress.com/18/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/reddit/dedup.wordpress.com/18/" /></a> <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=dedup.wordpress.com&amp;blog=10217908&amp;post=18&amp;subd=dedup&amp;ref=&amp;feed=1" width="1" height="1" />]]></content:encoded>
			<wfw:commentRss>http://dedup.wordpress.com/2009/11/01/ostor-data-deduplication-in-the-cloud-howto/feed/</wfw:commentRss>
		<slash:comments>3</slash:comments>
	
		<media:content url="http://0.gravatar.com/avatar/4bee48452bd49a25aa7750ddde0ad0a9?s=96&#38;d=identicon&#38;r=G" medium="image">
			<media:title type="html">ppraveen</media:title>
		</media:content>
	</item>
		<item>
		<title>Moved project to google code</title>
		<link>http://dedup.wordpress.com/2009/11/01/moved-project-to-google-code/</link>
		<comments>http://dedup.wordpress.com/2009/11/01/moved-project-to-google-code/#comments</comments>
		<pubDate>Sun, 01 Nov 2009 03:27:40 +0000</pubDate>
		<dc:creator>ppraveen</dc:creator>
				<category><![CDATA[Uncategorized]]></category>
		<category><![CDATA[data deduplication dedup open source ostor cloud]]></category>

		<guid isPermaLink="false">http://dedup.wordpress.com/?p=6</guid>
		<description><![CDATA[Old project homepage http://ostor.sourceforge.net/ New project homepage http://code.google.com/p/ostor/ I found it much easier to manage the project on Google Code than sourceforge &#8211; quite unfortunate considering I wanted to stay vendor neutral. It was so much more easier to link my blogs, manage wiki&#8217;s, etc. on google code.<img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=dedup.wordpress.com&amp;blog=10217908&amp;post=6&amp;subd=dedup&amp;ref=&amp;feed=1" width="1" height="1" />]]></description>
			<content:encoded><![CDATA[<p><strong>Old project homepage</strong></p>
<p><a href="http://ostor.sourceforge.net/">http://ostor.sourceforge.net/</a></p>
<p><strong>New project homepage</strong></p>
<p><a href="http://code.google.com/p/ostor/">http://code.google.com/p/ostor/</a></p>
<p>I found it much easier to manage the project on Google Code than sourceforge &#8211; quite unfortunate considering I wanted to stay vendor neutral. It was so much more easier to link my blogs, manage wiki&#8217;s, etc. on google code.</p>
<br />  <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gocomments/dedup.wordpress.com/6/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/comments/dedup.wordpress.com/6/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godelicious/dedup.wordpress.com/6/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/delicious/dedup.wordpress.com/6/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gofacebook/dedup.wordpress.com/6/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/facebook/dedup.wordpress.com/6/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gotwitter/dedup.wordpress.com/6/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/twitter/dedup.wordpress.com/6/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gostumble/dedup.wordpress.com/6/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/stumble/dedup.wordpress.com/6/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godigg/dedup.wordpress.com/6/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/digg/dedup.wordpress.com/6/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/goreddit/dedup.wordpress.com/6/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/reddit/dedup.wordpress.com/6/" /></a> <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=dedup.wordpress.com&amp;blog=10217908&amp;post=6&amp;subd=dedup&amp;ref=&amp;feed=1" width="1" height="1" />]]></content:encoded>
			<wfw:commentRss>http://dedup.wordpress.com/2009/11/01/moved-project-to-google-code/feed/</wfw:commentRss>
		<slash:comments>1</slash:comments>
	
		<media:content url="http://0.gravatar.com/avatar/4bee48452bd49a25aa7750ddde0ad0a9?s=96&#38;d=identicon&#38;r=G" medium="image">
			<media:title type="html">ppraveen</media:title>
		</media:content>
	</item>
		<item>
		<title>Introducing OStor &#8211; data deduplication in the cloud. Open source project.</title>
		<link>http://dedup.wordpress.com/2009/11/01/introducing-ostor-data-deduplication-in-the-cloud-open-source-project/</link>
		<comments>http://dedup.wordpress.com/2009/11/01/introducing-ostor-data-deduplication-in-the-cloud-open-source-project/#comments</comments>
		<pubDate>Sun, 01 Nov 2009 02:53:04 +0000</pubDate>
		<dc:creator>ppraveen</dc:creator>
				<category><![CDATA[Uncategorized]]></category>
		<category><![CDATA[data deduplication dedup open source ostor cloud]]></category>

		<guid isPermaLink="false">http://dedup.wordpress.com/?p=3</guid>
		<description><![CDATA[I have just launched a open source project to provide data deduplication services. Short bio from the project summary page &#8211; http://code.google.com/p/ostor/. OStor (Optimized Storage) is a service to store data optimally using block level data de-duplication and compression techniques. It can be used as a standalone tool, an interactive tool as well as in [...]<img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=dedup.wordpress.com&amp;blog=10217908&amp;post=3&amp;subd=dedup&amp;ref=&amp;feed=1" width="1" height="1" />]]></description>
			<content:encoded><![CDATA[<p>I have just launched a open source project to provide data deduplication services. Short bio from the project summary page &#8211; <a href="http://code.google.com/p/ostor/">http://code.google.com/p/ostor/</a>.</p>
<p><em>OStor (Optimized Storage) is a service to store data optimally using block level data de-duplication and compression techniques. It can be used as a standalone tool, an interactive tool as well as in the cloud leveraging using Hadoop Map-Reduce framework.</em></p>
<p><strong>History</strong></p>
<p>In recent years, cloud computing has emerged as a new paradigm in the tech industry. As more and more IT Infrastructure moves into the cloud, data is being generated in the cloud at an unprecedented rate. Data also gets fed into the cloud from other sources. A portion of this data has to be retained for archival purposes. As data gets versioned and archived, a pattern emerges that mimics what was observed in the traditional IT environments &#8211; need to do data deduplication &#8211; <em>elimination of redundant data</em>. In <em>traditional IT environments</em>, various vendors provide such services since half-a-dozen years. Most notably, <a href="http://www.datadomain.com/">Data Domain</a> which was acquired by EMC recently.Their solutions are hardware based. Since the new generation of cloud services (Amazon AWS, Microsoft Azure, Google Apps) are based on virtualization, customers are reliant on either the cloud provider to provide such enhanced services or they use software-only solutions.<em><strong> </strong></em></p>
<p><em><strong>OStor</strong></em> attempts to bridge this gap. I will add details about the implementation and howto documentation in subsequent blog posts. Stay tuned.</p>
<br />  <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gocomments/dedup.wordpress.com/3/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/comments/dedup.wordpress.com/3/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godelicious/dedup.wordpress.com/3/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/delicious/dedup.wordpress.com/3/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gofacebook/dedup.wordpress.com/3/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/facebook/dedup.wordpress.com/3/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gotwitter/dedup.wordpress.com/3/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/twitter/dedup.wordpress.com/3/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gostumble/dedup.wordpress.com/3/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/stumble/dedup.wordpress.com/3/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godigg/dedup.wordpress.com/3/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/digg/dedup.wordpress.com/3/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/goreddit/dedup.wordpress.com/3/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/reddit/dedup.wordpress.com/3/" /></a> <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=dedup.wordpress.com&amp;blog=10217908&amp;post=3&amp;subd=dedup&amp;ref=&amp;feed=1" width="1" height="1" />]]></content:encoded>
			<wfw:commentRss>http://dedup.wordpress.com/2009/11/01/introducing-ostor-data-deduplication-in-the-cloud-open-source-project/feed/</wfw:commentRss>
		<slash:comments>1</slash:comments>
	
		<media:content url="http://0.gravatar.com/avatar/4bee48452bd49a25aa7750ddde0ad0a9?s=96&#38;d=identicon&#38;r=G" medium="image">
			<media:title type="html">ppraveen</media:title>
		</media:content>
	</item>
		<item>
		<title>Hello world!</title>
		<link>http://dedup.wordpress.com/2009/11/01/hello-world/</link>
		<comments>http://dedup.wordpress.com/2009/11/01/hello-world/#comments</comments>
		<pubDate>Sun, 01 Nov 2009 02:46:39 +0000</pubDate>
		<dc:creator>ppraveen</dc:creator>
				<category><![CDATA[Uncategorized]]></category>

		<guid isPermaLink="false"></guid>
		<description><![CDATA[Welcome to WordPress.com. This is your first post. Edit or delete it and start blogging!<img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=dedup.wordpress.com&amp;blog=10217908&amp;post=1&amp;subd=dedup&amp;ref=&amp;feed=1" width="1" height="1" />]]></description>
			<content:encoded><![CDATA[<p>Welcome to <a href="http://wordpress.com/">WordPress.com</a>. This is your first post. Edit or delete it and start blogging!</p>
<br />  <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gocomments/dedup.wordpress.com/1/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/comments/dedup.wordpress.com/1/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godelicious/dedup.wordpress.com/1/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/delicious/dedup.wordpress.com/1/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gofacebook/dedup.wordpress.com/1/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/facebook/dedup.wordpress.com/1/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gotwitter/dedup.wordpress.com/1/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/twitter/dedup.wordpress.com/1/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gostumble/dedup.wordpress.com/1/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/stumble/dedup.wordpress.com/1/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godigg/dedup.wordpress.com/1/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/digg/dedup.wordpress.com/1/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/goreddit/dedup.wordpress.com/1/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/reddit/dedup.wordpress.com/1/" /></a> <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=dedup.wordpress.com&amp;blog=10217908&amp;post=1&amp;subd=dedup&amp;ref=&amp;feed=1" width="1" height="1" />]]></content:encoded>
			<wfw:commentRss>http://dedup.wordpress.com/2009/11/01/hello-world/feed/</wfw:commentRss>
		<slash:comments>1</slash:comments>
	
		<media:content url="http://0.gravatar.com/avatar/4bee48452bd49a25aa7750ddde0ad0a9?s=96&#38;d=identicon&#38;r=G" medium="image">
			<media:title type="html">ppraveen</media:title>
		</media:content>
	</item>
	</channel>
</rss>
