<?xml version="1.0" encoding="UTF-8" ?>
<?xml-stylesheet type="text/xsl" href="http://sqlblog.com/utility/FeedStylesheets/rss.xsl" media="screen"?><rss version="2.0" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:slash="http://purl.org/rss/1.0/modules/slash/" xmlns:wfw="http://wellformedweb.org/CommentAPI/"><channel><title>My first venture into Hadoop</title><link>http://sqlblog.com/blogs/jamie_thomson/archive/2013/02/20/my-first-venture-into-hadoop.aspx</link><description>If you keep an eye on the tech industry you cannot have failed to notice that an Open Source technology called Hadoop is getting lots of coverage these days. You’ll be able to find lots of descriptions around the web about what Hadoop actually is so I</description><dc:language>en</dc:language><generator>CommunityServer 2.1 SP2 (Build: 61129.1)</generator><item><title>re: My first venture into Hadoop</title><link>http://sqlblog.com/blogs/jamie_thomson/archive/2013/02/20/my-first-venture-into-hadoop.aspx#47865</link><pubDate>Fri, 22 Feb 2013 11:03:03 GMT</pubDate><guid isPermaLink="false">21093a07-8b3d-42db-8cbf-3350fcbf5496:47865</guid><dc:creator>Emil Glowia</dc:creator><description>&lt;p&gt;I'm not sure if this particular set of data is unstructured? I prefer Inmon definition of unstructured that splits it into textual: emails, telephone conversations, PowerPoint presentations and more; and nontextual unstructured data which is graphics and images, x-rays , diagrams etc which he covers in this book &amp;quot;DW 2.0 The Architecture for the Next Generations of Data Warehousing&amp;quot;.&lt;/p&gt;
&lt;p&gt;To me this data is just source and process of creating a table is the same as creating a buffer by SSIS and then loading to destination table.&lt;/p&gt;
&lt;p&gt;Take care&lt;/p&gt;
&lt;p&gt;Emil&lt;/p&gt;
</description></item><item><title>re: My first venture into Hadoop</title><link>http://sqlblog.com/blogs/jamie_thomson/archive/2013/02/20/my-first-venture-into-hadoop.aspx#48878</link><pubDate>Thu, 25 Apr 2013 22:11:38 GMT</pubDate><guid isPermaLink="false">21093a07-8b3d-42db-8cbf-3350fcbf5496:48878</guid><dc:creator>Cindy Gross</dc:creator><description>&lt;p&gt;It is very rare for data in and of itself to not have &amp;quot;any&amp;quot; structure. In the big data world unstructured, multi-structured, and semi-structured really refer to the fact that you load the data first and put (and possibly rapidly modify) the structure/metadata as you query it in different ways. It's more about the concept of the schema coming after the data creation/load and being flexible.&lt;/p&gt;
</description></item></channel></rss>