<?xml version="1.0" encoding="UTF-8" ?>
<?xml-stylesheet type="text/xsl" href="http://sqlblog.com/utility/FeedStylesheets/rss.xsl" media="screen"?><rss version="2.0" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:slash="http://purl.org/rss/1.0/modules/slash/" xmlns:wfw="http://wellformedweb.org/CommentAPI/"><channel><title>Search results matching tag 'ETL'</title><link>http://sqlblog.com/search/SearchResults.aspx?o=DateDescending&amp;tag=ETL&amp;orTags=0</link><description>Search results matching tag 'ETL'</description><dc:language>en-US</dc:language><generator>CommunityServer 2.1 SP2 (Build: 61129.1)</generator><item><title>Little Data Remains Important in Healthcare IT</title><link>http://sqlblog.com/blogs/andy_leonard/archive/2013/04/30/little-data-remains-important-in-healthcare-it.aspx</link><pubDate>Tue, 30 Apr 2013 14:31:24 GMT</pubDate><guid isPermaLink="false">21093a07-8b3d-42db-8cbf-3350fcbf5496:48935</guid><dc:creator>andyleonard</dc:creator><description>&lt;p&gt;&lt;a href="http://www.youtube.com/watch?v=I1wg1DNHbNU" target="_blank"&gt;&lt;img title="SameAsItEverWas" style="border-left-width:0px;border-right-width:0px;background-image:none;border-bottom-width:0px;float:left;padding-top:0px;padding-left:0px;margin:0px 5px 5px 0px;display:inline;padding-right:0px;border-top-width:0px;" border="0" alt="SameAsItEverWas" align="left" src="http://sqlblog.com/blogs/andy_leonard/SameAsItEverWas_445DFE18.jpg" width="244" height="194" /&gt;&lt;/a&gt;In his article &lt;a href="http://ht.ly/ky1ee" target="_blank"&gt;Healthcare's Big Problem With Little Data&lt;/a&gt;, author Dan Munro raises salient points about the state of health-related data. Electronic Health Records (EHR) were promoted as the end-all-be-all solution for the industry – a standardization that, I suppose, many thought would organically and naturally occur, stabilize, and be maintained.&lt;/p&gt;  &lt;p&gt;It hasn’t. At least not yet.&lt;/p&gt;  &lt;p&gt;My doctor and I speak about this almost each time I visit with him. The corporation that operates his practice nowadays seems endlessly locked in cycles of changing billing and EHR systems in search of low-cost compliance and integration. They’ve (literally) spent millions of dollars and my doctor hates the interfaces forced upon him and his patients (well, one, at least) hates the complexity of the billing and patient records systems. Can’t these systems all just get along?&lt;/p&gt;  &lt;p&gt;The result? Higher medical data management costs. I’ll give you one guesses who pays these costs.&lt;/p&gt;  &lt;p&gt;Munro posits the following from his &lt;a href="http://ht.ly/ky1ee" target="_blank"&gt;article&lt;/a&gt;:&lt;/p&gt;  &lt;blockquote&gt;   &lt;p&gt;By at least one estimate (&lt;a href="http://www.hitconsultant.net/2013/03/27/many-ehr-vendors-will-not-survive-to-see-meaningful-use-stage-2/"&gt;here&lt;/a&gt;) there are now about 500 independent EHR vendors.&amp;#160; Out of that large group is a subset of about 400 with at least one customer that has applied for Federal stimulus dollars through the labyrinthine process of meaningful use attestation. That would suggest a “first-cut” of about 100 vendors who made some commitment around certification – but have no reported customers (at least to date). That’s a staggering number of single-purpose software vendors for any industry to support – even bloated healthcare. The simple fact is it can’t. While there have been a few high-profile cases of EHR vendors shutting down, this last week was the first high-profile example of a vendor that was effectively decertified by the Feds for both their “ambulatory” and their “inpatient” EHR products. From the &lt;a href="http://www.hhs.gov/news/press/2013pres/04/20130425a.html"&gt;HHS.gov website&lt;/a&gt; last Thursday:&lt;/p&gt;    &lt;p&gt;&lt;em&gt;“We and our certification bodies take complaints and our follow-up seriously. By revoking the certification of these EHR products, we are making sure that certified electronic health record products meet the requirements to protect patients and providers,” &lt;/em&gt;said Dr. Mostashari.&lt;em&gt;“Because EHRMagic was unable to show that their EHR products met ONC’s certification requirements, their EHRs will no longer be certified under the ONC HIT Certification Program.”&lt;/em&gt;&lt;/p&gt; &lt;/blockquote&gt;  &lt;p&gt;&lt;a href="http://www.youtube.com/watch?v=I1wg1DNHbNU" target="_blank"&gt;You may ask yourself, well, how did we get here?&lt;/a&gt; This, folks, is a mess. What’s missing? Applied standards.&lt;/p&gt;  &lt;p&gt;&lt;strong&gt;“But Andy, you’ve told us standards slow down development!”&lt;/strong&gt;&lt;/p&gt;  &lt;p&gt;And I stand by that statement; standards &lt;em&gt;do&lt;/em&gt; slow down development…unless you’re building interfaces. And then standards become the means for decoupled snippets, functions, methods, applications, and even platforms to communicate with each other. In some cases, we simply cannot be productive without standards – like TCP/IP. What would happen if everyone coded their own version of internet traffic? If that was the case, very few of you would reading this post.&lt;/p&gt;  &lt;p&gt;Yes, standards slow things down. And yes, they are necessary to insure base functionality. In my humble opinion, we &lt;em&gt;have&lt;/em&gt; to get this right with healthcare data. We simply &lt;em&gt;must&lt;/em&gt;. While we see similar issues of data management across many fields, medical data is too important to mess around with; it’s (often literally) life and death. And it is certainly a high cost.&lt;/p&gt;  &lt;p&gt;&lt;strong&gt;More to Consider&lt;/strong&gt;&lt;/p&gt;  &lt;p&gt;&lt;a href="http://www.healthit.gov/providers-professionals/certification-process-ehr-technologies" target="_blank"&gt;Standards exist&lt;/a&gt;. Administering and certifying 400-500 vendor solutions is hard.&lt;/p&gt;  &lt;p&gt;&lt;strong&gt;Part of the&amp;#160; Solution&lt;/strong&gt;&lt;/p&gt;  &lt;p&gt;From the &lt;a href="http://www.hhs.gov/news/press/2013pres/04/20130425a.html" target="_blank"&gt;actions of the Department of Health and Human Services&lt;/a&gt; last week, one can ascertain HHS is taking steps to address the matter. But will all 400-500 companies voluntarily congeal their schemas? Possibly, but doubtful.&lt;/p&gt;  &lt;p&gt;My experience delivering US state Medicaid ETL solutions informs me there will be a need for data integration – regardless of the existence of standards and in spite of certification. Why? Standards are not static. The idea of &lt;em&gt;de facto&lt;/em&gt; standards emerges from the life cycle of software because &lt;a href="http://sqlblog.com/blogs/andy_leonard/archive/2010/02/17/software-is-organic-part-1.aspx" target="_blank"&gt;software is organic&lt;/a&gt;. Even if everyone agreed on the same interpretation of rigid standards (and they won’t), versions 2.0 through &lt;em&gt;n.n&lt;/em&gt; will – at a minimum – add fields to the schema. And with additional fields comes additional data.&lt;/p&gt;  &lt;p&gt;Standards will be revised when enough product schemas adopt the &lt;em&gt;de facto&lt;/em&gt;, and this will drive the need for yet more integration. Don’t take my word for it, examine the entropic history of &lt;a href="http://en.wikipedia.org/wiki/List_of_ICD-9_codes" target="_blank"&gt;ICD-9&lt;/a&gt; and &lt;a href="http://www.cms.gov/Medicare/Coding/ICD10/index.html?redirect=/icd10" target="_blank"&gt;ICD-10&lt;/a&gt; codes – the direction of progress is more data, not less.&lt;/p&gt;  &lt;p&gt;&lt;strong&gt;Learn More&lt;/strong&gt;&lt;/p&gt;  &lt;p&gt;This is one reason we at &lt;a href="http://LinchpinPeople.com" target="_blank"&gt;Linchpin People&lt;/a&gt; are focusing on Medical Data Integration. The recording of our first (free!) webinar about Medical Data Integration with SSIS 2012 is available &lt;a href="http://linchpinpeople.com/2013/04/medical-data-integration-with-ssis-2012-part-1-loading-claims-data/" target="_blank"&gt;here&lt;/a&gt;. Kent Bradshaw and I continue the series tomorrow presenting &lt;a href="http://linchpinpeople.enterthemeeting.com/m/MHEG4QRJ" target="_blank"&gt;Medical Data Integration with SSIS 2012, Part 2&lt;/a&gt; in which we focus on loading Provider and Drug data.&lt;/p&gt;  &lt;p&gt;I hope to see you there!&lt;/p&gt;  &lt;p&gt;:{&amp;gt;&lt;/p&gt;</description></item><item><title>ETL is dead, long live AP2 ?</title><link>http://sqlblog.com/blogs/jamie_thomson/archive/2013/02/15/etl-is-dead-long-live-ap2.aspx</link><pubDate>Fri, 15 Feb 2013 15:45:33 GMT</pubDate><guid isPermaLink="false">21093a07-8b3d-42db-8cbf-3350fcbf5496:47734</guid><dc:creator>jamiet</dc:creator><description>&lt;p&gt;Three days ago I posted &lt;a href="http://sqlblog.com/blogs/jamie_thomson/archive/2013/02/12/what-would-a-cloud-based-etl-tool-look-like.aspx" target="_blank"&gt;What would a cloud-based ETL tool look like?&lt;/a&gt; where I wondered out loud about the sorts of tools data integration dudes like myself would be using in the future. I got some good feedback and already have a list of “stuff” to go and look at including:&lt;/p&gt;  &lt;ul&gt;   &lt;li&gt;&lt;a href="http://www.boomi.com/" target="_blank"&gt;Boomi&lt;/a&gt; – They claim 1million cloud integrations (whatever one of those is) per day&lt;/li&gt;    &lt;li&gt;&lt;a href="http://aws.amazon.com/datapipeline/" target="_blank"&gt;AWS Data Pipeline&lt;/a&gt; – A web service that incorporates a scheduler, a workflow engine and (as the name suggests) a data pipeline engine&lt;/li&gt;    &lt;li&gt;&lt;a href="http://www.informaticacloud.com/" target="_blank"&gt;Informatica Cloud&lt;/a&gt; – An extension to Informatica’s &lt;a href="http://www.informatica.com/uk/company/news-and-events-calendar/press-releases/10222012-gartner-data-integration-tools-magic-quadrant.aspx" target="_blank"&gt;market-leading&lt;/a&gt; PowerCenter for SalesForce.&lt;/li&gt; &lt;/ul&gt;  &lt;p&gt;Most interesting to me though was a link that &lt;a href="https://twitter.com/joeharris76" target="_blank"&gt;Joe Harris&lt;/a&gt; provided to a a blog post by Mike Reich entitled &lt;a href="http://seabourneinc.com/2013/02/08/rethinking-etl-for-the-api-age/"&gt;Rethinking ETL for the API age&lt;/a&gt;. Mike outlined a number of points that really struck a chord with me; the key one was his message that the Extract-Transform-Load (ETL) mantra that has been trumpeted for years should be replaced by something that is more pertinent for “the cloud” – Mike offers &lt;strong&gt;Acquiring, Processing and Publishing&lt;/strong&gt; (AP2) as a new acronym (we all love acronyms, right?). The idea of &lt;em&gt;publishing&lt;/em&gt; data rather than &lt;em&gt;loading&lt;/em&gt; it really resonated with me as making data easily available in non-proprietary formats so that people can consume it in whatever manner they choose has &lt;a href="http://sqlblog.com/blogs/jamie_thomson/archive/2010/06/03/thinking-differently-about-bi-delivery.aspx" target="_blank"&gt;long been an interest of mine&lt;/a&gt;.&lt;/p&gt;  &lt;p&gt;Here are some other bulleted thoughts that came into my head as I read Mike’s blog post:&lt;/p&gt;  &lt;ul&gt;   &lt;li&gt;“&lt;strong&gt;Flows are fluid and flexible, unlike structured, point-to-point ‘pipelines’&lt;/strong&gt;” – My interpretation of “fluid and flexible” is that these “flows” can be plugged together to create a greater whole. This gives rise to the notion of &lt;em&gt;&lt;a href="http://en.wikipedia.org/wiki/Composability" target="_blank"&gt;composability&lt;/a&gt;;&lt;/em&gt; imagine being able to leverage flows that other people have constructed in your own flows. &lt;a href="http://pipes.yahoo.com/pipes/" target="_blank"&gt;Yahoo Pipes&lt;/a&gt; (which I first blogged about almost five years ago in &lt;a title="http://consultingblogs.emc.com/jamiethomson/archive/2007/05/07/Taking-Yahoo-Pipes-for-a-test-drive.aspx" href="http://consultingblogs.emc.com/jamiethomson/archive/2007/05/07/Taking-Yahoo-Pipes-for-a-test-drive.aspx" target="_blank"&gt;Taking Yahoo Pipes for a test drive&lt;/a&gt;) was an early incarnation of this notion of composability and is a great demonstrator of what the future holds for us.&lt;/li&gt;    &lt;li&gt;&lt;strong&gt;Composability&lt;/strong&gt; further gives rise to the notion of a marketplace where one could sell “flows”. For example, one could build a flow that aggregated data for a given search term from both Google and Bing, deduplicated the results then made them available as a single feed; expose that feed via a marketplace and charge on a pay-per-use basis. Its a simplistic, contrived example but in my opinion aptly demonstrates the opportunity here. I think data marketplaces, perhaps more pertinently &lt;em&gt;data integration marketplaces&lt;/em&gt;, are going to be huge, I really do. Given the technology agnostic nature that is being proposed here these marketplaces would be totally interoperable too, unlike the hateful app stores that &lt;a href="http://xkcd.com/1174/" target="_blank"&gt;today’s xkcd expertly satirises&lt;/a&gt;.&lt;/li&gt;    &lt;li&gt;“&lt;em&gt;&lt;strong&gt;by using APIs to move information around, we decouple the data from the underlying technology and vendor&lt;/strong&gt;&lt;/em&gt;” Absolutely true. An API is essentially a well-understood interface/abstraction over a proprietary data store so really there’s nothing new here (isn’t this what &lt;a href="http://en.wikipedia.org/wiki/Service-oriented_architecture" target="_blank"&gt;SOA&lt;/a&gt; was all about?) but there’s no harm in reiterating the point.&lt;/li&gt;    &lt;li&gt;“&lt;em&gt;&lt;strong&gt;information is stored in multiple structures and formats. Any effort to manage information should focus on translating between structures rather than trying to develop a common schema&lt;/strong&gt;&lt;/em&gt;” I worked on a project from 2005-2008 where we attempted to adhere to a supposed &lt;a href="http://ppdm.org/about-ppdm" target="_blank"&gt;industry standard schema&lt;/a&gt;. Eventually we realised that those attempts were futile given that no business can be fitted neatly into an industry-standard-shaped-box and that dovetails nicely with Mike’s point here.&lt;/li&gt;    &lt;li&gt;“&lt;strong&gt;There are four common processing tasks; combining multiple streams, translating data formats, QA information, integrate third party processing&lt;/strong&gt;” – I wonder if there is a fifth that we might refer to as data caching; after all, if we’re pulling data out of multiple APIs we are at the mercy of the speed at which those APIs can provide the data – is a person going to be prepared to wait for the data or do we need regularly cache the transformed data for easy retrieval?&lt;/li&gt;    &lt;li&gt;“&lt;strong&gt;Publishing should be application/technology agnostic&lt;/strong&gt;” It would be hard for me to agree more with this point.&lt;/li&gt; &lt;/ul&gt;  &lt;p&gt;As you can tell this is an area that I’m particularly interested in and shall continue to keep a watching brief.&lt;/p&gt;  &lt;p&gt;&lt;a href="http://twitter.com/jamiet" target="_blank"&gt;@Jamiet&lt;/a&gt;&lt;/p&gt;</description></item><item><title>What would a cloud-based ETL tool look like?</title><link>http://sqlblog.com/blogs/jamie_thomson/archive/2013/02/12/what-would-a-cloud-based-etl-tool-look-like.aspx</link><pubDate>Tue, 12 Feb 2013 14:15:17 GMT</pubDate><guid isPermaLink="false">21093a07-8b3d-42db-8cbf-3350fcbf5496:47664</guid><dc:creator>jamiet</dc:creator><description>&lt;p&gt;Given that “moving data to the cloud” is, rightly or wrongly, currently in vogue in our industry I have to think that pretty soon there will be a glaring need for tools that help us to move data between these heterogeneous sources – a cloud-based ETL tool for cloud-based data if you will. Perhaps such a thing already exists -&lt;a href="http://consultingblogs.emc.com/jamiethomson/archive/2009/07/08/kapow-etl-for-html.aspx" target="_blank"&gt;I’ve talked about Kapow&lt;/a&gt; in the past which may well be considered a form of cloud ETL tool given that it fetches data from the web– if you know of anything that might fit this very loose description feel free to let me know in the comments.&lt;/p&gt;  &lt;p&gt;I started to ponder what capabilities a cloud ETL tool should have and here’s a quick brainstormed list:&lt;/p&gt;  &lt;ul&gt;   &lt;li&gt;Data transformation would be done “in the cloud” i.e. I wouldn’t need to own my own hardware in order to run it&lt;/li&gt;    &lt;li&gt;Ability to consume data from/push data to* the following types of data protocols:&lt;/li&gt;    &lt;ul&gt;     &lt;li&gt;&lt;a href="http://en.wikipedia.org/wiki/Atom_(standard)" target="_blank"&gt;ATOM&lt;/a&gt; (application/atom+xm)&lt;/li&gt;      &lt;li&gt;&lt;a href="http://en.wikipedia.org/wiki/RSS_(file_format)" target="_blank"&gt;RSS&lt;/a&gt; (application/rss+xml)&lt;/li&gt;      &lt;li&gt;&lt;a href="http://www.odata.org/" target="_blank"&gt;OData&lt;/a&gt;&lt;/li&gt;      &lt;li&gt;&lt;a href="https://developers.google.com/gdata/" target="_blank"&gt;GData&lt;/a&gt;&lt;/li&gt;      &lt;li&gt;&lt;a href="http://www.json.org/" target="_blank"&gt;JSON&lt;/a&gt;&lt;/li&gt;      &lt;li&gt;&lt;a href="http://en.wikipedia.org/wiki/ODBC" target="_blank"&gt;ODBC&lt;/a&gt;&lt;/li&gt;      &lt;li&gt;&lt;a href="http://en.wikipedia.org/wiki/JDBC" target="_blank"&gt;JDBC&lt;/a&gt;&lt;/li&gt;      &lt;li&gt;&lt;a href="http://en.wikipedia.org/wiki/Resource_Description_Framework" target="_blank"&gt;RDF&lt;/a&gt; (application/rdf+xml)&lt;/li&gt;      &lt;li&gt;&lt;a href="http://en.wikipedia.org/wiki/File_Transfer_Protocol" target="_blank"&gt;FTP&lt;/a&gt;&lt;/li&gt;   &lt;/ul&gt;    &lt;li&gt;Ability to consume data from/push data to the following &lt;a href="http://en.wikipedia.org/wiki/MIME_type" target="_blank"&gt;MIME types&lt;/a&gt;:&lt;/li&gt;    &lt;ul&gt;     &lt;li&gt;text/html&amp;#160; (gives rise to the idea of screenscraping as a source of data)&lt;/li&gt;      &lt;li&gt;text/plain&lt;/li&gt;      &lt;li&gt;text/xml&lt;/li&gt;   &lt;/ul&gt;    &lt;li&gt;Adapters (possibly with a plug-in model) for the following cloud storage and API providers:&lt;/li&gt;    &lt;ul&gt;     &lt;li&gt;&lt;a href="http://aws.amazon.com/s3/" target="_blank"&gt;Amazon S3&lt;/a&gt;&lt;/li&gt;      &lt;li&gt;&lt;a href="http://en.wikipedia.org/wiki/Amazon_SimpleDB" target="_blank"&gt;Amazon SimpleDB&lt;/a&gt;&lt;/li&gt;      &lt;li&gt;&lt;a href="http://aws.amazon.com/rds/" target="_blank"&gt;Amazon RDS&lt;/a&gt;&lt;/li&gt;      &lt;li&gt;&lt;a href="http://aws.amazon.com/redshift/" target="_blank"&gt;Amazon RedShift&lt;/a&gt;&lt;/li&gt;      &lt;li&gt;&lt;a href="http://msdn.microsoft.com/en-gb/library/windowsazure/dd179423.aspx" target="_blank"&gt;Azure Tables&lt;/a&gt;&lt;/li&gt;      &lt;li&gt;&lt;a href="http://msdn.microsoft.com/en-us/library/windowsazure/dd135733.aspx" target="_blank"&gt;Azure BLOBs&lt;/a&gt;&lt;/li&gt;      &lt;li&gt;&lt;a href="http://msdn.microsoft.com/en-us/library/windowsazure/dd179363.aspx" target="_blank"&gt;Azure Queues&lt;/a&gt;&lt;/li&gt;      &lt;li&gt;&lt;a href="http://www.windowsazure.com/en-us/home/features/messaging/" target="_blank"&gt;Azure Service Bus&lt;/a&gt;&lt;/li&gt;      &lt;li&gt;&lt;a href="http://msdn.microsoft.com/en-us/library/windowsazure/ee336279.aspx" target="_blank"&gt;Azure SQL Database&lt;/a&gt;&lt;/li&gt;      &lt;li&gt;&lt;a href="http://datamarket.azure.com/" target="_blank"&gt;Azure Datamarket&lt;/a&gt;&lt;/li&gt;      &lt;li&gt;&lt;a href="http://en.wikipedia.org/wiki/BigTable" target="_blank"&gt;BigTable&lt;/a&gt;&lt;/li&gt;      &lt;li&gt;&lt;a href="http://wiki.developerforce.com/page/REST_API" target="_blank"&gt;Salesforce API&lt;/a&gt;&lt;/li&gt;      &lt;li&gt;&lt;a href="http://database.com/en/howitworks/open" target="_blank"&gt;Database.com (from Salesforce)&lt;/a&gt;&lt;/li&gt;      &lt;li&gt;&lt;a href="http://datasift.com/" target="_blank"&gt;Datasift&lt;/a&gt;&lt;/li&gt;      &lt;li&gt;&lt;a href="http://www.guardian.co.uk/data" target="_blank"&gt;Guardian Datastore&lt;/a&gt;&lt;/li&gt;      &lt;li&gt;&lt;a href="http://en.wikipedia.org/wiki/Hadoop" target="_blank"&gt;Hadoop&lt;/a&gt;&lt;/li&gt;      &lt;li&gt;&lt;a href="http://office365.com/" target="_blank"&gt;Office 365&lt;/a&gt;&lt;/li&gt;      &lt;li&gt;&lt;a href="http://en.wikipedia.org/wiki/Apache_Cassandra" target="_blank"&gt;Cassandra&lt;/a&gt;&lt;/li&gt;      &lt;li&gt;&lt;a href="http://www.workday.com/" target="_blank"&gt;WorkDay&lt;/a&gt;&lt;/li&gt;      &lt;li&gt;&lt;a href="https://www.dropbox.com/" target="_blank"&gt;DropBox&lt;/a&gt;&lt;/li&gt;      &lt;li&gt;&lt;a href="http://www.skydrive.com" target="_blank"&gt;SkyDrive&lt;/a&gt;&lt;/li&gt;      &lt;li&gt;&lt;a href="http://twitter.com" target="_blank"&gt;Twitter&lt;/a&gt;&lt;/li&gt;      &lt;li&gt;&lt;a href="https://developers.facebook.com/docs/reference/api/" target="_blank"&gt;Facebook Graph API&lt;/a&gt;&lt;/li&gt;      &lt;li&gt;&lt;a href="http://www.concur.com/" target="_blank"&gt;Concur&lt;/a&gt;&lt;/li&gt;      &lt;li&gt;&lt;a href="http://www.programmableweb.com/apis/directory" target="_blank"&gt;thousands more...&lt;/a&gt; &lt;/li&gt;   &lt;/ul&gt;    &lt;li&gt;Job scheduler&lt;/li&gt;    &lt;li&gt;An &lt;a href="http://en.wikipedia.org/wiki/Integrated_development_environment" target="_blank"&gt;IDE&lt;/a&gt; (open to debate whether the IDE should be “in the cloud” as well)&lt;/li&gt;    &lt;li&gt;Ability to carry out common transformations (join, aggregate, sort, &lt;a href="http://en.wikipedia.org/wiki/Projection_(relational_algebra)" target="_blank"&gt;projection&lt;/a&gt;) on those heterogeneous data sources&lt;/li&gt;    &lt;li&gt;Ability to authenticate using different authentication mechanisms&lt;/li&gt;    &lt;li&gt;Configurable logging&lt;/li&gt;    &lt;li&gt;Ability to publish transformed data in a manner that makes it consumable rather than insert it into another data store&lt;/li&gt; &lt;/ul&gt;  &lt;p&gt;Any thoughts here? As I said this is a brainstormed list so I don’t mind being told that I am approaching this from the wrong angle or even that I’m completely wrong . Should I be concentrating on scenarios rather than technologies?. I’m only too aware that given my ETL heritage my brain is already wired to consider how traditional ETL tools could be supplanted into the cloud (my mention of a job scheduler sold me out there) – perhaps that is completely wrong too and that my heritage is actually a disadvantage here.&lt;/p&gt;  &lt;p&gt;I’m interested to know what people think and hopefully trigger a conversation. I’m especially keen to hear about scenarios that you might have where you need to move and transform data that lives “in the cloud”.&lt;/p&gt;  &lt;p&gt;&lt;a href="http://twitter.com/jamiet" target="_blank"&gt;@Jamiet&lt;/a&gt;&amp;#160;&lt;/p&gt;  &lt;p&gt;*where applicable&lt;/p&gt;  &lt;p&gt;UPDATE. Within seconds of publishing this post I’d already &lt;a href="https://twitter.com/BrentO/status/301334110612373505" target="_blank"&gt;been alerted&lt;/a&gt; to &lt;a href="http://www.informaticacloud.com/" target="_blank"&gt;InformaticaCloud.com&lt;/a&gt; &amp;amp; &lt;a href="http://aws.amazon.com/datapipeline/" target="_blank"&gt;AWS Data Pipeline&lt;/a&gt;. Checking those out now!&lt;/p&gt;  &lt;p&gt;I’ve been &lt;a href="https://twitter.com/joeharris76/status/301346114240647169" target="_blank"&gt;recommended&lt;/a&gt; to check out the following articles:&lt;/p&gt;  &lt;ul&gt;   &lt;li&gt;&lt;a href="http://seabourneinc.com/2013/02/08/rethinking-etl-for-the-api-age/"&gt;Rethinking ETL for the API age&lt;/a&gt;&lt;/li&gt;    &lt;li&gt;&lt;a href="http://www.apievangelist.com/2013/02/10/bringing-etl-to-the-masses-with-apis/"&gt;Bringing ETL to the Masses with APIs&lt;/a&gt;&lt;/li&gt; &lt;/ul&gt;</description></item><item><title>SSIS Design Patterns, the Book</title><link>http://sqlblog.com/blogs/andy_leonard/archive/2012/08/06/ssis-design-patterns-the-book.aspx</link><pubDate>Mon, 06 Aug 2012 16:37:43 GMT</pubDate><guid isPermaLink="false">21093a07-8b3d-42db-8cbf-3350fcbf5496:44587</guid><dc:creator>andyleonard</dc:creator><description>&lt;p&gt;For the past two years, I have had the honor and privilege or authoring &lt;a href="http://www.amazon.com/SSIS-Design-Patterns-Matt-Masson/dp/1430237716" target="_blank"&gt;SSIS Design Patterns&lt;/a&gt; alongside Jessica Moss, Michelle Ufford, Tim Mitchell, and Matt Masson. Publication of the book – like many projects of this scope – has been delayed. The current publication date is 27 Aug 2012 and I have high confidence in this date. &lt;/p&gt;  &lt;p&gt;I take responsibility for publication delays and apologize to those who pre-ordered the book. The reasons for the delays are not important. I have built a career as a software developer and architect based on the following maxim:&lt;/p&gt;  &lt;blockquote&gt;   &lt;p&gt;&lt;em&gt;Deliver quality late, no one remembers.       &lt;br /&gt;Deliver junk on time, no one forgets.&lt;/em&gt;&lt;/p&gt; &lt;/blockquote&gt;  &lt;p&gt;The shared goal of everyone working on this project has been to deliver quality. Proofing the manuscripts, I believe we have achieved that goal. &lt;/p&gt;  &lt;p&gt;:{&amp;gt;&lt;/p&gt;</description></item><item><title>Have You Downloaded SQL Server 2012 Evaluation Edition? Why Not?!</title><link>http://sqlblog.com/blogs/andy_leonard/archive/2012/03/08/have-you-downloaded-sql-server-2012-evaluation-edition-why-not.aspx</link><pubDate>Thu, 08 Mar 2012 19:00:00 GMT</pubDate><guid isPermaLink="false">21093a07-8b3d-42db-8cbf-3350fcbf5496:42192</guid><dc:creator>andyleonard</dc:creator><description>&lt;p&gt;I am installing &lt;a target="_blank" href="http://www.microsoft.com/download/en/details.aspx?id=29066"&gt;SQL Server 2012 Evaluation Edition&lt;/a&gt; on a virtual machine as I type.&lt;/p&gt;  &lt;p&gt;You can do this. Here’s one way:&lt;/p&gt;  &lt;ol&gt;   &lt;li&gt;Grab some virtual machine software. I like &lt;a target="_blank" href="https://www.virtualbox.org/wiki/Downloads"&gt;Oracle VirtualBox&lt;/a&gt;. It’s cool. It’s free.       &lt;p&gt;&lt;a href="http://sqlblog.com/blogs/andy_leonard/SNAG-0127_2D158D49.jpg"&gt;&lt;img style="background-image:none;border-right-width:0px;padding-left:0px;padding-right:0px;display:inline;border-top-width:0px;border-bottom-width:0px;border-left-width:0px;padding-top:0px;" title="SNAG-0127" border="0" alt="SNAG-0127" width="244" height="161" src="http://sqlblog.com/blogs/andy_leonard/SNAG-0127_thumb_4FA9A5F7.jpg"&gt;&lt;/a&gt;&lt;/p&gt;   &lt;/li&gt;    &lt;li&gt;Install VirtualBox. &lt;/li&gt;    &lt;li&gt;Download the 180-day free trial of &lt;a target="_blank" href="http://www.microsoft.com/download/en/details.aspx?id=19994"&gt;Windows Server 2008 R2&lt;/a&gt;. Also cool. Also free.       &lt;p&gt;&lt;a href="http://sqlblog.com/blogs/andy_leonard/SNAG-0128_598E9762.jpg"&gt;&lt;img style="background-image:none;border-right-width:0px;padding-left:0px;padding-right:0px;display:inline;border-top-width:0px;border-bottom-width:0px;border-left-width:0px;padding-top:0px;" title="SNAG-0128" border="0" alt="SNAG-0128" width="244" height="167" src="http://sqlblog.com/blogs/andy_leonard/SNAG-0128_598E9762.jpg"&gt;&lt;/a&gt;&lt;/p&gt;   &lt;/li&gt;    &lt;li&gt;Once Windows Server 2008 R2 is downloaded, build a VirtualBox VM.      &lt;p&gt;&lt;a href="http://sqlblog.com/blogs/andy_leonard/NewVirtualBoxVM_0D568DB4.jpg"&gt;&lt;img style="background-image:none;border-right-width:0px;padding-left:0px;padding-right:0px;display:inline;border-top-width:0px;border-bottom-width:0px;border-left-width:0px;padding-top:0px;" title="NewVirtualBoxVM" border="0" alt="NewVirtualBoxVM" width="244" height="201" src="http://sqlblog.com/blogs/andy_leonard/NewVirtualBoxVM_thumb_367E25F2.jpg"&gt;&lt;/a&gt;&lt;/p&gt;   &lt;/li&gt;    &lt;li&gt;Download and install &lt;a target="_blank" href="http://www.microsoft.com/download/en/details.aspx?id=29066"&gt;SQL Server 2012 Evaluation Edition&lt;/a&gt;!       &lt;p&gt;&lt;a href="http://sqlblog.com/blogs/andy_leonard/SNAG-0130_756FA68D.jpg"&gt;&lt;img style="background-image:none;border-right-width:0px;padding-left:0px;padding-right:0px;display:inline;border-top-width:0px;border-bottom-width:0px;border-left-width:0px;padding-top:0px;" title="SNAG-0130" border="0" alt="SNAG-0130" width="244" height="143" src="http://sqlblog.com/blogs/andy_leonard/SNAG-0130_thumb_32442860.jpg"&gt;&lt;/a&gt;&lt;/p&gt;   &lt;/li&gt; &lt;/ol&gt;  &lt;p&gt;That’s all there is to it. You can get started today, no need to wait until 1 Apr 2012 for media to ship. And, you don’t have to worry about making changes to your laptop or workstation (other than installing VirtualBox). &lt;/p&gt;  &lt;p&gt;You can explore the cool new features of SQL Server 2012 for the next six months… &lt;i&gt;for free!&lt;/i&gt; What are you waiting for?&lt;/p&gt;  &lt;p&gt;:{&amp;gt;&lt;/p&gt;</description></item><item><title>Updated SSIS Framework, Now Includes Reports</title><link>http://sqlblog.com/blogs/andy_leonard/archive/2012/03/07/updated-ssis-framework-now-includes-reports.aspx</link><pubDate>Wed, 07 Mar 2012 12:00:00 GMT</pubDate><guid isPermaLink="false">21093a07-8b3d-42db-8cbf-3350fcbf5496:42149</guid><dc:creator>andyleonard</dc:creator><description>&lt;p&gt;I have built a beta version of &lt;a href="http://linchpinpeople.com" target="_blank"&gt;Linchpin People&lt;/a&gt;’s SSIS Framework, now including much-requested reports. You can grab the updated copy &lt;a href="http://andyleonard.me/wp3/wp-content/uploads/2012/03/SSISConfigFrameworkAndReports_20120306.zip"&gt;here&lt;/a&gt;. &lt;/p&gt;  &lt;p&gt;I appreciate any feedback you care to share.&lt;/p&gt;  &lt;p&gt;:{&amp;gt;&lt;/p&gt;</description></item><item><title>Simplifying CSV Data Loads</title><link>http://sqlblog.com/blogs/kevin_kline/archive/2011/10/27/simplifying-csv-data-loads.aspx</link><pubDate>Thu, 27 Oct 2011 17:55:00 GMT</pubDate><guid isPermaLink="false">21093a07-8b3d-42db-8cbf-3350fcbf5496:39424</guid><dc:creator>KKline</dc:creator><description>&lt;P&gt;Data files containing comma separated values, or CSV, are some of the most common data formats used for data representation and storage outside the database. &amp;nbsp;When it comes to loading CSV data into the database, many options exist, however, few make it as simple as CSVexpress, powered by expressor software. &amp;nbsp;I recently visited &lt;A href="http://www.csvexpress.com/"&gt;www.csvexpress.com&lt;/A&gt; to check out just how simple it could get. &amp;nbsp;&amp;nbsp;In short, &lt;A href="http://www.csvexpress.com/"&gt;CSVexpress&lt;/A&gt; offers a repeatable and quick way to load any CSV file into SQL Server (or any other database). &amp;nbsp;&amp;nbsp;For those whose data quality is not as pristine as it should be, CSVexpress also offers a wide variety of built-in functionality to repair the data issues. &amp;nbsp;&amp;nbsp;These are in addition to the data transformation components available out of the box, but let’s not get ahead of ourselves.&lt;/P&gt;
&lt;P&gt;&lt;BR&gt;The first thing I notice when I visit &lt;A href="http://www.csvexpress.com/"&gt;CSVexpress&lt;/A&gt; is that there are some video tutorials available on the main page. &amp;nbsp;I found it pretty straightforward to load a CSV file into the database without watching the tutorials beforehand. &amp;nbsp;However, by watching the tutorials I was able to learn more about some neat features and functions that I had not previously noticed.&amp;nbsp;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;For my test, I grabbed a simple CSV data file containing the following data:&lt;BR&gt;City,User_ID,Name,Street_Address,Status&lt;BR&gt;"Dallas",47,"Janet Fuller","445 Upland Pl.","Trial"&lt;BR&gt;"Lyon",38,"Andrew Heiniger","347 College Av.","Active"&lt;BR&gt;"Dallas",43,"Susanne Smith","2 Upland Pl.","Active"&lt;BR&gt;"Berne",22,"Bill Ott","250 - 20th Ave.","Active"&lt;BR&gt;"Boston",32,"Michael Ott","339 College Av.","Trial"&lt;BR&gt;"New York",41,"Bill King","546 College Av.","Deleted"&lt;BR&gt;"Oslo",45,"Janet May","396 Seventh Av.","Active"&lt;BR&gt;As you can see from the diagram below, the import of the data to create a schema was not difficult at all:&lt;IMG alt="" src="https://lh3.googleusercontent.com/e1Wesda8y4TyPA_0Zv0pk76IRzf1YXmgbvKiBwLViZx_A-GSTQQ909Ihlv1IvR-1yexkfqtagtiNMSoqW53H5BTes7fUxHqd0gelY-P8tqWJ39v8SgU" width=780 height=320&gt;&lt;BR&gt;Once the schema is configured, I can create the following simple data flow to move data from my CSV input file to my target table in SQL Server:&lt;IMG alt="" src="https://lh5.googleusercontent.com/-Sc-aKWVzCVWjCBRUhtgsdszTkA2KF7keBxE6Y9LlwPtF1ZH6RO9xmkSnJiTDRavSQJF7pyHM2NQH4b3PLjFNKKOVkWJaOQxEu7ewoQiMvlo6XKi4f8" width=321 height=90&gt;&lt;/P&gt;
&lt;P&gt;&lt;IMG id=internal-source-marker_0.41670942602338795 alt="" src="https://lh3.googleusercontent.com/Pa2kSjhOQO1rJ32jpKYeCB_MNIJxORqbd6jQvRbLOm-WGXdHgI-fq75p5iO1fo4X9EblwaUxM9jo6pyJ-s0sXS5DaEa6MdcVTaYAVPm64c_FfGyLNJo" width=624 height=381&gt;&lt;BR&gt;While there are other tools available for performing similar tasks, CSVexpress makes it very simple and intuitive. &amp;nbsp;However, as I mentioned earlier, where it starts getting really interesting is when you need to pre-process and clean-up the data prior to loading it. &amp;nbsp;&amp;nbsp;Whether it involves enriching the data from external data sources or web services, or identifying and repairing bad data, CSVexpress maintains a simple interface for all of that. &amp;nbsp;&lt;/P&gt;
&lt;P&gt;&lt;BR&gt;The best part - it’s all free of charge. &amp;nbsp;The version you can download from &lt;A href="http://www.csvexpress.com/"&gt;www.csvexpress.com&lt;/A&gt; is expressor’s free Community Edition. &amp;nbsp;expressor also offers a licensed Desktop and Standard Edition with even more advanced features, which are available for a 30-day trial. &amp;nbsp;&amp;nbsp;As a matter of fact, at the end of November, expressor will be introducing Salesforce support into their commercial editions and &lt;A href="http://www.csvexpress.com/"&gt;CSVexpress&lt;/A&gt; will feature a 30-day trial version. &amp;nbsp;You will be able to load Salesforce just as easily as if loading to SQL Server, or download your Salesforce contact, lead, and opportunity data and transform it before generating the right CSV output file (or files) that meets your daily, weekly, and monthly Excel reporting and analysis needs. &amp;nbsp;Now that’s easy!&lt;/P&gt;
&lt;P&gt;- Kev&lt;/P&gt;</description></item><item><title>Thoughts on Data Explorer</title><link>http://sqlblog.com/blogs/jamie_thomson/archive/2011/10/24/thoughts-on-data-explorer.aspx</link><pubDate>Mon, 24 Oct 2011 09:55:00 GMT</pubDate><guid isPermaLink="false">21093a07-8b3d-42db-8cbf-3350fcbf5496:39339</guid><dc:creator>jamiet</dc:creator><description>&lt;p&gt;To my mind the most interesting piece of news to come out of the recent PASS conference was the unveiling of a new SQL Azure Labs project coming from the SQL Server organisation that has the codename "Data Explorer" (not a very imaginitive codename I'm sure you'll agree) and for which there is information available at &lt;a href="http://www.microsoft.com/en-us/sqlazurelabs/labs/dataexplorer.aspx" target="_blank"&gt;http://www.microsoft.com/en-us/sqlazurelabs/labs/dataexplorer.aspx&lt;/a&gt; (in case you've surfed on here a few months on from when I originally wrote this blog post you should expect that that that URI will have become a dead link).&lt;/p&gt;&lt;p&gt;My good buddy Chris Webb (&lt;a href="http://cwebbbi.wordpress.com/" target="_blank"&gt;blog&lt;/a&gt; | &lt;a href="http://twitter.com/#%21/Technitrain" target="_blank"&gt;twitter&lt;/a&gt;) has already blogged about Data Explorer at&amp;nbsp; &lt;a href="http://cwebbbi.wordpress.com/2011/10/12/pass-summit-2011-day-1-keynote/" target="_blank"&gt;Pass Summit 2011 - Day 1 Keynote&lt;/a&gt; and &lt;a href="http://cwebbbi.wordpress.com/2011/10/20/self-service-etl-with-data-explorer/" target="_blank"&gt;Self-Service ETL with Data Explorer&lt;/a&gt; in which he made a very telling observation:&lt;/p&gt;&lt;blockquote&gt;&lt;p&gt;&lt;i&gt;It allows you to mash up data from various different sources then publish the result as an OData feed – very similar to &lt;a href="http://pipes.yahoo.com/" target="_blank"&gt;Yahoo Pipes&lt;/a&gt;&lt;/i&gt;&lt;/p&gt;&lt;/blockquote&gt;&lt;p&gt;I couldn't agree more with that assertion. I blogged about Yahoo Pipes over four years ago at &lt;a href="http://consultingblogs.emc.com/jamiethomson/archive/2007/05/07/Taking-Yahoo-Pipes-for-a-test-drive.aspx" target="_blank"&gt;Taking Yahoo Pipes for a test drive&lt;/a&gt; and I referred to it then as "ETL for RSS feeds"; it interested me greatly because here was a tool that enabled non-developers to pull data from multiple sources and make it available as a single data source that could be easily consumed; moreover it ran as a cloud service which has also long been an interest of mine. Granted, it only did this for RSS feeds but the premise was still really interesting to me; I believe that making data easily consumable is far more important than the tool chosen to consume it hence why I'm such a massive advocate of &lt;a href="http://sqlblog.com/blogs/jamie_thomson/archive/2010/06/03/thinking-differently-about-bi-delivery.aspx" target="_blank"&gt;iCalendar for BI&lt;/a&gt; and why you'll rarely find me talking about the likes of Business Objects, Cognos, Qlikview, Tableau and Power View on this blog (no disrespect intended to those tools or the people that use them - they're just not what floats my boat).&lt;/p&gt;&lt;p&gt;Where Yahoo Pipes consumes RSS feeds and provides RSS feeds, Data Explorer consumes from loads of different places and provides &lt;a href="http://www.odata.org" target="_blank"&gt;OData &lt;/a&gt;feeds (something I've been &lt;a href="http://consultingblogs.emc.com/jamiethomson/archive/tags/Astoria/default.aspx" target="_blank"&gt;banging on about&lt;/a&gt; &lt;a href="http://sqlblog.com/blogs/jamie_thomson/archive/tags/odata/default.aspx" target="_blank"&gt;for a while now&lt;/a&gt;) and if you're in the Microsoft ecosystem OData is increasingly looking like the &lt;a href="http://en.wikipedia.org/wiki/Lingua_franca" target="_blank"&gt;lingua franca&lt;/a&gt; for platform and device independent data integration. Moreover, according to recent blog post &lt;a href="http://blogs.msdn.com/b/timmall/archive/2011/10/21/creating-a-custom-rss-reader-in-montego-cloud.aspx" target="_blank"&gt;Creating a custom RSS reader in Montego (cloud)&lt;/a&gt; by project lead Tim Mallalieu Data Explorer will also be able to pull data directly out of web pages and that is stepping firmly into the territory of &lt;a href="http://kapowsoftware.com/" target="_blank"&gt;Kapow&lt;/a&gt; which, again, is a tool that Chris and I have blogged about before at &lt;a href="http://consultingblogs.emc.com/jamiethomson/archive/2009/07/08/kapow-etl-for-html.aspx" target="_blank"&gt;Kapow – ETL for HTML&lt;/a&gt; and &lt;a href="http://cwebbbi.wordpress.com/2009/06/29/kapow-technologies/%20" target="_blank"&gt;Kapow Technologies&lt;/a&gt;. Chris referred to Kapow as:&lt;/p&gt;&lt;blockquote&gt;&lt;p&gt;&lt;i&gt;a cross between a screenscraper and an ETL tool&lt;/i&gt;&lt;/p&gt;&lt;/blockquote&gt;&lt;p&gt;and again I wouldn't disagree. Data Explorer looks like filling the missing link that I was alluding to in the final paragraphs of my June 2009 blog post &lt;a href="http://consultingblogs.emc.com/jamiethomson/archive/2009/06/23/enterprise-mashups.aspx" target="_blank"&gt;Enterprise Mashups&lt;/a&gt;.&lt;br&gt;&lt;/p&gt;&lt;p&gt;Are you spotting a common theme here? Data Explorer is an ETL tool and given my obvious SSIS affiliations that makes it very interesting to me. That it runs as a cloud service and will be available to non-developers only makes it more intriguing and I can't wait until Data Explorer becomes available for us to tinker with later this year. No doubt Chris will be keeping a watching brief too.&lt;/p&gt;&lt;p&gt;&lt;a href="http://twitter.com/jamiet" target="_blank"&gt;@Jamiet&lt;/a&gt; &lt;/p&gt;&lt;p&gt;UPDATE:Some further thoughts...&lt;/p&gt;&lt;p&gt;It would be interesting to see what else could be done with this data once its exposed as a feed. I'll wager that in the not too distant future you'll be able to (for example) sell the output from your Data Explorer mashup on &lt;a href="https://datamarket.azure.com/" target="_blank"&gt;Azure Datamarket&lt;/a&gt; or view geocoded feeds on Bing Maps (note that &lt;a href="http://www.odata.org/blog/2011/10/14/geospatial-properties" target="_blank"&gt;Geospatial support is coming to OData in the very near future&lt;/a&gt;). There are lots of possibilities I'm sure and I'm looking forward to seeing what ideas others have for using and sharing this data.&lt;br&gt;&lt;/p&gt;&lt;p&gt;I'm also wondering whether there will be an option to host Data Explorer (and hence Data Explorer mashups) inside the enterprise. Today most enterprise data is contained within the corporate firewall thus will not be accessible from a Data Explorer service provided via SQL Azure; it would be a shame if such data could not be accessed by Data Explorer and hence why I hope there will be an on-premise version available. I can think of many scenarios at my past clients where the ability to easily make data consumable over HTTP and behind the firewall would have been invaluable.&lt;br&gt;&lt;/p&gt;</description></item><item><title>Stairway to SSIS Step 3–Adding Rows in Incremental Loads</title><link>http://sqlblog.com/blogs/andy_leonard/archive/2011/09/16/stairway-to-ssis-step-3-adding-rows-in-incremental-loads.aspx</link><pubDate>Fri, 16 Sep 2011 11:00:00 GMT</pubDate><guid isPermaLink="false">21093a07-8b3d-42db-8cbf-3350fcbf5496:38363</guid><dc:creator>andyleonard</dc:creator><description>&lt;p&gt;If you haven’t yet, check out the &lt;a href="http://www.sqlservercentral.com/stairway/" target="_blank"&gt;Stairways&lt;/a&gt; – a series of tutorial articles at SQL Server Central. They. rock. I was really excited when Steve Jones (&lt;a href="http://www.sqlservercentral.com/blogs/steve_jones/default.aspx" target="_blank"&gt;Blog&lt;/a&gt; | &lt;a href="http://twitter.com/way0utwest" target="_blank"&gt;@way0utwest&lt;/a&gt; | &lt;a href="http://sqlpeople.net/admin/2011/05/18/steve-jones/" target="_blank"&gt;SQLPeople&lt;/a&gt;) contacted me to write the Stairway series of tutorials for SSIS! The third article is live this morning. Altogether, the published articles are:&lt;/p&gt;  &lt;ul&gt;   &lt;li&gt;&lt;a href="http://www.sqlservercentral.com/articles/Integration+Services+(SSIS)/72492/"&gt;What is SSIS? Level1 of the Stairway to Integration Services&lt;/a&gt;&lt;/li&gt;    &lt;li&gt;&lt;a href="http://www.sqlservercentral.com/articles/Integration+Services+(SSIS)/72493/"&gt;The SSIS Data Pump - Level 2 of the Stairway to Integration Services&lt;/a&gt;&lt;/li&gt;    &lt;li&gt;&lt;a href="http://www.sqlservercentral.com/articles/Integration+Services+(SSIS)/75331/" target="_blank"&gt;Level 3 of the Stairway to Integration Services - Adding Rows in Incremental Loads&lt;/a&gt;&lt;/li&gt; &lt;/ul&gt;  &lt;p&gt;I sincerely hope you enjoy reading these articles as much as I enjoyed writing them!&lt;/p&gt;  &lt;p&gt;:{&amp;gt;&lt;/p&gt;</description></item><item><title>Designing an SSIS Framework</title><link>http://sqlblog.com/blogs/andy_leonard/archive/2011/09/16/designing-an-ssis-framework.aspx</link><pubDate>Fri, 16 Sep 2011 11:00:00 GMT</pubDate><guid isPermaLink="false">21093a07-8b3d-42db-8cbf-3350fcbf5496:38495</guid><dc:creator>andyleonard</dc:creator><description>&lt;p&gt;In preparation for &lt;a href="http://www.sqlsaturday.com/84/eventhome.aspx" target="_blank"&gt;SQL Saturday #84 – Kalamazoo&lt;/a&gt;,&amp;nbsp; &lt;a href="http://sqlbits.com" target="_blank"&gt;SQLBits 9&lt;/a&gt;, and the &lt;a href="http://www.sqlpass.org/summit/2011/" target="_blank"&gt;PASS Summit 2011&lt;/a&gt;; I’ve been honing my presentation called Designing an SSIS Framework. I delivered this presentation a few times over the past couple months. Each time, I promised the attendees I’d send them the code if they emailed me… and a couple things got in the way:&lt;/p&gt;
  
&lt;p&gt;First, a &lt;em&gt;lot&lt;/em&gt; of people requested the presentation. Second, I got really busy! Both of those beat their respective alternatives.&lt;/p&gt;
  
&lt;p&gt;I finished building and barely testing the code this evening. You can grab an &lt;em&gt;updated&lt;/em&gt;&amp;nbsp;copy &lt;a href="http://andyleonard.me/wp3/wp-content/uploads/2012/03/SSISConfigFrameworkAndReports_20120306.zip" target="_blank"&gt;here&lt;/a&gt;. The zip file contains the SSIS 2008 R2 solution and two other folders: Config and Sql. Config holds a configuration file that points to a database named SSISConfig on the default instance of your local workstation or server (edit SSISConfigParent.dtsConfig if you want to change this location). &lt;/p&gt;
  
&lt;p&gt;To make it all work dynamically, you need to create a System Environment Variable named SSISConfig (case-sensitive) and set the value to the full path of SSISConfigParent.dtsConfig on your workstation or server. Next, run the “Create SSISConfig.sql” script in the Sql folder, followed by the “Add An SSIS Applications.sql” script (edited to reflect the folder that contains the SSIS packages on your workstation or server). This should get everything ready for the SSISConfig SSIS solution. When you execute Parent.dtsx, it should call Template1.dtsx, then Template2.dtsx, and finally ErrorTest.dtsx. &lt;/p&gt;
  
&lt;p&gt;Let me know how it goes!&lt;/p&gt;
  
&lt;p&gt;:{&amp;gt;&lt;/p&gt;</description></item></channel></rss>