THE SQL Server Blog Spot on the Web

Welcome to - The SQL Server blog spot on the web Sign in | |
in Search

Browse by Tags

All Tags » data quality   (RSS)
  • Data Mining Algorithms – Support Vector Machines

    Support vector machines are both, unsupervised and supervised learning models for classification and regression analysis (supervised) and for anomaly detection (unsupervised). Given a set of training examples, each marked as belonging to one of categories, an SVM training algorithm builds a model that assigns new examples into one category. An SVM ...
    Posted to Dejan Sarka (Weblog) by Dejan Sarka on June 23, 2015
  • Data Mining Algorithms – EM Clustering

    With the K-Means algorithm, each object is assigned to exactly one cluster. It is assigned to this cluster with a probability equal to 1.0. It is assigned to all other clusters with a probability equal to 0.0. This is hard clustering. Instead of distance, you can use a probabilistic measure to determine cluster membership. For example, you can ...
    Posted to Dejan Sarka (Weblog) by Dejan Sarka on May 12, 2015
  • Data Quality and Master Data Management Resources

    Many companies or organizations do regular data cleansing. When you cleanse the data, the data quality goes up to some higher level. The data quality level is determined by the amount of work invested in the cleansing. As time passes, the data quality deteriorates, and you need to repeat the cleansing process. If you spend an equal amount of ...
    Posted to Dejan Sarka (Weblog) by Dejan Sarka on October 14, 2013
  • Conferences in the next months

    Next months look full of interesting conferences and I’ll be speaking in several of those. I’ll be speaking at the following ones: SQL DAYS 17 – 18 September 2013, Germany SQL Saturday 260 Milano 8 October 2013, Italy SQL PASS Nordic 4 – 6 November 2013, ...
    Posted to Davide Mauri (Weblog) by manowar on August 29, 2013
  • Exploring earnings data for the UK [Open Data]

    I have a burgeoning interest in the world of Open Data which wikipedia describes as: Open data is the idea that certain data should be freely available to everyone to use and republish as they wish, without restrictions from copyright, patents or other mechanisms of control. The goals of the open data movement are similar to those of other ...
    Posted to Jamie Thomson (Weblog) by jamiet on February 11, 2013
  • Timewarp: What Is a Relational Database?

    Relational?!? Move On, Geezer!Maybe you're thinking that relational databases management systems (RDBMSs), like Microsoft SQL Server and Oracle, are going the way of punched cards and rotary phones.  After all, there's been a lot of hype these days in the IT media about the rise of so-called NoSQL (Not Only SQL) databases. ...
    Posted to Kevin Kline (Weblog) by KKline on September 5, 2012
  • WCF Error when using “Match Data” function in MDS Excel AddIn

    If you’re using MDS and DQS with the Excel Integration you may get an error when trying to use the “Match Data” feature that uses DQS in order to help to identify duplicate data in your data set. The error is quite obscure and you have to enable WCF error reporting in order to have the error details and you’ll discover that they are related to ...
    Posted to Davide Mauri (Weblog) by manowar on March 29, 2012
  • Simplifying CSV Data Loads

    Data files containing comma separated values, or CSV, are some of the most common data formats used for data representation and storage outside the database.  When it comes to loading CSV data into the database, many options exist, however, few make it as simple as CSVexpress, powered by expressor software.  I recently visited ...
    Posted to Kevin Kline (Weblog) by KKline on October 27, 2011
Powered by Community Server (Commercial Edition), by Telligent Systems
  Privacy Statement