THE SQL Server Blog Spot on the Web

Welcome to - The SQL Server blog spot on the web Sign in | |
in Search

Browse by Tags

All Tags » data profiling » sql server   (RSS)
  • Data Mining Algorithms – EM Clustering

    With the K-Means algorithm, each object is assigned to exactly one cluster. It is assigned to this cluster with a probability equal to 1.0. It is assigned to all other clusters with a probability equal to 0.0. This is hard clustering. Instead of distance, you can use a probabilistic measure to determine cluster membership. For example, you can ...
    Posted to Dejan Sarka (Weblog) by Dejan Sarka on May 12, 2015
  • Data Quality and Master Data Management Resources

    Many companies or organizations do regular data cleansing. When you cleanse the data, the data quality goes up to some higher level. The data quality level is determined by the amount of work invested in the cleansing. As time passes, the data quality deteriorates, and you need to repeat the cleansing process. If you spend an equal amount of ...
    Posted to Dejan Sarka (Weblog) by Dejan Sarka on October 14, 2013
Powered by Community Server (Commercial Edition), by Telligent Systems
  Privacy Statement