<?xml version="1.0" encoding="UTF-8" ?>
<?xml-stylesheet type="text/xsl" href="http://sqlblog.com/utility/FeedStylesheets/rss.xsl" media="screen"?><rss version="2.0" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:slash="http://purl.org/rss/1.0/modules/slash/" xmlns:wfw="http://wellformedweb.org/CommentAPI/"><channel><title>Search results matching tags 'Statistics', 'security', and 'algorithms'</title><link>http://sqlblog.com/search/SearchResults.aspx?o=DateDescending&amp;tag=Statistics,security,algorithms&amp;orTags=0</link><description>Search results matching tags 'Statistics', 'security', and 'algorithms'</description><dc:language>en-US</dc:language><generator>CommunityServer 2.1 SP2 (Build: 61129.1)</generator><item><title>Hacking Social Security Numbers</title><link>http://sqlblog.com/blogs/michael_coles/archive/2009/07/12/hacking-social-security-numbers.aspx</link><pubDate>Mon, 13 Jul 2009 00:09:00 GMT</pubDate><guid isPermaLink="false">21093a07-8b3d-42db-8cbf-3350fcbf5496:15230</guid><dc:creator>Mike C</dc:creator><description>&lt;P&gt;According to &lt;A title="PNAS: Predicting SSNs" href="http://www.pnas.org/content/106/27/10975.full.pdf+html?sid=f655da07-5374-4129-afe3-a09ba3f3fe69"&gt;this paper&lt;/A&gt; from the Proceedings of the National Academy of the Sciences (PNAS), social security numbers (SSNs) are pretty easy for hackers, identity thieves,&amp;nbsp;and other miscreants to predict based on publicly available data. I found this interesting partly because I just recently (a few months ago) wrote a chapter for a book discussing security&amp;nbsp;for SSNs.&lt;/P&gt;
&lt;P&gt;Here's the deal - all SSNs have a very regular structure that looks like this: &lt;EM&gt;xxx&lt;/EM&gt;-&lt;EM&gt;yy&lt;/EM&gt;-&lt;EM&gt;zzzz&lt;/EM&gt;. With&amp;nbsp;9 numeric digits there are 1 billion possible combinations that can be assigned.&amp;nbsp; And of course we have the same information that identity thieves have - the rules for SSN assignment are posted for the public at the &lt;A title="SSA Homepage" href="http://www.ssa.gov/"&gt;Social Security Administration website&lt;/A&gt;.&lt;/P&gt;
&lt;P&gt;Here are some of the &lt;A title="SSA: SSN Assignment Rules" href="http://ssa-custhelp.ssa.gov/cgi-bin/ssa.cfg/php/enduser/std_adp.php?p_faqid=425&amp;amp;p_created=972930021&amp;amp;p_sid=h6SE_GCj&amp;amp;p_accessibility=0&amp;amp;p_redirect=&amp;amp;p_lva=&amp;amp;p_sp=cF9zcmNoPTEmcF9zb3J0X2J5PSZwX2dyaWRzb3J0PSZwX3Jvd19jbnQ9MjAsMjAmcF9wcm9kcz0mcF9jYXRzPTE2LDU4JnBfcHY9JnBfY3Y9Mi41OCZwX3NlYXJjaF90eXBlPWFuc3dlcnMuc2VhcmNoX25sJnBfcGFnZT0x&amp;amp;p_li=&amp;amp;p_topview=1"&gt;key rules&lt;/A&gt; that determine how SSNs are assigned, summarized from the &lt;A title="SSA: Rules for SSN Assignment" href="http://ssa-custhelp.ssa.gov/cgi-bin/ssa.cfg/php/enduser/std_adp.php?p_faqid=425&amp;amp;p_created=972930021&amp;amp;p_sid=h6SE_GCj&amp;amp;p_accessibility=0&amp;amp;p_redirect=&amp;amp;p_lva=&amp;amp;p_sp=cF9zcmNoPTEmcF9zb3J0X2J5PSZwX2dyaWRzb3J0PSZwX3Jvd19jbnQ9MjAsMjAmcF9wcm9kcz0mcF9jYXRzPTE2LDU4JnBfcHY9JnBfY3Y9Mi41OCZwX3NlYXJjaF90eXBlPWFuc3dlcnMuc2VhcmNoX25sJnBfcGFnZT0x&amp;amp;p_li=&amp;amp;p_topview=1"&gt;SSA website&lt;/A&gt;:&lt;/P&gt;
&lt;UL&gt;
&lt;LI&gt;&lt;EM&gt;xxx&lt;/EM&gt; is a 3-digit Area Number, and is assigned based on the ZIP Code from which the request to assign the SSN originates.&lt;/LI&gt;
&lt;LI&gt;&lt;EM&gt;yy &lt;/EM&gt;is a 2-digit Group Number, which is assigned in a predictable (nonconsecutive) order. The order of assignment of Group Numbers is also documented on the SSA website as well.&amp;nbsp; It's always a number between "01" and "99".&lt;/LI&gt;
&lt;LI&gt;&lt;EM&gt;zzzz&lt;/EM&gt; is a 4-digit Serial Number, which is a number between "0001" and "9999".&lt;/LI&gt;
&lt;LI&gt;There are a few stray SSNs that have been taken out of circulation for various reasons (used in marketing campaigns, etc.)&lt;/LI&gt;
&lt;LI&gt;And of course no SSN is ever reassigned.&lt;/LI&gt;&lt;/UL&gt;
&lt;P&gt;According to the rules a bad guy can narrow down the scope of his search substantially just by eliminating all SSNs that begin with 8&lt;EM&gt;xx&lt;/EM&gt;, 9&lt;EM&gt;xx&lt;/EM&gt;, 666, and 000.&amp;nbsp; That eliminates a couple 100 million+.&amp;nbsp; No SSNs have been assigned with a Group Number above 772, eliminating tens of millions in the 773 - 799 range.&amp;nbsp; No SSNs have, or will be, assigned with Group Numbers of 00 or Serial Numbers of 0000, eliminating millions more.&amp;nbsp; In addition the Group Numbers that have been assigned are available from the SSA website &lt;A title="SSA: High Group List" href="http://www.socialsecurity.gov/employer/ssnvhighgroup.htm"&gt;high group list&lt;/A&gt;, knocking hundreds of millions more possible SSNs off the list.&lt;/P&gt;
&lt;P&gt;This is just the beginning -- it gets better:&amp;nbsp;&lt;/P&gt;
&lt;P&gt;If you know where a person applied for their SSN (in many cases this will be where they were born, or close to it) you can use the &lt;A title="SSA: SSN Allocations List" href="http://www.socialsecurity.gov/employer/stateweb.htm"&gt;SSN Allocations list&lt;/A&gt;&amp;nbsp;to narrow down the search substantially.&amp;nbsp; In some cases this won't work though, since some parents don't apply for an SSN for their child immediately at birth.&lt;/P&gt;
&lt;P&gt;All this is to show how an identity thief&amp;nbsp;can use the location and approximate date of birth to accurately guess the first 5 digits of the SSN.&amp;nbsp; The PNAS authors were able to correctly guess the first 5 digits of SSNs &lt;EM&gt;with a single try&lt;/EM&gt; for 44% of their test records.&lt;/P&gt;
&lt;P&gt;At the other end of the spectrum, identity thieves can use the SSA's &lt;A title="SSA: Death Master File" href="http://ssa-custhelp.ssa.gov/cgi-bin/ssa.cfg/php/enduser/std_adp.php?p_faqid=149"&gt;Death Master File&lt;/A&gt; (DMF) to narrow down the last 4 digits (the Serial Number).&amp;nbsp; The PNAS authors used the DMF to figure out statistical distributions of SSN Serial Numbers to dramatically narrow down the last 4 digits.&amp;nbsp; They correctly guessed the complete SSNs for 8.5% of the test records with less than 1,000 attempts each; making the SSN for 8.5% of those tested less secure than a 4-digit ATM card PIN (in fact the authors compared it to an insecure&amp;nbsp;3-digit financial PIN).&lt;/P&gt;
&lt;P&gt;The authors' testing showed that overall full SSNs can be guessed with an accuracy of between 0.08% to 10% with less than 1,000 attempts each.&amp;nbsp; In rural areas they guessed complete SSNs at the rate of &amp;gt;60% for rural areas &lt;EM&gt;on the very first attempt&lt;/EM&gt;.&lt;/P&gt;
&lt;P&gt;To put some hard numbers to it, the authors estimated (based on various fairly reasonable assumptions), that an identity thief targeting a specific location (like a given state) could guess SSNs and obtain credit card accounts at the rate of about 47 per minute.&lt;/P&gt;
&lt;P&gt;Makes you wonder how secure your SSN is, really.&lt;/P&gt;</description></item></channel></rss>