{"id":17,"date":"2007-02-21T23:14:00","date_gmt":"2007-02-22T05:14:00","guid":{"rendered":"https:\/\/keithcchan.com\/wordpress\/2007\/02\/the-high-price-of-number-crunching.html"},"modified":"2010-03-20T17:15:00","modified_gmt":"2010-03-20T23:15:00","slug":"high-price-of-number-crunching","status":"publish","type":"post","link":"https:\/\/www.keithcchan.com\/wordpress\/2007\/02\/high-price-of-number-crunching.html","title":{"rendered":"The High Price of Number Crunching"},"content":{"rendered":"<p>I do a lot of work in statistics. In fact, almost all of my work involves statistics. I don&#8217;t collect data in the field. I don&#8217;t do things in a lab. I sit in front of my computer working with data sets and spreadsheets. Lately I&#8217;ve been doing a lot of online research for a good Mac statistical package. Right now I use a combination of SPSS and Excel. <\/p>\n<p><a href=\"http:\/\/www.spss.com\/\">SPSS<\/a> seems like the perfect thing for me (it stands for Statistical Package for the Social Sciences). But it&#8217;s, in a word, bad. It&#8217;s just bad. I have version 11, which ironically works on Intel  while version 13 does not. Still, it follows archaic command line structure under its GUI. Variables can have a name eight letters long or less. No numbers or symbols. What? You can assign labels to variables but that&#8217;s killing one bird with two stones. And that&#8217;s gross.<\/p>\n<p><a href=\"http:\/\/office.microsoft.com\/en-us\/excel\/default.aspx\">Excel<\/a>&#8230; well Excel is easier to use (which says more about SPSS than Excel) but there are several published articles on how you shouldn&#8217;t use Excel due to its inaccuracy.<\/p>\n<p>In the statistical package market there are a few alternatives. <a href=\"http:\/\/www.gigawiz.com\/\">Aabel<\/a> and <a href=\"http:\/\/www.stata.com\/\">Stata<\/a> are both up-to-date to work with Intel Macs. The problem, they&#8217;re around $500 each, and that&#8217;s the student price (it&#8217;s almost as expensive as.. a PS3!). If you&#8217;re just some guy who wants to do statistics, it&#8217;s $1500. Additional feature packages are $300 or so each. Why&#8217;re they so expensive? I guess they&#8217;re trying to sell to people with grants, but you&#8217;d think they would want to aim low and get students to like it first, then sell it to them again when they&#8217;re professors with labs.<\/p>\n<p>Open source to the rescue! Kind of. <a href=\"http:\/\/www.r-project.org\/\">R<\/a> is free, but it&#8217;s strictly command line. And it&#8217;s hard. The manuals and tutorials aren&#8217;t very good. Lesson 2: Add and subtract. Lesson 3: Multiple regression! I got lost somewhere in there. Also, trying to find outside advice for using R is hard in itself because it&#8217;s hard to Google. The internet has a lot of things, and the letter R is one of them.<\/p>\n","protected":false},"excerpt":{"rendered":"<p>I do a lot of work in statistics. In fact, almost all of my work involves statistics. I don&#8217;t collect data in the field. I don&#8217;t do things in a lab. I sit in front of my computer working with data sets and spreadsheets. Lately I&#8217;ve been doing a lot&#8230; <a href=\"https:\/\/www.keithcchan.com\/wordpress\/2007\/02\/high-price-of-number-crunching.html\" class=\"readmore\">Read more<span class=\"screen-reader-text\">&#8220;The High Price of Number Crunching&#8221;<\/span><\/a><\/p>\n","protected":false},"author":1,"featured_media":0,"comment_status":"open","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"ngg_post_thumbnail":0,"footnotes":"","jetpack_publicize_message":"","jetpack_is_tweetstorm":false,"jetpack_publicize_feature_enabled":true,"jetpack_social_post_already_shared":false,"jetpack_social_options":{"image_generator_settings":{"template":"highway","enabled":false}}},"categories":[20],"tags":[181,21,10],"jetpack_publicize_connections":[],"jetpack_featured_media_url":"","jetpack_sharing_enabled":true,"_links":{"self":[{"href":"https:\/\/www.keithcchan.com\/wordpress\/wp-json\/wp\/v2\/posts\/17"}],"collection":[{"href":"https:\/\/www.keithcchan.com\/wordpress\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/www.keithcchan.com\/wordpress\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/www.keithcchan.com\/wordpress\/wp-json\/wp\/v2\/users\/1"}],"replies":[{"embeddable":true,"href":"https:\/\/www.keithcchan.com\/wordpress\/wp-json\/wp\/v2\/comments?post=17"}],"version-history":[{"count":0,"href":"https:\/\/www.keithcchan.com\/wordpress\/wp-json\/wp\/v2\/posts\/17\/revisions"}],"wp:attachment":[{"href":"https:\/\/www.keithcchan.com\/wordpress\/wp-json\/wp\/v2\/media?parent=17"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/www.keithcchan.com\/wordpress\/wp-json\/wp\/v2\/categories?post=17"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/www.keithcchan.com\/wordpress\/wp-json\/wp\/v2\/tags?post=17"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}