<?xml version="1.0" encoding="UTF-8"?><rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
		>
<channel>
	<title>Comments on: Age, Membership Length and Reputation Distribution on StackOverflow</title>
	<atom:link href="http://www.somethinkodd.com/oddthinking/2008/10/24/age-membership-length-and-reputation-distribution-on-stackoverflow/feed/" rel="self" type="application/rss+xml" />
	<link>http://www.somethinkodd.com/oddthinking/2008/10/24/age-membership-length-and-reputation-distribution-on-stackoverflow/</link>
	<description>A blog for odd things and odd thoughts.</description>
	<lastBuildDate>Wed, 01 Feb 2012 22:21:16 +0000</lastBuildDate>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>http://wordpress.org/?v=3.3.1</generator>
	<item>
		<title>By: Julian</title>
		<link>http://www.somethinkodd.com/oddthinking/2008/10/24/age-membership-length-and-reputation-distribution-on-stackoverflow/comment-page-1/#comment-169634</link>
		<dc:creator>Julian</dc:creator>
		<pubDate>Tue, 13 Jan 2009 04:28:52 +0000</pubDate>
		<guid isPermaLink="false">http://www.somethinkodd.com/oddthinking/?p=693#comment-169634</guid>
		<description>Dear James,

I&#039;ve decided that you are probably one of those people who show their appreciation of a free gift by taking mild potshots at the giver. So, thank you very much for your kind words. :-)

The reason I changed to Python was simple. I am not familiar enough with Perl; I am familiar with Python. This is not an attack on Perl, but merely a quirk of history that I once needed to learn Python, and to date, I have not needed to learn Perl.

There are several reasons that I didn&#039;t publish the code. Each individual reason isn&#039;t strong, but together they made a case.

1) It is not my default position to publish code, unless there is a reason. It is my default position to not publish code, unless there is a reason. I see no reason here.

2) The code isn&#039;t what this article is about. The code isn&#039;t interesting. The conclusion about Stack Overflow demographics is what it is about. The rest (including the offer to make the code available) is for scientific credibility about the conclusion, not because all software should be free.

3) I am not willing to offer support for the software. If I made it available, I was likely to get support calls.

4) Yes, the code is bad from a code smell perspective. I was in a rush, and I didn&#039;t spend the time to make it clean, readable, reliable, maintainable, efficient, (etc.) I didn&#039;t even spend the time to package it up. I don&#039;t want to associate my name to the source-code. The code was excellent from the perspective of one user, in a rush, for one-time use.

5) I felt a little uncomfortable about scraping someone-else&#039;s site without permission rather than using the API (which they said was coming, but hadn&#039;t published at the time). If I made it freely available, I would be encouraging others to do the wrong thing too.

6) I didn&#039;t think there would be any real demand. (In fact, I have had a total of two requests, and one person wrote back to say it wasn&#039;t what they were looking for.)</description>
		<content:encoded><![CDATA[<p>Dear James,</p>
<p>I&#8217;ve decided that you are probably one of those people who show their appreciation of a free gift by taking mild potshots at the giver. So, thank you very much for your kind words. <img src='http://www.somethinkodd.com/oddthinking/wp-includes/images/smilies/icon_smile.gif' alt=':-)' class='wp-smiley' /> </p>
<p>The reason I changed to Python was simple. I am not familiar enough with Perl; I am familiar with Python. This is not an attack on Perl, but merely a quirk of history that I once needed to learn Python, and to date, I have not needed to learn Perl.</p>
<p>There are several reasons that I didn&#8217;t publish the code. Each individual reason isn&#8217;t strong, but together they made a case.</p>
<p>1) It is not my default position to publish code, unless there is a reason. It is my default position to not publish code, unless there is a reason. I see no reason here.</p>
<p>2) The code isn&#8217;t what this article is about. The code isn&#8217;t interesting. The conclusion about Stack Overflow demographics is what it is about. The rest (including the offer to make the code available) is for scientific credibility about the conclusion, not because all software should be free.</p>
<p>3) I am not willing to offer support for the software. If I made it available, I was likely to get support calls.</p>
<p>4) Yes, the code is bad from a code smell perspective. I was in a rush, and I didn&#8217;t spend the time to make it clean, readable, reliable, maintainable, efficient, (etc.) I didn&#8217;t even spend the time to package it up. I don&#8217;t want to associate my name to the source-code. The code was excellent from the perspective of one user, in a rush, for one-time use.</p>
<p>5) I felt a little uncomfortable about scraping someone-else&#8217;s site without permission rather than using the API (which they said was coming, but hadn&#8217;t published at the time). If I made it freely available, I would be encouraging others to do the wrong thing too.</p>
<p>6) I didn&#8217;t think there would be any real demand. (In fact, I have had a total of two requests, and one person wrote back to say it wasn&#8217;t what they were looking for.)</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: James</title>
		<link>http://www.somethinkodd.com/oddthinking/2008/10/24/age-membership-length-and-reputation-distribution-on-stackoverflow/comment-page-1/#comment-169544</link>
		<dc:creator>James</dc:creator>
		<pubDate>Mon, 12 Jan 2009 13:19:41 +0000</pubDate>
		<guid isPermaLink="false">http://www.somethinkodd.com/oddthinking/?p=693#comment-169544</guid>
		<description>The original code was in perl, why the change to python?  Is the code that bad that its only availabe on request?</description>
		<content:encoded><![CDATA[<p>The original code was in perl, why the change to python?  Is the code that bad that its only availabe on request?</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Julian</title>
		<link>http://www.somethinkodd.com/oddthinking/2008/10/24/age-membership-length-and-reputation-distribution-on-stackoverflow/comment-page-1/#comment-149483</link>
		<dc:creator>Julian</dc:creator>
		<pubDate>Fri, 24 Oct 2008 23:07:18 +0000</pubDate>
		<guid isPermaLink="false">http://www.somethinkodd.com/oddthinking/?p=693#comment-149483</guid>
		<description>Asd,

Scatterplots are great for sparse data-sets.

However, this one is very clumpy, with some co=ordinates on the scatterplot having over a hundred data-points. (e.g. there are 137 26-year-old users with a reputation of 1)

If there was an occasional 2 or 3 duplicates, you can put the number of duplicates at the co-ordinate instead of the cross, but that loses impact when the numbers get higher.

I have sometimes used the technique of adding a teensy amount of randomness to get illustrative clusters of crosses around the real data-point, but that stops making sense around 10 duplicates.

I don&#039;t have any other techniques for representing this data meaningfully on a scatterplot. Let me know if you have any ideas.</description>
		<content:encoded><![CDATA[<p>Asd,</p>
<p>Scatterplots are great for sparse data-sets.</p>
<p>However, this one is very clumpy, with some co=ordinates on the scatterplot having over a hundred data-points. (e.g. there are 137 26-year-old users with a reputation of 1)</p>
<p>If there was an occasional 2 or 3 duplicates, you can put the number of duplicates at the co-ordinate instead of the cross, but that loses impact when the numbers get higher.</p>
<p>I have sometimes used the technique of adding a teensy amount of randomness to get illustrative clusters of crosses around the real data-point, but that stops making sense around 10 duplicates.</p>
<p>I don&#8217;t have any other techniques for representing this data meaningfully on a scatterplot. Let me know if you have any ideas.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Julian</title>
		<link>http://www.somethinkodd.com/oddthinking/2008/10/24/age-membership-length-and-reputation-distribution-on-stackoverflow/comment-page-1/#comment-149481</link>
		<dc:creator>Julian</dc:creator>
		<pubDate>Fri, 24 Oct 2008 22:59:28 +0000</pubDate>
		<guid isPermaLink="false">http://www.somethinkodd.com/oddthinking/?p=693#comment-149481</guid>
		<description>Richard,

Re: Colour Scheme

Yes, I agree, but I knew if I didn&#039;t get it out by yesterday morning, I wouldn&#039;t get a chance to complete it for several days, so I didn&#039;t finesse the charts as much as I would like.

As a member of the Epoch Generation, of course I believe that age has a tendency towards surprisingly high intelligence, knowledge and communication skills which results in a higher reputation score. However, I strongly suspect that this is just an artifact within the normal bounds of the variability of the data.</description>
		<content:encoded><![CDATA[<p>Richard,</p>
<p>Re: Colour Scheme</p>
<p>Yes, I agree, but I knew if I didn&#8217;t get it out by yesterday morning, I wouldn&#8217;t get a chance to complete it for several days, so I didn&#8217;t finesse the charts as much as I would like.</p>
<p>As a member of the Epoch Generation, of course I believe that age has a tendency towards surprisingly high intelligence, knowledge and communication skills which results in a higher reputation score. However, I strongly suspect that this is just an artifact within the normal bounds of the variability of the data.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Asd</title>
		<link>http://www.somethinkodd.com/oddthinking/2008/10/24/age-membership-length-and-reputation-distribution-on-stackoverflow/comment-page-1/#comment-149357</link>
		<dc:creator>Asd</dc:creator>
		<pubDate>Fri, 24 Oct 2008 15:53:02 +0000</pubDate>
		<guid isPermaLink="false">http://www.somethinkodd.com/oddthinking/?p=693#comment-149357</guid>
		<description>A scatter plot might be more revealing for the last one.</description>
		<content:encoded><![CDATA[<p>A scatter plot might be more revealing for the last one.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Richard Atkins</title>
		<link>http://www.somethinkodd.com/oddthinking/2008/10/24/age-membership-length-and-reputation-distribution-on-stackoverflow/comment-page-1/#comment-149324</link>
		<dc:creator>Richard Atkins</dc:creator>
		<pubDate>Fri, 24 Oct 2008 13:37:41 +0000</pubDate>
		<guid isPermaLink="false">http://www.somethinkodd.com/oddthinking/?p=693#comment-149324</guid>
		<description>&#039;Tis a pity the colour scheme isn&#039;t consistent across all the charts. I might be misreading the age/rep% graph, but it looks like there&#039;s some sorta interesting data points at 38 and 51 - these correspond to the highest and lowest proportions of high rep users (&gt; 200). I&#039;m guessing the 51yos are simply underrepresented at ~20 users, but what&#039;s with the ~200 38yos? They seem to have higher rep as a percentage by 5% over everyone else. Is there something special about people from &lt;a href=&quot;http://en.wikipedia.org/wiki/Unix_time&quot; rel=&quot;nofollow&quot;&gt;Epoch&lt;/a&gt;?</description>
		<content:encoded><![CDATA[<p>&#8216;Tis a pity the colour scheme isn&#8217;t consistent across all the charts. I might be misreading the age/rep% graph, but it looks like there&#8217;s some sorta interesting data points at 38 and 51 &#8211; these correspond to the highest and lowest proportions of high rep users (&gt; 200). I&#8217;m guessing the 51yos are simply underrepresented at ~20 users, but what&#8217;s with the ~200 38yos? They seem to have higher rep as a percentage by 5% over everyone else. Is there something special about people from <a href="http://en.wikipedia.org/wiki/Unix_time" rel="nofollow" class="wikipedia">Epoch</a>?</p>
]]></content:encoded>
	</item>
</channel>
</rss>

