{"id":1455,"date":"2011-01-18T09:24:02","date_gmt":"2011-01-17T23:24:02","guid":{"rendered":"http:\/\/www.somethinkodd.com\/oddthinking\/?p=1455"},"modified":"2011-01-18T09:24:02","modified_gmt":"2011-01-17T23:24:02","slug":"comment-profiling","status":"publish","type":"post","link":"https:\/\/www.somethinkodd.com\/oddthinking\/2011\/01\/18\/comment-profiling\/","title":{"rendered":"Comment Profiling"},"content":{"rendered":"<p>Here&#8217;s a bleary-eyed half-considered idea that I will never follow up.<\/p>\n<p>Take a web-site with a lot of comments from a regular community &#8211; forget OddThinking, it isn&#8217;t nearly big enough; find a blog in the top 100, or StackOverflow or the like.<\/p>\n<p>Grab all the comments, and group them by author. Discard any author with less than a threshold number of words in their personal corpus.<\/p>\n<p><em>[Magic happens here]<\/em> Use your computational linguistic skills to evaluate some metrics about the voice of each commenter &#8211; the mood, the vocabulary, their sentence length and complexity, etc. <\/p>\n<div class=\"aside\">There is a word in linguistics that describes the word-choice different people make; I am bleary-eyed and tired, and can&#8217;t remember it. Ironic: I can&#8217;t choose the right word to describe word-choice.<\/div>\n<p>Now, rank each article by the average deviation of each comment from the typical comments by that author, according to your magical linguistic measures.<\/p>\n<p>The result is a list of articles that are most likely to provoke the readers out of their own personal ruts.<\/p>\n<p>My hypothesis is that these will be particularly interesting articles.<\/p>\n<p>Well, either that, or they will be articles saying things like &#8220;Happy New Year&#8221;, and getting a whole lot of &#8220;Happy New Year!&#8221; responses&#8230;<\/p>\n<p>That&#8217;s as far as I got. I think I am going back to bed, now.<\/p>\n","protected":false},"excerpt":{"rendered":"<p>Can you find interesting web articles by profiling the comments?<\/p>\n","protected":false},"author":1,"featured_media":0,"comment_status":"open","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"_s2mail":"yes","footnotes":""},"categories":[27],"tags":[],"class_list":["post-1455","post","type-post","status-publish","format-standard","hentry","category-thoughts-from-the-shower"],"_links":{"self":[{"href":"https:\/\/www.somethinkodd.com\/oddthinking\/wp-json\/wp\/v2\/posts\/1455","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/www.somethinkodd.com\/oddthinking\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/www.somethinkodd.com\/oddthinking\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/www.somethinkodd.com\/oddthinking\/wp-json\/wp\/v2\/users\/1"}],"replies":[{"embeddable":true,"href":"https:\/\/www.somethinkodd.com\/oddthinking\/wp-json\/wp\/v2\/comments?post=1455"}],"version-history":[{"count":3,"href":"https:\/\/www.somethinkodd.com\/oddthinking\/wp-json\/wp\/v2\/posts\/1455\/revisions"}],"predecessor-version":[{"id":1458,"href":"https:\/\/www.somethinkodd.com\/oddthinking\/wp-json\/wp\/v2\/posts\/1455\/revisions\/1458"}],"wp:attachment":[{"href":"https:\/\/www.somethinkodd.com\/oddthinking\/wp-json\/wp\/v2\/media?parent=1455"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/www.somethinkodd.com\/oddthinking\/wp-json\/wp\/v2\/categories?post=1455"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/www.somethinkodd.com\/oddthinking\/wp-json\/wp\/v2\/tags?post=1455"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}