First, is anyone here familiar enough with the work above to explain it better, and did anyone perhaps know about it before it was posted to ArXiv?
I was not aware of these two articles until very recently, but there has been a fair amount of interest in MathOverflow in the social computing community. (More here, for example.) I don't think I can explain the work any better than they do. When I was curious about some details, I contacted the authors directly and they were always very responsive.
Second, although this may be a curmudgeonly viewpoint, I suspect that MathOverflow is being given more credit for its success than it is due, possibly because of insufficient historical background (e.g. the Manhattan Project, older forms of 'crowdsourcing'); does anyone here share the concern that the claims of effectiveness might be exaggerated?
Human-computer interaction is a fairly recent field of study, but it didn't spawn from thin air. These studies use specific methodologies to explore different aspects of MathOverflow, and all the papers I read describe their methodology in sufficient detail. In particular, the statistics in the two papers are based on a sample of 100 questions within the [gr.group-theory] tag, drawn from April 2011 and July 2010 to obtain a spread. This is obviously a biased sample, but it's an interesting choice since [gr.group-theory] is currently the fifth most active tag.
I am not sure I understand what you (joro) mean, but there are plenty of reasons why I think (open) questions with comment-answers are not good. That it would affect the quality of an automatic analysis is not high (or at all) on the list of reasons for this, though.
(as 90/100 of the questions are answered, fully or in part)
Isn't this a reason for "easy" questions, for which the answer is only a highly voted comment, to have a real (possibly CW) answer?
Just browsed the papers and suppose the results come from automatic analysis of the public dumps.
The main credit given is that MO is described as 'very effective' (as 90/100 of the questions are answered, fully or in part). And, I would share the opinion that MO is very effective at getting a certain type of question answered. (The types of questions are also analysed/described.)
There are some details where I feel the description is slightly off, in particular concerning meta.MO. But in general, as said, it seems quite accurate. Finally, it seems the term 'social machine' does not originate with the authors of this article but is some sort of technical term in that context.