TY - GEN
T1 - Retrieving rising stars in focused community question-answering
AU - Le, Long T.
AU - Shah, Chirag
N1 - Funding Information: This work is partially funded by the US National Science Foundation (NSF) BCC-SBE award no. 1244704. Publisher Copyright: © Springer-Verlag Berlin Heidelberg 2016.
PY - 2016
Y1 - 2016
N2 - In Community Question Answering (CQA) forums, there is typically a small fraction of users who provide high-quality posts and earn a very high reputation status from the community. These top contributors are critical to the community since they drive the development of the site and attract traffic from Internet users. Identifying these individuals could be highly valuable, but this is not an easy task. Unlike publication or social networks, most CQA sites lack information regarding peers, friends, or collaborators, which can be an important indicator signaling future success or performance. In this paper, we attempt to perform this analysis by extracting different sets of features to predict future contribution. The experiment covers 376,000 users who remain active in Stack Overflow for at least one year and together contribute more than 21 million posts. One of the highlights of our approach is that we can identify rising stars after short observations. Our approach achieves high accuracy, 85%, when predicting whether a user will become a top contributor after a few weeks of observation. In a slightly different problem, in which we observe only a few posts by a user, our method achieves accuracy higher than 90%. Our approach provides higher accuracy than baseline methods, including a popular time series analysis. Furthermore, our methods are robust to different classifier algorithms. Identifying the rising stars early could help CQA administrators gain an overview of the site’s future and ensure that enough incentive and support are given to potential contributors.
AB - In Community Question Answering (CQA) forums, there is typically a small fraction of users who provide high-quality posts and earn a very high reputation status from the community. These top contributors are critical to the community since they drive the development of the site and attract traffic from Internet users. Identifying these individuals could be highly valuable, but this is not an easy task. Unlike publication or social networks, most CQA sites lack information regarding peers, friends, or collaborators, which can be an important indicator signaling future success or performance. In this paper, we attempt to perform this analysis by extracting different sets of features to predict future contribution. The experiment covers 376,000 users who remain active in Stack Overflow for at least one year and together contribute more than 21 million posts. One of the highlights of our approach is that we can identify rising stars after short observations. Our approach achieves high accuracy, 85%, when predicting whether a user will become a top contributor after a few weeks of observation. In a slightly different problem, in which we observe only a few posts by a user, our method achieves accuracy higher than 90%. Our approach provides higher accuracy than baseline methods, including a popular time series analysis. Furthermore, our methods are robust to different classifier algorithms. Identifying the rising stars early could help CQA administrators gain an overview of the site’s future and ensure that enough incentive and support are given to potential contributors.
UR - http://www.scopus.com/inward/record.url?scp=84961112717&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=84961112717&partnerID=8YFLogxK
U2 - https://doi.org/10.1007/978-3-662-49390-8_3
DO - https://doi.org/10.1007/978-3-662-49390-8_3
M3 - Conference contribution
SN - 9783662493892
T3 - Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
SP - 25
EP - 36
BT - Intelligent Information and Database Systems - 8th Asian Conference, ACIIDS 2016, Proceedings
A2 - Hong, Tzung-Pei
A2 - Nguyen, Ngoc Thanh
A2 - Trawinski, Bogdan
A2 - Fujita, Hamido
PB - Springer Verlag
T2 - 8th Asian Conference on Intelligent Information and Database Systems, ACIIDS 2016
Y2 - 14 March 2016 through 16 March 2016
ER -