+61 3 5227 1266
Centre for Pattern Recognition and Data Analytics
School of Information Technology
Locked Bag 20000
GEELONG VIC 3220
B. Adams, D. Phung, and S. Venkatesh. Discovery of latent subcommunities in a blog's readership. ACM Trans. Web, 4(3):1-30, 2010.
The blogosphere has grown to be a mainstream forum of social interaction as well as a commercially attractive source of information and influence. Tools are needed to better understand how communities that adhere to individual blogs are constituted in order to facilitate new personal, socially-focussed browsing paradigms, and understand how blog content is consumed, which is of interest to blog authors, big media and search. We present a novel approach to blog sub-community characterization by modelling individual blog readers using mixtures of an extension to the LDA family that jointly models phrases and time, Ngram Topic over Time (NTOT), and cluster with a number of similarity measures using Affinity Propagation. We experiment with two datasets: a small set of blogs whose authors provide feedback, and a set of popular, highly commented blogs, which provide indicators of algorithm scalability and interpretability without prior knowledge of a given blog. The results offer useful insight to the blog authors about their commenting community, and are observed to offer an integrated perspective on the topics of discussion and members engaged in those discussions for unfamiliar blogs. Our approach also holds promise as a component of solutions to related problems, such as online entity resolution and role discovery.
Deakin University CRICOS Provider Code: 00113B
27th February 2015