Detecting Social Influence in Event Cascades by Comparing Discriminative Rankers

Sandeep Soni, Shawn Ling Ramirez, Jacob Joseph Eisenstein
Proceedings of Machine Learning Research, PMLR 104:78-99, 2019.

Abstract

The global dynamics of event cascades are often governed by the local dynamics of peer influence. However, detecting social influence from observational data is challenging due to confounds like homophily and practical issues like missing data. We propose a simple discriminative method to detect influence from observational data. The core of the approach is to train a ranking algorithm to predict the source of the next event in a cascade, and compare its out-of-sample accuracy against a competitive baseline which lacks access to features corresponding to social influence. We analyze synthetically generated data to show that this method correctly identifies influence in the presence of confounds, and is robust to both missing data and misspecification — unlike well-known alternatives. We apply the method to two real-world datasets: (1) the co-sponsorship of legislation in the U.S. House of Representatives on a social network of shared campaign donors; (2) rumors about the Higgs boson discovery on a follower network of 105 Twitter accounts. Our model identifies the role of social influence in these scenarios and uses it to make more accurate predictions about the future trajectory of cascades.

Cite this Paper


BibTeX
@InProceedings{pmlr-v104-soni19a, title = {Detecting Social Influence in Event Cascades by Comparing Discriminative Rankers}, author = {Soni, Sandeep and Ramirez, Shawn Ling and Eisenstein, Jacob Joseph}, booktitle = {Proceedings of Machine Learning Research}, pages = {78--99}, year = {2019}, editor = {}, volume = {104}, series = {Proceedings of Machine Learning Research}, month = {05 Aug}, publisher = {PMLR}, pdf = {http://proceedings.mlr.press/v104/soni19a/soni19a.pdf}, url = {https://proceedings.mlr.press/v104/soni19a.html}, abstract = {The global dynamics of event cascades are often governed by the local dynamics of peer influence. However, detecting social influence from observational data is challenging due to confounds like homophily and practical issues like missing data. We propose a simple discriminative method to detect influence from observational data. The core of the approach is to train a ranking algorithm to predict the source of the next event in a cascade, and compare its out-of-sample accuracy against a competitive baseline which lacks access to features corresponding to social influence. We analyze synthetically generated data to show that this method correctly identifies influence in the presence of confounds, and is robust to both missing data and misspecification — unlike well-known alternatives. We apply the method to two real-world datasets: (1) the co-sponsorship of legislation in the U.S. House of Representatives on a social network of shared campaign donors; (2) rumors about the Higgs boson discovery on a follower network of 105 Twitter accounts. Our model identifies the role of social influence in these scenarios and uses it to make more accurate predictions about the future trajectory of cascades.} }
Endnote
%0 Conference Paper %T Detecting Social Influence in Event Cascades by Comparing Discriminative Rankers %A Sandeep Soni %A Shawn Ling Ramirez %A Jacob Joseph Eisenstein %B Proceedings of Machine Learning Research %C Proceedings of Machine Learning Research %D 2019 %E %F pmlr-v104-soni19a %I PMLR %P 78--99 %U https://proceedings.mlr.press/v104/soni19a.html %V 104 %X The global dynamics of event cascades are often governed by the local dynamics of peer influence. However, detecting social influence from observational data is challenging due to confounds like homophily and practical issues like missing data. We propose a simple discriminative method to detect influence from observational data. The core of the approach is to train a ranking algorithm to predict the source of the next event in a cascade, and compare its out-of-sample accuracy against a competitive baseline which lacks access to features corresponding to social influence. We analyze synthetically generated data to show that this method correctly identifies influence in the presence of confounds, and is robust to both missing data and misspecification — unlike well-known alternatives. We apply the method to two real-world datasets: (1) the co-sponsorship of legislation in the U.S. House of Representatives on a social network of shared campaign donors; (2) rumors about the Higgs boson discovery on a follower network of 105 Twitter accounts. Our model identifies the role of social influence in these scenarios and uses it to make more accurate predictions about the future trajectory of cascades.
APA
Soni, S., Ramirez, S.L. & Eisenstein, J.J.. (2019). Detecting Social Influence in Event Cascades by Comparing Discriminative Rankers. Proceedings of Machine Learning Research, in Proceedings of Machine Learning Research 104:78-99 Available from https://proceedings.mlr.press/v104/soni19a.html.

Related Material