skip to main content
10.1145/2911451.2914745acmconferencesArticle/Chapter ViewAbstractPublication PagesirConference Proceedingsconference-collections
short-paper
Public Access

Controversy Detection in Wikipedia Using Collective Classification

Authors Info & Claims
Published:07 July 2016Publication History

ABSTRACT

Concerns over personalization in IR have sparked an interest in detection and analysis of controversial topics. Accurate detection would enable many beneficial applications, such as alerting search users to controversy. Wikipedia's broad coverage and rich metadata offer a valuable resource for this problem. We hypothesize that intensities of controversy among related pages are not independent; thus, we propose a stacked model which exploits the dependencies among related pages. Our approach improves classification of controversial web pages when compared to a model that examines each page in isolation, demonstrating that controversial topics exhibit homophily. Using notions of similarity to construct a subnetwork for collective classification, rather than using the default network present in the relational data, leads to improved classification with wider applications for semi-structured datasets, with the effects most pronounced when a small set of neighbors is used.

References

  1. U. Brandes, P. Kenis, J. Lerner, and D. van Raaij. Network analysis of collaboration structure in Wikipedia. WWW, 2009. Google ScholarGoogle ScholarDigital LibraryDigital Library
  2. M. Bröcheler, L. Mihalkova, and L. Getoor. Probabilistic Similarity Logic. CoRR, 2012.Google ScholarGoogle Scholar
  3. S. Chakrabarti, B. Dom, and P. Indyk. Enhanced hypertext categorization using hyperlinks. 1998.Google ScholarGoogle ScholarDigital LibraryDigital Library
  4. S. Das, A. Lavoie, and M. Magdon-Ismail. Manipulation Among the Arbiters of Collective Intelligence. CIKM, 2013. Google ScholarGoogle ScholarDigital LibraryDigital Library
  5. S. Dori-Hacohen and J. Allan. Detecting controversy on the web. In CIKM, 2013. Google ScholarGoogle ScholarDigital LibraryDigital Library
  6. S. Dori-Hacohen, E. Yom-Tov, and J. Allan. Navigating Controversy as a Complex Search Task. In Supporting Complex Search Tasks, 2015.Google ScholarGoogle Scholar
  7. A. Fast and D. Jensen. Why stacked models perform effective collective classification. In ICDM, 2008. Google ScholarGoogle ScholarDigital LibraryDigital Library
  8. R. Jesus, M. Schwartz, and S. Lehmann. Bipartite networks of Wikipedia's articles and authors - a meso-level approach. WikiSym, 2009. Google ScholarGoogle ScholarDigital LibraryDigital Library
  9. A. Kittur, B. Suh, B. A. Pendleton, E. H. Chi, L. Angeles, and P. Alto. He Says, She Says: Conflict and Coordination in Wikipedia. In CHI, 2007. Google ScholarGoogle ScholarDigital LibraryDigital Library
  10. Z. Kou and W. W. Cohen. Stacked Graphical Models for Efficient Inference in Markov Random Fields. SDM, 2007.Google ScholarGoogle ScholarCross RefCross Ref
  11. E. Pariser. The Filter Bubble: What the Internet is hiding from you. Penguin Press HC, 2011. Google ScholarGoogle ScholarDigital LibraryDigital Library
  12. H. Sepehri Rad and D. Barbosa. Identifying controversial articles in Wikipedia. WikiSym, 2012. Google ScholarGoogle ScholarDigital LibraryDigital Library
  13. B.-q. Vuong, E.-p. Lim, A. Sun, M.-T. Le, H. W. Lauw, and K. Chang. On ranking controversies in Wikipedia: models and evaluation. In WSDM, 2008. Google ScholarGoogle ScholarDigital LibraryDigital Library
  14. T. Yasseri, R. Sumi, A. Rung, A. Kornai, and J. Kertész. Dynamics of conflicts in Wikipedia. PloS one, 2012.Google ScholarGoogle Scholar

Index Terms

  1. Controversy Detection in Wikipedia Using Collective Classification

                      Recommendations

                      Comments

                      Login options

                      Check if you have access through your login credentials or your institution to get full access on this article.

                      Sign in
                      • Published in

                        cover image ACM Conferences
                        SIGIR '16: Proceedings of the 39th International ACM SIGIR conference on Research and Development in Information Retrieval
                        July 2016
                        1296 pages
                        ISBN:9781450340694
                        DOI:10.1145/2911451

                        Copyright © 2016 ACM

                        Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than the author(s) must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected].

                        Publisher

                        Association for Computing Machinery

                        New York, NY, United States

                        Publication History

                        • Published: 7 July 2016

                        Permissions

                        Request permissions about this article.

                        Request Permissions

                        Check for updates

                        Qualifiers

                        • short-paper

                        Acceptance Rates

                        SIGIR '16 Paper Acceptance Rate62of341submissions,18%Overall Acceptance Rate792of3,983submissions,20%

                      PDF Format

                      View or Download as a PDF file.

                      PDF

                      eReader

                      View online with eReader.

                      eReader