Modeling Popularity and Reliability of Sources in Multilingual Wikipedia

Włodzimierz Lewoniewski , Krzysztof Węcel , Witold Abramowicz


One of the most important factors impacting quality of content in Wikipedia is presence of reliable sources. By following references, readers can verify facts or find more details about described topic. A Wikipedia article can be edited independently in any of over 300 languages, even by anonymous users, therefore information about the same topic may be inconsistent. This also applies to use of references in different language versions of a particular article, so the same statement can have different sources. In this paper we analyzed over 40 million articles from the 55 most developed language versions of Wikipedia to extract information about over 200 million references and find the most popular and reliable sources. We presented 10 models for the assessment of the popularity and reliability of the sources based on analysis of meta information about the references in Wikipedia articles, page views and authors of the articles. Using DBpedia and Wikidata we automatically identified the alignment of the sources to a specific domain. Additionally, we analyzed the changes of popularity and reliability in time and identified growth leaders in each of the considered months. The results can be used for quality improvements of the content in different languages versions of Wikipedia.
Author Włodzimierz Lewoniewski (WIiGE / KIE)
Włodzimierz Lewoniewski,,
- Department of Information Systems
, Krzysztof Węcel (WIiGE / KIE)
Krzysztof Węcel,,
- Department of Information Systems
, Witold Abramowicz (WIiGE / KIE)
Witold Abramowicz,,
- Department of Information Systems
Journal seriesInformation (Switzerland), [Information (Switzerland)], ISSN 2078-2489, (N/A 40 pkt)
Issue year2020
Publication size in sheets1.8
Keywords in PolishWikipedia, odnośnik, źródło, wiarygodność, popularność, Wikidane; DBpedia, jakość danych
Keywords in EnglishWikipedia, reference, source, reliability, popularity, Wikidata, DBpedia, data quality
ASJC Classification1710 Information Systems
Languageen angielski
Score (nominal)40
Score sourcejournalList
ScoreMinisterial score = 40.0, 12-09-2020, ArticleFromJournal
Publication indicators WoS Citations = 1; Scopus SNIP (Source Normalised Impact per Paper): 2018 = 1.034
Citation count*7 (2020-08-09)
Share Share

Get link to the record

* presented citation count is obtained through Internet information analysis and it is close to the number calculated by the Publish or Perish system.
Are you sure?