Completeness and Reliability of Wikipedia Infoboxes in Various Languages

Włodzimierz Lewoniewski

Abstract

Despite its popularity, Wikipedia is often criticized for poor information quality. This online knowledge base currently consists of over 45 million articles in almost 300 languages. Wikipedia articles often include special tables that briefly present important information about persons, places, products, organizations and other subjects. Such a table is usually placed in a visible part of the article, and the Wikipedia community calls it an “infobox”. Infoboxes contain information in a structured form, which allows popular public databases such as DBpedia to be enriched automatically. Wikipedia users can edit infoboxes in different languages independently, so the quality of information about the same subject may differ between language versions. This article examines the completeness and reliability of infoboxes on different topics in seven language versions of Wikipedia: English, German, French, Polish, Russian, Ukrainian and Belarusian. The results of the study can be used for automatically assessing and improving the quality of information in Wikipedia as well as in other public knowledge bases.
Author: Włodzimierz Lewoniewski (WIiGE / KIE), Department of Information Systems
Pages: 295-305
Publication size in sheets: 0.5
Book: Abramowicz Witold (eds.): Business Information Systems Workshops, Lecture Notes in Business Information Processing, vol. 303, 2017, Springer, ISBN 978-3-319-69022-3, [978-3-319-69023-0], 308 p., DOI:10.1007/978-3-319-69023-0
Keywords in English: Wikipedia, Infobox quality, Reliability, Completeness, DBpedia
DOI: 10.1007/978-3-319-69023-0_25
URL: https://link.springer.com/chapter/10.1007/978-3-319-69023-0_25
Language: en (English)
Score (nominal): 70
Score source: conferenceList
Score: Ministerial score = 70.0, 24-03-2020, ChapterFromConference
Publication indicators: WoS Citations = 0
Citation count*: 3 (2020-09-16)
* The presented citation count is obtained through analysis of information on the Internet and is close to the number calculated by the Publish or Perish system.