Wikidata Statistics: What, Where, and How?

Video duration
00:25:26
Language
English
Abstract

We will present and discuss various sources and analytical tools for accessing, analyzing, and visualizing Wikidata-related statistics. We will introduce the difference between (a) Wikidata statistics and (b) statistics based on Wikidata re-use across the Wikimedia projects, and illustrate the contexts in which each makes more sense.
A special focus will be placed on the Wikidata Concepts Monitor (WDCM) system and its derivatives, which enable our users and editors to better understand the distribution of content in Wikimedia projects. We will also touch upon the Wikidata Languages Landscape project, a set of dashboards that analyze the representation and use of languages in Wikidata.

Although no formal background in statistics or programming is necessary to follow the discussion, we will use R during the session to illustrate what can be done with our numbers, where, and how. Finally, we will discuss ways to act upon the introduced statistics and indicators to potentially improve existing connections among different Wikimedia communities and establish new ones.

Talk ID
wikidatacon2019-1091
Event:
wikidatacon2019
Day
2
Room
Kleist
Start
2:30 p.m.
Duration
00:25:00
Track
None
Type of
Talk
Speaker
Goran S. Milovanović
Talk Slug & media link
wikidatacon2019-1091-wikidata_statistics_what_where_and_how_