Wikidata:WikidataCon 2019/Program/Sessions/Wikidata knowledge base completion using multilingual Wikipedia fact extraction

From Wikidata
Jump to navigation Jump to search
WikidataCon logo ID : SUB-120 Wikidata knowledge base completion using multilingual Wikipedia fact extraction
Speaker(s): Anders Sandholm, Michael Ringgaard Timeblock: tb-saturday Start: 15:00 Slides:
Wikidata fact annotation for multilingual Wikipedia
Room: Kleist Duration: 25min
Abstract:

In this session we’ll talk about the SLING project at Google. The aim of the project is to learn to read and understand Wikipedia articles in many languages in terms existing knowledge, i.e., specific entities and properties in Wikidata. A key part of the project is that we use the same representation for both knowledge and document annotation, namely frame semantics. The Sling parser can be trained to produce frame semantic representations of text directly without any explicit intervening linguistic representation.

The project is a work in progress and we have built a number of the components needed, like the SLING frame store (for building and manipulating frame semantic graph structures) and the Wiki flow pipeline which can take a raw dump of Wikidata and convert this into one big frame graph loadable into memory for fast graph traversal. The SLING Python API provides easy access to all this information.

Type: Presentation
Keywords: multilingual
Notes: #WikidataCon2019_SUB-120
Streaming: https://streaming.media.ccc.de/wikidatacon2019/kleist
Video: https://media.ccc.de/v/wikidatacon2019-1120-wikidata_knowledge_base_completion_using_multilingual_wikipedia_fact_extraction#t=242
People planning to attend:
  1. Yupik (talk) 03:07, 29 September 2019 (UTC)
  2. ...
Next session in this room: Wikicite panel