User talk:Lydia Pintscher (WMDE)/Archive/3

From Wikidata
Jump to navigation Jump to search

09:37, 21 October 2013 (UTC)

10:06, 28 October 2013 (UTC)

Structured linguistic / lexicographic data

Hello Lydia,
Could you tell me where the Wikidata team is about the structured-data for Wiktionary ? I'am very interested by it. Yug (talk) 00:00, 4 November 2013 (UTC)

The comparison Saskia published is the current status. We've not done more since then. Implementation is also still unfortunately quite far away. Lot's of other things on the list still like queries and structured data support for Commons :( --Lydia Pintscher (WMDE) (talk) 15:52, 4 November 2013 (UTC)

Sidenote : Edouard (brother) & me were proud and handsome while wearing your Wikidata T-shirts across the Mozilla Festival 2013. People don't know enough about Wikidata.org, but they do more now! Yug (talk) 00:00, 4 November 2013 (UTC)

Awesome! :D I hope you had fun at the event. --Lydia Pintscher (WMDE) (talk) 15:52, 4 November 2013 (UTC)

10:55, 4 November 2013 (UTC)

13:28, 11 November 2013 (UTC)

09:05, 18 November 2013 (UTC)

07:03, 25 November 2013 (UTC)

Data import from external databases

Hi Lydia, I try to start data import from external databases for chemicals (see Wikidata:WikiProject_Chemistry/ChemID). We need first to prepare the list of chemicals available in Wikidata and to identify them clearly (wikipedias manage chemicals in different ways). But the question of licences has to be solved before any data import. Some datbases offer their data to download and use CC licences, others just put some disclaimer and we have to clear if thses disclaimers can be covered by the Wikidata licence (CC BY-SA if I'm right) and for some databases we will need to get the agreement form the databases managers to cover our import. I stated the discussion about licences in the talk page of Wikidata:WikiProject_Chemistry/ChemID and we have to see what kind of information we want to extract: at least the IDs of each database but why not other data stored in these databases.

So a lot of work of preparation has to be done first but my idea is to involve you and your team when asking some agreements in order to have an institution-institution agreement which ensure a correct use of the agreement in the present and the future. If you have some persons in your team with law skills in general and licences skills in particular perhaps this person can have a look at the talk page to give advice: this can help to define for which database do we need an agreement. Thanks

Just as information: en:WP already organised some data imports to the en:WP based on open data (see w:en:Wikipedia:WikiProject_Chemicals/Chembox_validation): they had some problems with the CAS registry databse but they found an agreement with American Chemistry Society which results to the creation of a free and open database of some CAS numbers. Snipre (talk) 10:49, 8 December 2013 (UTC)

Hey Snipre! I'm happy to help. The best start is a document the lawyers at the Foundation have written: m:Wikilegal/Database Rights. Let me know if you have additional questions please. As for the license of the data in Wikidata: that is CC-0. Lydia Pintscher (WMDE) (talk) 10:55, 8 December 2013 (UTC)
Thanks for the information. Do you know why Wikidata is under CC-0 licence ? And why when we use the editing interface, above the button Save, we have a small line saying that by clicking this button we release our contribution under the CC BY-SA licence ? Snipre (talk) 16:18, 12 December 2013 (UTC)
The data in Wikidata is CC-0. The rest is under CC-BY-SA. The data is under CC-0 because anything else would be a huge pain when trying to re-use it anywhere and is legally unclear for data anyway. --Lydia Pintscher (WMDE) (talk) 16:23, 12 December 2013 (UTC)
Ok, so last question: what do we need from databases managers to be allowed to extract using bots some data ?
Sorry to ask that again and again but as nothing is really clear from legal point of view I want to know 1) if there are some possibilities to organize some data extraction with the agreement of databases managers, 2) how do we get an agreement which cover wikidata from any problem. Just a small thing to be accurate: in those databases some data are coming from other databases/book/research articles/...: as first step we don't want to import thses data. Only the internal identifier of each database meaning that data don't need the agreement of a third party.
To be clear can you ask to your legal representant which authorization and in which terms we need to extract and import in Wikidata the following data:
  • PubChem compound identifier from PubChem. PubChem licence, see here
  • chEBI identifier from chEBI. ChEBI licence, see here
  • chembl from chembl. Chembl licence, see here.
Thanks for your help. Snipre (talk) 19:10, 12 December 2013 (UTC)
Last detail: the import will be made with the data source based on help:Sources policy. Snipre (talk) 19:13, 12 December 2013 (UTC)
I am not a lawyer and can't give you legal advice obviously. This is a pretty grey area and you'll probably not be happy following every possible restriction. That being said the closest you can get to legal advice is the page linked above from the legal team at the Foundation. --Lydia Pintscher (WMDE) (talk) 19:59, 12 December 2013 (UTC)
I don't expect from you any answer but I thought you had some persons with legal skills in your team Wikimedia Deutschland who can give some hints about this questions. You are right I'm not happy because if I recognized that the law is not clear I don't understand why the above mentionned text from the Foundation doesn't give any way to go out of this grey area. Snipre (talk) 20:11, 12 December 2013 (UTC)
Snipre. I thought the page on Meta was pretty clear - Don't import from European databases. It was prepared by WMF_Legal in response to a query from wikidata users. If you have problems with it, you should raise them on the Talk page of that Meta page. Filceolaire (talk) 00:19, 14 December 2013 (UTC)

08:38, 9 December 2013 (UTC)

08:24, 16 December 2013 (UTC)

08:22, 23 December 2013 (UTC)

08:40, 30 December 2013 (UTC)

08:34, 6 January 2014 (UTC)

09:32, 13 January 2014 (UTC)

new job?

Dear Lydia,

I see you're sending the news letters. Grazie mille :-) Keep up the great work for WikiData.
Freundliche Grüsse aus Valdambra Italia von  Klaas|Z4␟V14:10, 19 January 2014 (UTC)

10:21, 20 January 2014 (UTC)