Who am I?
I am MusicBrainz editor Jokipii and the operator of Jokipii_bot. I have both the MusicBrainz and Discogs databases installed on PostgreSQL, and I am currently trying to improve the linking between them. The bot code can be found at musicbrainz-bot, and the code that builds the Discogs database from the monthly XML dumps can be found at discogs-xml2db.
Here is a userscript that makes voting on Discogs links easier.
Set descriptions and number of links
* Release links identified by an exact match on catalog number, release name, linked label, format, number of tracks and release country.
** 18000
* Artist Discogs links
** Exact name match. The artist has track(s) on one or more already linked various-artists release(s), and all tracks found that way point to the same artist at Discogs.
** Example
** 21546
* Advanced relationships between releases and artists where both are already linked to Discogs
** Producer (hand-made example) 74164
** Mastered 27970
** and certainly many more in other relationship classes
* Artist name with an exact (case-insensitive) match; the artist is a member of groups with Discogs links, and all groups found that way have the same Discogs artist as a member.
** 8692
* Artist (type:group) name with an exact (case-insensitive) match; the group has members with Discogs links, and all members found that way are also marked as members in the Discogs entry.
** 2083
* Artists that have a Discogs link, have no type (person/group) set, and have multiple members in Discogs (indicating type:group)
** 1309
* Artists that have a Discogs link, have no type (person/group) set, and have a Discogs realname without the characters "&,/+" and without the word "and" (indicating type:person)
** 1699
* Artist Discogs links
** Exact name match. The artist has release(s) with Discogs links, and all releases found that way point to the same artist at Discogs.
* Artist types based on disambiguation comment
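The artist rule above (exact name match, with every already linked release agreeing on one Discogs artist) can be sketched roughly as follows. This is only an illustration, not the code in musicbrainz-bot: the `LinkedRelease` row shape and the `propose_artist_links` function are hypothetical, and the sketch assumes the rows have already been fetched from the two PostgreSQL databases.

```python
from collections import defaultdict
from typing import Dict, Iterable, NamedTuple, Set


class LinkedRelease(NamedTuple):
    """One already linked release pair (hypothetical row shape)."""
    mb_artist_id: int         # artist credited on the MusicBrainz release
    mb_artist_name: str
    discogs_artist_id: int    # artist credited on the linked Discogs release
    discogs_artist_name: str


def propose_artist_links(rows: Iterable[LinkedRelease]) -> Dict[int, int]:
    """Return {mb_artist_id: discogs_artist_id} for unambiguous candidates."""
    candidates: Dict[int, Set[int]] = defaultdict(set)
    for row in rows:
        # Only exact name matches count as evidence.
        if row.mb_artist_name == row.discogs_artist_name:
            candidates[row.mb_artist_id].add(row.discogs_artist_id)
    # Keep an artist only if all evidence points to a single Discogs artist.
    return {
        mb_id: next(iter(discogs_ids))
        for mb_id, discogs_ids in candidates.items()
        if len(discogs_ids) == 1
    }
```

An artist whose linked releases point to two different Discogs artists is dropped rather than guessed, which keeps the proposed set conservative before it goes to voting.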
Bot programming tasks
- Merge bot code into musicbrainz-bot (done)
- Start using discogs-xml2db to produce the Discogs database (done)
- Better documentation
- Map Discogs credits <-> MB Advanced relationships
| ||MusicBrainz Total||Discogs Total||Links (all these are not unique)||Percent done (compared to smaller total)||Sum of unique MusicBrainz releases connected to linked entities||Percent of all MusicBrainz releases||Sum of unique Discogs releases connected to linked entities||Percent of all Discogs releases|
|Label:||55844||245988||16004||29%||414436||42% (see note)||1542803||57%|
Note: in MB only 567392 releases have label information; 420909 do not.
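As a rough check of the label row, the percentages can be recomputed from the figures given. The sketch assumes that the total number of MusicBrainz releases is the sum of the two counts in the note (releases with and without label information); the `percent` helper is only for illustration.

```python
def percent(part: int, whole: int) -> int:
    """Share of `part` in `whole`, rounded to a whole percent."""
    return round(100 * part / whole)


mb_labels, discogs_labels, label_links = 55844, 245988, 16004
# "Percent done" is measured against the smaller of the two totals.
done = percent(label_links, min(mb_labels, discogs_labels))

mb_with_label, mb_without_label = 567392, 420909
mb_releases = mb_with_label + mb_without_label  # assumed total of MB releases
connected = 414436

of_all_releases = percent(connected, mb_releases)
of_labelled_releases = percent(connected, mb_with_label)

print(done, of_all_releases, of_labelled_releases)  # 29 42 73
```

Under that assumption, the 42% figure understates coverage: measured only against releases that carry label information, the linked labels reach about 73% of them.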
|2012-02-23||MusicBrainz Total||Discogs Total||Links (all these are not unique)||Percent done (compared to smaller total)|
|2012-03-09||MusicBrainz Total||Discogs Total||Links (all these are not unique)||Percent done (compared to smaller total)|