User:Jokipii
From MusicBrainz Wiki
Who am I?
I am the MusicBrainz editor Jokipii and the operator of Jokipii_bot. I have both the MusicBrainz and Discogs databases installed on PostgreSQL, and I am currently trying to improve the linking between them. The bot code can be found at musicbrainz-bot, and the code that builds the Discogs database from the monthly XML dumps can be found at discogs-xml2db.
Userscripts
Here is a userscript that makes voting on Discogs links easier.
Bot queue
Set descriptions and number of links
In Progress
* Release links identified by exact match on catalog number, release name, linked label, format, number of tracks and release country.
** 18000
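The exact-match rule above can be sketched as a lookup on a composite key. This is only an illustration, not the code in musicbrainz-bot: the `Release` record, its field names, and `find_exact_matches` are hypothetical, and `label_link` assumes that the label on both sides has already been resolved to one shared identifier through an existing label link.

```python
from collections import defaultdict
from typing import NamedTuple


class Release(NamedTuple):
    # Hypothetical record; field names are assumptions, not real schema columns.
    release_id: int
    catalog_number: str
    name: str
    label_link: str    # shared identifier of the already linked label
    media_format: str
    track_count: int
    country: str


def match_key(release):
    """Every criterion from the list item, combined into one comparable key."""
    return (release.catalog_number, release.name, release.label_link,
            release.media_format, release.track_count, release.country)


def find_exact_matches(mb_releases, discogs_releases):
    """Map each MusicBrainz release id to the Discogs release ids sharing its key."""
    by_key = defaultdict(list)
    for d in discogs_releases:
        by_key[match_key(d)].append(d.release_id)
    return {m.release_id: by_key[match_key(m)]
            for m in mb_releases if match_key(m) in by_key}
```

A MusicBrainz release that maps to more than one Discogs release would still need a decision by hand or a further rule; the source does not say how such cases are treated.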
Not Started
* Artist Discogs links
** Exact name match. The artist has track(s) on one or more already linked various-artists releases. All tracks found that way point to the same artist at Discogs.
** Example
** 21546
* Advanced relationships between releases and artists where both are already linked to Discogs
** Producer, hand-made example: 74164
** Mastered: 27970
** and certainly many more in other relationship classes
* Artist name with exact (case-insensitive) match; the artist is a member of groups with Discogs links, and all groups found that way have the same Discogs artist as a member.
** 8692
* Artist (type:group) name with exact (case-insensitive) match; the group has members with Discogs links, and all members found that way are also marked as members in the Discogs entry.
** 2083
* Artists that have a Discogs link, have no type (person/group) set, and have multiple members in Discogs (indicating type:group)
** 1309
* Artists that have a Discogs link, have no type (person/group) set, and have a Discogs realname without the characters "&,/+" and without the word "and" (indicating type:person)
** 1699
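The last heuristic in the list (a Discogs realname without the characters "&,/+" and without the word "and" suggests type:person) could look roughly like the sketch below. The function name is hypothetical, and two details are assumptions that the source does not state: "and" is matched as a whole word, and the match ignores case.

```python
import re

# Characters that suggest the realname field lists several people.
SEPARATORS = set("&,/+")
# Assumption: "and" counts only as a whole word, in any letter case.
AND_WORD = re.compile(r"\band\b", re.IGNORECASE)


def looks_like_person(realname):
    """Return True when a Discogs realname appears to name a single person."""
    if not realname.strip():
        return False  # no realname recorded, so nothing to infer
    if any(ch in SEPARATORS for ch in realname):
        return False
    return AND_WORD.search(realname) is None
```

Matching on the whole word keeps names such as "Andy Anderson" from being rejected just because they contain the letters "and".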
Done
* Artist Discogs links
** Exact name match. The artist has release(s) with Discogs links. All releases found that way point to the same artist at Discogs.
* Artist types based on disambiguation comment
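The "all releases found that way point to the same artist" condition amounts to a unanimity check over the candidates collected from the linked releases. A minimal sketch, assuming the candidate Discogs artist ids have already been gathered into a list (the function name is hypothetical):

```python
def unanimous_discogs_artist(candidate_ids):
    """Return the single Discogs artist id all candidates agree on, else None."""
    distinct = set(candidate_ids)
    if len(distinct) == 1:
        return distinct.pop()
    return None  # no candidates, or the linked releases disagree
```

For example, `unanimous_discogs_artist([42, 42, 42])` returns `42`, while `unanimous_discogs_artist([42, 43])` returns `None`, so an ambiguous artist is left unlinked.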
Bot programming tasks
- Merge bot code into musicbrainz-bot (Done)
- Start using discogs-xml2db to produce the Discogs database (Done)
- Better documentation
- Map Discogs credits <-> MB Advanced relationships
Some stats
Entity | MusicBrainz Total | Discogs Total | Links (all these are not unique) | Percent done (compared to smaller total) | Sum of unique MusicBrainz releases connected to linked entities | Percent of all MusicBrainz releases | Sum of unique Discogs releases connected to linked entities | Percent of all Discogs releases |
---|---|---|---|---|---|---|---|---|
Releases: | 988301 | 2720810 | 171170 | 17% | | | | |
Release groups: | 822442 | 365081 | 47387 | 13% | 106445 | 11% | 277207 | 10% |
Artist: | 626598 | 2100250 | 110825 | 18% | 606737 | 61% | 1738474 | 64% |
Label: | 55844 | 245988 | 16004 | 29% | 414436 | 42% (see note) | 1542803 | 57% |
Note: In MB only 567392 releases have label information; 420909 do not.
2012-02-23 | MusicBrainz Total | Discogs Total | Links (all these are not unique) | Percent done (compared to smaller total) |
---|---|---|---|---|
Releases: | 1008061 | 2926422 | 182581 | 18% |
Release Groups: | 839314 | 405891 | 73656 | 18% |
Artists: | 644784 | 2251519 | 126210 | 20% |
Labels: | 58038 | 300452 | 17141 | 30% |
2012-03-09 | MusicBrainz Total | Discogs Total | Links (all these are not unique) | Percent done (compared to smaller total) |
---|---|---|---|---|
Releases: | 1012072 | 2926422 | 193183 | 19% |
Release Groups: | 842794 | 405891 | 76053 | 19% |
Artists: | 647860 | 2251526 | 128111 | 20% |
Labels: | 58432 | 300452 | 17224 | 29% |
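The "Percent done (compared to smaller total)" column can be reproduced by dividing the link count by the smaller of the two database totals. The sketch below assumes rounding to the nearest whole percent, which agrees with the 2012-03-09 rows but is not stated on the page; the function name is hypothetical.

```python
def percent_done(mb_total, discogs_total, links):
    """Links as a whole-number percentage of the smaller database total."""
    return round(100 * links / min(mb_total, discogs_total))


# Rows from the 2012-03-09 table: (MusicBrainz total, Discogs total, links)
rows = {
    "Releases": (1012072, 2926422, 193183),
    "Release Groups": (842794, 405891, 76053),
    "Artists": (647860, 2251526, 128111),
    "Labels": (58432, 300452, 17224),
}
percentages = {name: percent_done(*values) for name, values in rows.items()}
```

Release groups are the only row where the Discogs total is the smaller one, so their percentage is measured against 405891 rather than against the MusicBrainz total.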