Difference between revisions of "User:Jokipii"
From MusicBrainz Wiki
Jump to navigationJump to search (new artist discogs links set) |
(possible future bot sets (Advanced relationships between releases and artists where both are already linked to discogs)) |
||
Line 11: | Line 11: | ||
== Bot queue == |
== Bot queue == |
||
* Release links identified by exact match on catalog number, release name, linked label, format, same number of tracks and same release country. |
* Release links identified by exact match on catalog number, release name, linked label, format, same number of tracks and same release country. |
||
− | ** |
+ | ** 24500 |
== Possible future sets == |
== Possible future sets == |
||
Line 19: | Line 19: | ||
** [http://test.musicbrainz.org/edit/16086228 Example] |
** [http://test.musicbrainz.org/edit/16086228 Example] |
||
** 21546 |
** 21546 |
||
+ | * Advanced relationships between releases and artists where both are already linked to discogs |
||
+ | ** Producer [http://musicbrainz.org/edit/16730693 Hand made example] 74164 |
||
+ | ** Mastered 27970 |
||
+ | ** and certainly lots also in other relationship classes |
||
* Artist name with exact (case insensitive) match, is member of groups with Discogs links, all groups found that way have same Discogs artist as member. |
* Artist name with exact (case insensitive) match, is member of groups with Discogs links, all groups found that way have same Discogs artist as member. |
||
** 8692 |
** 8692 |
Revision as of 02:46, 1 March 2012
Who am I?
MusicBrainz editor Jokipii and operator of Jokipii_bot.
Currently working with
I have both MusicBrainz and Discogs databases installed on PostgreSQL. I am currently trying to improve linking between those. Bot code can be found at musicbrainz-bot and code that produces Discogs database from monthly XML dumps found at discogs-xml2db.
Userscripts
Here is userscript that makes voting for Discogs links easier.
Bot queue
- Release links identified by exact match on catalog number, release name, linked label, format, same number of tracks and same release country.
- 24500
Possible future sets
Set descriptions and number of links
- Artist Discogs links
- Exact name match. One or more already linked various artist release(s) where artist have track(s). All track(s) found that way point on same artist at Discogs.
- Example
- 21546
- Advanced relationships between releases and artists where both are already linked to discogs
- Producer Hand made example 74164
- Mastered 27970
- and certainly lots also in other relationship classes
- Artist name with exact (case insensitive) match, is member of groups with Discogs links, all groups found that way have same Discogs artist as member.
- 8692
- Artist (type:group) name with exact (case insensitive) match, have members with Discogs links, all members found that way have been also market as members in Discogs entry.
- 2083
- Artist that have Discogs link, and not have type(person/group) set, and have multiple members in Discogs (indicating type:group)
- 1309
- Artist that have Discogs link, and not have type(person/group) set, and have Discogs realname without characters "&,/+" and word "and" (indicating type:person)
- 1699
Bot tasks done
- Artist Discogs links
- Exact name match. Have release(s) with Discogs links. All releases found that way point on same artist at Discogs.
Bot programming tasks
- Merge bot code to musicbrainz-bot
- Start using discogs-xml2db to produce Discogs database
Some stats
MusicBrainz Total | Discogs Total | Links (all these are not unique) | Percent done (compared to smaller total) | Sum of unique MusicBrainz releases connected to linked entities | Percent of all MusicBrainz releases | Sum of unique Discogs releases connected to linked entities | Percent of all Discogs releases | |
---|---|---|---|---|---|---|---|---|
Releases: | 988301 | 2720810 | 171170 | (17%) | ||||
Release groups: | 822442 | 365081 | 47387 | (13%) | 106445 | (11%) | 277207 | (10%) |
Artist: | 626598 | 2100250 | 110825 | (18%) | 606737 | (61%) | 1738474 | (64%) |
Label: | 55844 | 245988 | 16004 | (29%) | 414436 | (42%) see note | 1542803 | (57%) |
note: In MB only 567392 releases have label information, and 420909 don't have.
Stats 2012-02-23
MusicBrainz Total | Discogs Total | Links (all these are not unique) | Percent done (compared to smaller total) | |
---|---|---|---|---|
Releases: | 1008061 | 2926422 | 182581 | 18% |
Release Groups: | 839314 | 405891 | 73656 | 18% |
Artists: | 644784 | 2251519 | 126210 | 20% |
Labels: | 58038 | 300452 | 17141 | 30% |