Difference between revisions of "User:Jokipii"
From MusicBrainz Wiki
Jump to navigationJump to search (Bot tasks done) |
(stats update) |
||
Line 10: | Line 10: | ||
== Bot queue == |
== Bot queue == |
||
− | * Release links identified by exact match on catalog number, release name, linked label, format |
+ | * Release links identified by exact match on catalog number, release name, linked label, format, same number of tracks and same release country. |
− | ** |
+ | ** 27500 |
== Possible future sets == |
== Possible future sets == |
||
Line 50: | Line 50: | ||
| 171170 |
| 171170 |
||
| (17%) |
| (17%) |
||
⚫ | |||
| |
| |
||
| |
| |
||
Line 88: | Line 87: | ||
note: In MB only 567392 releases have label information, and 420909 don't have. |
note: In MB only 567392 releases have label information, and 420909 don't have. |
||
+ | |||
+ | Stats 2012-02-23 |
||
⚫ | |||
+ | ! |
||
+ | ! MusicBrainz Total |
||
+ | ! Discogs Total |
||
+ | ! Links (all these are not unique) |
||
+ | ! Percent done (compared to smaller total) |
||
+ | |- |
||
+ | ! Releases: |
||
+ | | 1008061 |
||
+ | | 2926422 |
||
+ | | 182581 |
||
+ | | 18% |
||
+ | |- |
||
+ | ! Release Groups: |
||
+ | | 839314 |
||
+ | | 405891 |
||
+ | | 73656 |
||
+ | | 18% |
||
+ | |- |
||
+ | ! Artists: |
||
+ | | 644784 |
||
+ | | 2251519 |
||
+ | | 126210 |
||
+ | | 20% |
||
+ | |- |
||
+ | ! Labels: |
||
+ | | 58038 |
||
+ | | 300452 |
||
+ | | 17141 |
||
+ | | 30% |
||
+ | |} |
Revision as of 10:05, 23 February 2012
Who am I?
MusicBrainz editor Jokipii and operator of Jokipii_bot.
Currently working with
I have both MusicBrainz and Discogs databases installed on PostgreSQL. I am currently trying to improve linking between those. Bot code can be found at musicbrainz-bot and code that produces Discogs database from monthly XML dumps found at discogs-xml2db.
Userscripts
Here is userscript that makes voting for Discogs links easier.
Bot queue
- Release links identified by exact match on catalog number, release name, linked label, format, same number of tracks and same release country.
- 27500
Possible future sets
Set descriptions and number of links
- Artist (type:group) name with exact (case insensitive) match, have members with Discogs links, all members found that way have been also market as members in Discogs entry.
- 2083
- Artist name with exact (case insensitive) match, is member of groups with Discogs links, all groups found that way have same Discogs artist as member.
- 8692
- Artist that have Discogs link, and not have type(person/group) set, and have multiple members in Discogs (indicating type:group)
- 1309
- Artist that have Discogs link, and not have type(person/group) set, and have Discogs realname without characters "&,/+" and word "and" (indicating type:person)
- 1699
Bot tasks done
- Artist Discogs links
- Exact name match. Have release(s) with Discogs links. All releases found that way point on same artist at Discogs.
Bot programming tasks
- Merge bot code to musicbrainz-bot
- Start using discogs-xml2db to produce Discogs database
Some stats
MusicBrainz Total | Discogs Total | Links (all these are not unique) | Percent done (compared to smaller total) | Sum of unique MusicBrainz releases connected to linked entities | Percent of all MusicBrainz releases | Sum of unique Discogs releases connected to linked entities | Percent of all Discogs releases | |
---|---|---|---|---|---|---|---|---|
Releases: | 988301 | 2720810 | 171170 | (17%) | ||||
Release groups: | 822442 | 365081 | 47387 | (13%) | 106445 | (11%) | 277207 | (10%) |
Artist: | 626598 | 2100250 | 110825 | (18%) | 606737 | (61%) | 1738474 | (64%) |
Label: | 55844 | 245988 | 16004 | (29%) | 414436 | (42%) see note | 1542803 | (57%) |
note: In MB only 567392 releases have label information, and 420909 don't have.
Stats 2012-02-23
MusicBrainz Total | Discogs Total | Links (all these are not unique) | Percent done (compared to smaller total) | |
---|---|---|---|---|
Releases: | 1008061 | 2926422 | 182581 | 18% |
Release Groups: | 839314 | 405891 | 73656 | 18% |
Artists: | 644784 | 2251519 | 126210 | 20% |
Labels: | 58038 | 300452 | 17141 | 30% |