User:Jokipii: Difference between revisions
From MusicBrainz Wiki
Reosarevok (talk | contribs) mNo edit summary |
(Bot tasks done) |
Revision as of 17:37, 20 January 2012
Who am I?
MusicBrainz editor Jokipii and operator of Jokipii_bot.
Currently working with
I have both the MusicBrainz and Discogs databases installed on PostgreSQL, and I am currently trying to improve the linking between them. The bot code can be found at musicbrainz-bot, and the code that builds the Discogs database from the monthly XML dumps can be found at discogs-xml2db.
Userscripts
Here is a userscript that makes voting on Discogs links easier.
Bot queue
- Release links identified by an exact match on catalog number, release name, linked label, format, and number of tracks. See example
- 36974
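The matching rule above can be sketched in Python. This is only an illustration under stated assumptions: the `Release` record, its field names, and `find_release_links` are hypothetical and do not come from musicbrainz-bot or either database schema. "Linked label" is assumed to mean that the MusicBrainz label already carries a Discogs link, so both sides can be compared by Discogs label id.

```python
from collections import defaultdict
from typing import NamedTuple, Optional


class Release(NamedTuple):
    # Hypothetical record shape, not the real MusicBrainz or Discogs schema.
    id: int
    name: str
    catalog_number: str
    discogs_label_id: Optional[int]  # None when the label has no Discogs link
    format: str
    track_count: int


def match_key(release):
    """The five criteria that must agree exactly."""
    return (
        release.catalog_number,
        release.name,
        release.discogs_label_id,
        release.format,
        release.track_count,
    )


def find_release_links(mb_releases, discogs_releases):
    """Return (musicbrainz_id, discogs_id) pairs that agree on all five criteria."""
    by_key = defaultdict(list)
    for d in discogs_releases:
        by_key[match_key(d)].append(d.id)

    links = []
    for m in mb_releases:
        if m.discogs_label_id is None:
            continue  # label is not linked, so the label criterion cannot be checked
        for discogs_id in by_key.get(match_key(m), []):
            links.append((m.id, discogs_id))
    return links
```

Indexing the Discogs side by the full key keeps the comparison to one pass over each list. How ambiguous keys (several Discogs releases sharing one key) should be handled is not stated in the description, so the sketch simply returns every pair.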
Possible future sets
Set descriptions and number of links
- Artist (type:group) whose name is an exact (case-insensitive) match and whose members have Discogs links, where all members found that way are also marked as members in the Discogs entry.
- 2083
- Artist whose name is an exact (case-insensitive) match and who is a member of groups with Discogs links, where all groups found that way have the same Discogs artist as a member.
- 8692
- Artists that have a Discogs link, have no type (person/group) set, and have multiple members in Discogs (indicating type:group)
- 1309
- Artists that have a Discogs link, have no type (person/group) set, and have a Discogs realname without the characters "&,/+" or the word "and" (indicating type:person)
- 1699
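The last two sets describe a simple type heuristic, which might look like the sketch below. The function name `guess_artist_type` and its parameters are hypothetical. Checking the member count before the realname, and matching the word "and" case-insensitively as a whole word, are assumptions rather than something the descriptions spell out.

```python
import re
from typing import Optional

# Any of the characters & , / + or the standalone word "and" suggests
# that the realname lists more than one person.
_MULTIPLE_PEOPLE_HINT = re.compile(r"[&,/+]|\band\b", re.IGNORECASE)


def guess_artist_type(member_count: int, realname: Optional[str]) -> Optional[str]:
    """Guess 'group' or 'person' for an untyped artist, or None if undecided."""
    if member_count > 1:
        return "group"
    if realname and not _MULTIPLE_PEOPLE_HINT.search(realname):
        return "person"
    return None
```

The word boundary matters: a realname such as "Anders Sandberg" contains the letters "and" twice but is still treated as one person, while "John Smith and Jane Doe" is left undecided.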
Bot tasks done
- Artist Discogs links
- Exact name match. The artist has release(s) with Discogs links, and all releases found that way point to the same artist at Discogs.
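The completed artist-linking rule can be sketched as follows. The function `propose_artist_link` and its input shape are hypothetical, and treating "exact" as a case-sensitive comparison is an assumption.

```python
def propose_artist_link(mb_artist_name, linked_release_artists):
    """Return a Discogs artist id to link, or None if the evidence is not unanimous.

    linked_release_artists: one (discogs_artist_id, discogs_artist_name) pair
    for each of the artist's releases that already has a Discogs link.
    """
    artists = set(linked_release_artists)
    if len(artists) != 1:
        return None  # no linked releases, or they point to different artists
    discogs_id, discogs_name = next(iter(artists))
    if discogs_name != mb_artist_name:
        return None  # names must match exactly
    return discogs_id
```

For example, `propose_artist_link("Foo", [(7, "Foo"), (7, "Foo")])` returns `7`, while a list that mixes two different Discogs artists returns `None`.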
Bot programming tasks
- Merge bot code to musicbrainz-bot
- Start using discogs-xml2db to produce Discogs database
Some stats
Entity | MusicBrainz total | Discogs total | Links (not all of these are unique) | Percent done (compared to smaller total) | Sum of unique MusicBrainz releases connected to linked entities | Percent of all MusicBrainz releases | Sum of unique Discogs releases connected to linked entities | Percent of all Discogs releases
---|---|---|---|---|---|---|---|---
Releases | 988301 | 2720810 | 171170 | (17%) | | | |
Release groups | 822442 | 365081 | 47387 | (13%) | 106445 | (11%) | 277207 | (10%)
Artist | 626598 | 2100250 | 110825 | (18%) | 606737 | (61%) | 1738474 | (64%)
Label | 55844 | 245988 | 16004 | (29%) | 414436 | (42%) see note | 1542803 | (57%)
Note: in MusicBrainz only 567392 releases have label information; 420909 do not.
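The "percent done" column can be reproduced from the totals with a short calculation. This is a sketch: the function name `percent_done` is made up here, and rounding to the nearest whole percent is an assumption that happens to match the figures in the table.

```python
def percent_done(mb_total, discogs_total, links):
    """Links as a percentage of the smaller of the two totals."""
    return round(100 * links / min(mb_total, discogs_total))


# (MusicBrainz total, Discogs total, links) taken from the table
stats = {
    "Releases": (988301, 2720810, 171170),
    "Release groups": (822442, 365081, 47387),
    "Artist": (626598, 2100250, 110825),
    "Label": (55844, 245988, 16004),
}

percentages = {name: percent_done(*row) for name, row in stats.items()}
# {'Releases': 17, 'Release groups': 13, 'Artist': 18, 'Label': 29}
```

Release groups are the only row where Discogs has the smaller total, which is why 47387 links count as 13% rather than about 6%.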