Difference between revisions of "User:Jokipii"
From MusicBrainz Wiki
Jump to navigationJump to searchm (bot queue update) |
|||
Line 4: | Line 4: | ||
== Currently working with == |
== Currently working with == |
||
I have both MusicBrainz and Discogs databases installed on PostgreSQL. I am currently trying to improve linking between those. |
I have both MusicBrainz and Discogs databases installed on PostgreSQL. I am currently trying to improve linking between those. |
||
+ | Bot code can be found at [https://github.com/Jokipii/musicbrainz-bot musicbrain-bot] and code that produces Discogs database from monthly XML dumps found at [https://github.com/Jokipii/discogs-xml2db discogs-xml2db]. |
||
== Userscripts == |
== Userscripts == |
Revision as of 15:24, 6 January 2012
Who am I?
MusicBrainz editor Jokipii and operator of Jokipii_bot.
Currently working with
I have both MusicBrainz and Discogs databases installed on PostgreSQL. I am currently trying to improve linking between those. Bot code can be found at musicbrain-bot and code that produces Discogs database from monthly XML dumps found at discogs-xml2db.
Userscripts
Here is userscript that makes voting for Discogs links easier.
Bot queue
- Artist Discogs links
- Exact name match. Have release(s) with Discogs links. All releases found that way point on same artist at Discogs.
- 2800
- Exact name match. Have release(s) with Discogs links. All releases found that way point on same artist at Discogs.
Possible future sets
Set descriptions and number of links
- Artist (type:group) name with exact (case insensitive) match, have members with Discogs links, all members found that way have been also market as members in Discogs entry.
- 2083
- Artist name with exact (case insensitive) match, is member of groups with Discogs links, all groups found that way have same Discogs artist as member.
- 8692
- Release links identified by exact match on catalog number, release name and linked label. See example
- 32704
- Artist that have Discogs link, and not have type(person/group) set, and have multiple members in Discogs (indicating type:group)
- 1309
- Artist that have Discogs link, and not have type(person/group) set, and have Discogs realname without characters "&,/+" and word "and" (indicating type:person)
- 1699
Bot programming tasks
- Merge bot code to musicbrainz-bot
- Start using discogs-xml2db to produce Discogs database
Some stats
MusicBrainz Total | Discogs Total | Links (all these are not unique) | Percent done (compared to smaller total) | Sum of unique MusicBrainz releases connected to linked entities | Percent of all MusicBrainz releases | Sum of unique Discogs releases connected to linked entities | Percent of all Discogs releases | ||
---|---|---|---|---|---|---|---|---|---|
Releases: | 988301 | 2720810 | 171170 | (17%) | |||||
Release groups: | 822442 | 365081 | 47387 | (13%) | 106445 | (11%) | 277207 | (10%) | |
Artist: | 626598 | 2100250 | 110825 | (18%) | 606737 | (61%) | 1738474 | (64%) | |
Label: | 55844 | 245988 | 16004 | (29%) | 414436 | (42%) see note | 1542803 | (57%) |
note: In MB only 567392 releases have label information, and 420909 don't have.