User:CallerNo6/sandbox: Difference between revisions

From MusicBrainz Wiki
Line 13: Line 13:
(Please feel free to leave comments in this section)
(Please feel free to leave comments in this section)
# "''Jr.''", "''Sr.''", etc, were being sorted wrong. Example: if one follows the current official guideline, Hank Williams would not sort adjacent to Hank Williams, Jr.
# "''Jr.''", "''Sr.''", etc, were being sorted wrong. Example: if one follows the current official guideline, Hank Williams would not sort adjacent to Hank Williams, Jr.
# Refrences to "first" and "last" names changed to "given" and "family", respectively
# References to "first" and "last" names changed to "given" and "family", respectively
# Some restructuring/reformatting done to (I hope) improve readability
# Some restructuring/reformatting done to (I hope) improve readability
# Some redundant examples removed. Also, specifically, ''Bob Marley & The Wailers'' removed since that may become an NGS collaboration (and in any case the MB entry didn't follow the guideline :P )
# Some redundant examples removed. Also, specifically, ''Bob Marley & The Wailers'' removed since that may become an NGS collaboration (and in any case the MB entry didn't follow the guideline :P )
# Guideline for "Leading Articles" (e.g. "The) simplified. The effect is the same. Only move the "The" if it is the first word in a name and will significantly affect the sort-order.
# Guideline for "Leading Articles" (e.g. "The") simplified. The effect is the same. Only move the "The" if it is the first word in a name and will significantly affect the sort-order.
# Guidelines for "stylistic ligatures" and "symbols" condensed. As it says in the "Intent" section, this is not an attempt to have a separate "rule" to cover ''every possible'' possibility.

# Some example sort-names for the existing guideline no longer (or never did) comply with the guideline as it is currently written.
# Relaxed the use of commas somewhat. Do I have that right? Do commas affect sort algorithms?
# If this passes, I'll make a discussion section for "corner cases".


==Intent==
==Intent==
Sort names make it possible for external applications to create logical, alphabetical indexes. For the best results, have an understanding of the guidelines' intent, but also understand that there will be times when an editor should use his/her ''best judgment''™. The intent of this guideline is ''not'' to anticipate every possible exception to every "rule". Unusual "corner cases" can be found on the [[{{TALKPAGENAME}}|discussion]] page.
Sort names make it possible for external applications to create logical, alphabetical indexes. For the best results, have an understanding of the guidelines listed below, but also understand that there will be cases in which an editor should override these guidelines using his/her ''best judgment''™ if a more logical sort-order would result. The intent of this guideline is ''not'' to anticipate every possible exception and "corner case". Unusual cases can be discussed on the [[{{TALKPAGENAME}}|discussion]] page.
<br />


==Sorting Orders==
===Personal names:===
{|border="1" style="border-collapse:collapse;" cellpadding="10"
|Family Name
|Given Name
|''Jr.'' , ''Sr.'' , ''III'' etc.
|Prefix / Suffix
|}
=====Notes=====
:#Prefixes and suffixes include, for example, "Sir", "Dame", "DJ" and "Esq."
:#Treat middle names and nicknames as part of the "given name"
:#[http://en.wikipedia.org/wiki/Family_name Family names] take different forms in different cultures. The [[{{TALKPAGENAME}}|discussion]] page has some language-specific suggestions. When in doubt, consult an [[User_Language | editor familiar with the language and culture in question]].
=====Example=====
:*Hank Williams, Jr. has the sort name "''Williams, Hank, Jr.''"
:*[[Artist:defba3df-b0f3-4ef1-8e1f-4ef88cc8fc42 | Johnny "Guitar" Watson]] has the sort name "''Watson, Johnny "Guitar"''"
:* [[Artist:ebd7d505-9eb8-4c74-9dd4-ebe308e04d72|DJ Shah]] has sort name "Shah, DJ"


==Sort-order Guidelines==

{|border="1" style="border-collapse:collapse; font-size:smaller;" cellpadding="10"
===Band names:===
|'''Family Name'''
{|border="1" style="border-collapse:collapse;" cellpadding="10"
|'''Given Name''' (plus "nicknames", "middle names", Jr., Sr. etc)
|Band Name
|'''Prefix / Suffix'''
|''Leading'' Article (e.g. "An" or "The")
|'''Connector''' (e.g. "&")
|'''Band Name'''
|''Leading'' '''Article''' (e.g. "The")
|}
|}
<br />
=====Note=====
#"Given names" include "middle names", "nicknames" and (as the last element) ''Jr.'', ''Sr.'', ''III''
:* If the '''first''' word in a band's name is a common article (no matter the language), then move that article to the last position in the sort-order.
#Prefixes and Suffixes include, for example, "Sir", "Dame", "DJ" and "Esq."
=====Examples=====
#A "leading article" is an article (e.g. "the") that would otherwise have been the '''first''' word in the sort-order. Other articles need not be moved.
:*The Beatles have sort name "''Beatles, The''"
<br />
:*Los Lobos have sort name "''Lobos, Los''"


==Typographical Guidelines==
# Separate the "parts" of a sort name with commas where doing so makes the sort name more "human readable". Commas will not affect the sort order.
# Regardless of the script used for the [[Artist_Name|Artist Name]], use Latin characters (including diacritics and numerals) for the Sort Name, transliterating if necessary.
# Where special characters or symbols are used "stylistically" in an artist's name to represent common Latin characters, they should be converted to those common characters (if doing so allows an artist's name to be more logically collated).
<br />


==Examples==
===Personal names combined with a band name:===
{|border="1" style="border-collapse:collapse; font-size:smaller;" cellpadding="10"
Generally, this will indicate an artist's "supporting band". It does not include collaborations.
|'''[[Artist_Name|Artist Name]]'''
{|border="1" style="border-collapse:collapse;" cellpadding="10"
|Family Name
|'''Family Name'''
|Given Name
|'''Given Name'''
|'''Prefix / Suffix'''
|''Jr.'' , ''Sr.'' , ''III'' etc.
|'''Connector'''
|Prefix / Suffix
|'''Band Name'''
|Connector (e.g. "&")
|'''Article'''
|Band Name
|'''[[Sortname|Sort Name]]'''
|-
|[[Artist:1d8010a0-b642-4c8f-9c8d-8dc66b7bb81a|Hank Williams, Jr.]]
|Williams
|Hank Jr.
|
|
|
|
|Williams, Hank, Jr.
|-
|[[Artist:defba3df-b0f3-4ef1-8e1f-4ef88cc8fc42|Johnny "Guitar" Watson]]
|Watson
|Johnny "Guitar"
|
|
|
|
|Watson, Johnny "Guitar"
|-
|[[Artist:ebd7d505-9eb8-4c74-9dd4-ebe308e04d72|DJ Shah]]
|Shah
|
|DJ
|
|
|
|Shah, DJ
|-
|[[Artist:b10bbbfc-cf9e-42e0-be17-e2c3e1d2600d|The Beatles]]
|
|
|
|
|Beatles
|The
|Beatles, The
|-
|[[Artist:2b9967c7-3246-4658-a561-ded5408dd9af|Los Lobos]]
|
|
|
|
|Lobos
|Los
|Lobos, Los
|-
|[[Artist:4458d70d-c215-4f06-beea-ebd448dad6ac|Bill Haley & His Comets]]
|Haley
|Bill
|
|&
|His Comets
|
|Haley, Bill & His Comets
|-
|[[Artist:b3120863-d98d-4bad-a637-8abd8cde6685|Hootie & The Blowfish]]
|
|
|
|
|Hootie & The Blowfish
|
|Hootie & The Blowfish
|-
|[[Artist:33b3c323-77c2-417c-a5b4-af7e6a111cc9|The Jimi Hendrix Experience]]
|Hendrix
|Jimi
|
|
|The Experience
|
|Hendrix, Jimi, The Experience
|-
|[[Artist:5565ae4b-bb20-48a4-a8b0-dfdeb53b555a|布袋寅泰‎]]
|Hotei
|Tomoyasu
|
|
|
|
|Hotei, Tomoyasu
|-
|[[Artist:7c259e8a-f67b-45ff-b0f5-6b63d3a692cf|My$t:c DJz]]
|
|
|
|
|My$t:c DJz
|
|Mystic DJz
|-
|[[Artist:d867d382-35fc-49fb-a45a-64b66c0b8557|♪◆m599XGSMF6]]
|
|
|
|
|♪◆m599XGSMF6
|
|♪◆m599XGSMF6
|-
|[[Artist:f12647e5-d3aa-4dd4-b83b-c45eea712b60|René Löwe]]
|Löwe
|René
|
|
|
|
|Löwe, René
|-
|[[Artist:d0a0b3a2-ea58-4a7a-b18c-849d589bfac7|Spın̈al Tap]]
|
|
|
|
|Spın̈al Tap
|
|Spinal Tap
|}
|}
=====Examples=====
:* [[Artist:40d8f506-0d62-49bd-9bae-8c52b72c9ee1|Gloria Estefan and the Miami Sound Machine]] has sort name "Estefan, Gloria and the Miami Sound Machine"
:* [[Artist:abe2669a-a612-4bf6-9193-bb4f4b8a9088|Ben Folds Five]] has sort name "Folds, Ben, Five"
:* [[Artist:4458d70d-c215-4f06-beea-ebd448dad6ac|Bill Haley & His Comets]] has sort name "Haley, Bill & His Comets"
:* [[Artist:33b3c323-77c2-417c-a5b4-af7e6a111cc9|The Jimi Hendrix Experience]] has sort name "Hendrix, Jimi, The, Experience"




==Typographical Guidelines==
==Language-Specific Guidelines==
No language-specific exceptions are currently official. Please reference the discussion page for proposed language-specific exceptions to this guideline.
:* Separate the "parts" of a sort name with commas where doing so makes the sort name more "human readable". Commas will not affect the sort order.
:* Use Latin characters (including diacritics) regardless of the script used for the Artist Name
:**[[Artist:9ddd7abc-9e1b-471d-8031-583bc6bc8be9|Пётр Ильич Чайковский]] "Tchaikovsky, Pyotr Ilyich"
:* Avoid purely "stylistic" characters in favor of the common Latin characters they represent (where doing so allows an artist's name to be more logically collated)
:** Convert symbols representing Latin characters to those characters
:** Convert "stylistic" ligatures into their constituent characters (i.e. ligatures that have no "semantic" role, but are purely "presentational")
:** Use half-width Latin characters, not full-width
=====Examples=====
:* [[Artist:7c259e8a-f67b-45ff-b0f5-6b63d3a692cf|My$t:c DJz]] has sort name "Mystic DJz"
:* [[Artist:902286c2-e7d1-4ee8-adb6-216d6a71a05c|trance[]control]] has sort name "trancecontrol"
:* [[Artist:595198f1-3558-4a57-be64-f0f5f5de7254|(´・д・)ノ]] has sort name "(´・д・)ノ"
:* [[Artist:d867d382-35fc-49fb-a45a-64b66c0b8557|♪◆m599XGSMF6]] has sort name "♪◆m599XGSMF6"


Note: [http://en.wikipedia.org/wiki/Family_name Family names] take different forms in different cultures. When in doubt, consult an [[User_Language | editor familiar with the language and culture]] in question.

==Language specific rules==
No language-specific exceptions are currently official. Please reference the discussion page for proposed language-specific exceptions to this guideline.

Revision as of 22:18, 25 March 2011


Status: This page describes an active advanced relationship proposal and is not official.



Proposal number: RFC-xxx
Champion: caller#6
Current status: Discussion



Changes (this section to be removed from final draft)

(Please feel free to leave comments in this section)

  1. "Jr.", "Sr.", etc., were being sorted incorrectly. Example: if one follows the current official guideline, Hank Williams would not sort adjacent to Hank Williams, Jr.
  2. References to "first" and "last" names changed to "given" and "family", respectively
  3. Some restructuring/reformatting done to (I hope) improve readability
  4. Some redundant examples removed. Also, specifically, Bob Marley & The Wailers removed since that may become an NGS collaboration (and in any case the MB entry didn't follow the guideline :P )
  5. Guideline for "Leading Articles" (e.g. "The") simplified. The effect is the same. Only move the "The" if it is the first word in a name and will significantly affect the sort-order.
  6. Guidelines for "stylistic ligatures" and "symbols" condensed. As it says in the "Intent" section, this is not an attempt to have a separate "rule" to cover every possible possibility.
  7. Some example sort-names for the existing guideline no longer (or never did) comply with the guideline as it is currently written.
  8. Relaxed the use of commas somewhat. Do I have that right? Do commas affect sort algorithms?
  9. If this passes, I'll make a discussion section for "corner cases".

Intent

Sort names make it possible for external applications to create logical, alphabetical indexes. For the best results, have an understanding of the guidelines listed below, but also understand that there will be cases in which an editor should override these guidelines using his/her best judgment™ if a more logical sort-order would result. The intent of this guideline is not to anticipate every possible exception and "corner case". Unusual cases can be discussed on the discussion page.


Sort-order Guidelines

Family Name | Given Name (plus "nicknames", "middle names", Jr., Sr. etc) | Prefix / Suffix | Connector (e.g. "&") | Band Name | Leading Article (e.g. "The")


  1. "Given names" include "middle names", "nicknames" and (as the last element) Jr., Sr., III
  2. Prefixes and Suffixes include, for example, "Sir", "Dame", "DJ" and "Esq."
  3. A "leading article" is an article (e.g. "the") that would otherwise have been the first word in the sort-order. Other articles need not be moved.
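
The column order above can be read as a recipe for assembling a sort name. The following is a minimal Python sketch, assuming an editor has already identified the parts. The function `build_sort_name` and its parameter names are hypothetical and not part of MusicBrainz. The joining rules (commas between name parts, with a connector replacing the comma before a band name) are inferred from the example sort names in this proposal.

```python
def build_sort_name(family="", given="", affix="", connector="", band="", article=""):
    """Assemble a sort name from parts, in the order given by the guideline.

    Hypothetical helper: the parts are assumed to be identified by an editor.
    """
    # Personal parts: family name, given name(s), then prefix/suffix.
    person = ", ".join(part for part in (family, given, affix) if part)
    # Band parts: band name, then the moved leading article.
    group = ", ".join(part for part in (band, article) if part)
    if person and group:
        # Assumption: a connector such as "&" stands in for the comma.
        separator = f" {connector} " if connector else ", "
        return person + separator + group
    return person or group


print(build_sort_name(family="Shah", affix="DJ"))
print(build_sort_name(band="Beatles", article="The"))
print(build_sort_name(family="Haley", given="Bill", connector="&", band="His Comets"))
```

With these inputs the sketch yields "Shah, DJ", "Beatles, The" and "Haley, Bill & His Comets". It does not decide which words belong to which part; that judgment stays with the editor.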


Typographical Guidelines

  1. Separate the "parts" of a sort name with commas where doing so makes the sort name more "human readable". Commas will not affect the sort order.
  2. Regardless of the script used for the Artist Name, use Latin characters (including diacritics and numerals) for the Sort Name, transliterating if necessary.
  3. Where special characters or symbols are used "stylistically" in an artist's name to represent common Latin characters, they should be converted to those common characters (if doing so allows an artist's name to be more logically collated).
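
Whether commas influence ordering depends on the collation used by the consuming application; a plain code-point comparison does take them into account. As an illustration only, the Python sketch below shows one way an external application might compare sort names while ignoring punctuation, and one way to replace "stylistic" symbols. The names `destylize` and `collation_key` are hypothetical, the symbol mapping is chosen only to fit the My$t:c DJz example, and dropping diacritics in the comparison key is an assumption about the application (the stored sort name keeps them).

```python
import unicodedata


def destylize(name, mapping):
    """Replace stylistic symbols with the Latin characters they stand for."""
    return "".join(mapping.get(ch, ch) for ch in name)


def collation_key(sort_name):
    """Build a comparison key that ignores punctuation, case and diacritics.

    Assumption: the consuming application wants this kind of loose ordering.
    """
    # NFKD splits accented letters into base letter + combining mark and
    # folds full-width Latin characters to their ordinary forms.
    decomposed = unicodedata.normalize("NFKD", sort_name)
    kept = (ch for ch in decomposed if ch.isalnum() or ch.isspace())
    return "".join(kept).casefold()


# Hypothetical mapping for a single artist name.
print(destylize("My$t:c DJz", {"$": "s", ":": "i"}))

names = ["Williams, Hank, Jr.", "Watson, Johnny \"Guitar\"", "Williams, Hank"]
print(sorted(names, key=collation_key))
```

Under this key, "Williams, Hank, Jr." and "Williams Hank Jr" compare as equal, so the commas serve readability only. An application that sorts on the raw string would not behave this way.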


Examples

Artist Name | Family Name | Given Name | Prefix / Suffix | Connector | Band Name | Article | Sort Name
Hank Williams, Jr. | Williams | Hank Jr. | | | | | Williams, Hank, Jr.
Johnny "Guitar" Watson | Watson | Johnny "Guitar" | | | | | Watson, Johnny "Guitar"
DJ Shah | Shah | | DJ | | | | Shah, DJ
The Beatles | | | | | Beatles | The | Beatles, The
Los Lobos | | | | | Lobos | Los | Lobos, Los
Bill Haley & His Comets | Haley | Bill | | & | His Comets | | Haley, Bill & His Comets
Hootie & The Blowfish | | | | | Hootie & The Blowfish | | Hootie & The Blowfish
The Jimi Hendrix Experience | Hendrix | Jimi | | | The Experience | | Hendrix, Jimi, The Experience
布袋寅泰 | Hotei | Tomoyasu | | | | | Hotei, Tomoyasu
My$t:c DJz | | | | | My$t:c DJz | | Mystic DJz
♪◆m599XGSMF6 | | | | | ♪◆m599XGSMF6 | | ♪◆m599XGSMF6
René Löwe | Löwe | René | | | | | Löwe, René
Spın̈al Tap | | | | | Spın̈al Tap | | Spinal Tap
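
The band-name rows above (The Beatles, Los Lobos, Hootie & The Blowfish) can be checked with a short Python sketch of the leading-article rule. The function `move_leading_article` and the `ARTICLES` set are hypothetical; the set is a small illustrative sample, not an official list, and the sketch covers plain band names only, not names that contain a personal name such as The Jimi Hendrix Experience.

```python
# Illustrative sample of articles; an assumption, not an official list.
ARTICLES = {"the", "a", "an", "los", "las", "el", "la", "le", "les", "der", "die", "das"}


def move_leading_article(name, articles=ARTICLES):
    """Move an article to the end only when it is the first word of the name."""
    first, space, rest = name.partition(" ")
    if space and rest and first.casefold() in articles:
        return f"{rest}, {first}"
    # Articles elsewhere in the name are left where they are.
    return name


for artist in ("The Beatles", "Los Lobos", "Hootie & The Blowfish"):
    print(artist, "->", move_leading_article(artist))
```

The first two names come out as "Beatles, The" and "Lobos, Los", while "Hootie & The Blowfish" is returned unchanged because its article is not the first word.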


Language-Specific Guidelines

No language-specific exceptions are currently official. Please reference the discussion page for proposed language-specific exceptions to this guideline.

Note: Family names take different forms in different cultures. When in doubt, consult an editor familiar with the language and culture in question.