A taxon identifier is composed of a name, an author, a year and an optional attribute, each separated from the next by a blank. All of these are taken from the original publication.
The name is reproduced exactly as proposed in the original publication. A genus name consists of one word; a species name consists of two words (genus and species) separated by a blank.
Each author's name is a string of letters with no blanks, and the names of multiple authors are separated by commas. The spelling of each author's name follows the original publication. If there are more than three authors, only the names of the first two are shown, followed by ",+" and the number of omitted authors.
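The author rule above can be sketched as a small function. This is only an illustration: the function name `format_authors` is hypothetical, and it assumes that each name has already been reduced to a single blank-free string and that no blank follows the separating comma.

```python
def format_authors(authors):
    """Build the author part of an identifier from a list of names.

    Assumption: each name is already a blank-free string of letters,
    spelled as in the original publication.
    """
    if not authors:
        raise ValueError("at least one author is required")
    if len(authors) > 3:
        # More than three authors: keep the first two, then ",+" and
        # the number of authors that were left out.
        omitted = len(authors) - 2
        return f"{authors[0]},{authors[1]},+{omitted}"
    # Up to three authors: list them all, separated by commas.
    return ",".join(authors)


print(format_authors(["Smith"]))
print(format_authors(["Smith", "Jones", "Lee"]))
print(format_authors(["Smith", "Jones", "Lee", "Kim"]))
```

With four authors the sketch keeps two names and reports two omitted, so the last call yields "Smith,Jones,+2". The author names used here are placeholders, not taken from any publication.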
Year is the year of publication.
The attribute is enclosed in square brackets. It is rarely needed; when two taxa would otherwise receive the same identifier (homo-identifiers), the attribute gives the page, line or plate number of the original publication to tell them apart.
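As a minimal sketch of how the four parts fit together, the following function joins them with single blanks and adds the bracketed attribute only when one is supplied. The function name `build_identifier` and the attribute value "p20" are hypothetical; the text above does not prescribe how a page, line or plate number is written inside the brackets.

```python
def build_identifier(name, authors, year, attribute=None):
    """Join name, author, year and optional attribute with blanks.

    Assumption: `authors` is already formatted as a single blank-free
    string, and `attribute` is given without its square brackets.
    """
    parts = [name, authors, str(year)]
    if attribute:
        # The attribute is only present when needed to tell apart
        # otherwise identical identifiers.
        parts.append(f"[{attribute}]")
    return " ".join(parts)


print(build_identifier("Homo sapiens", "Linnaeus", 1758))
print(build_identifier("Homo sapiens", "Linnaeus", 1758, attribute="p20"))
```

The first call produces "Homo sapiens Linnaeus 1758"; the second appends " [p20]" to the same string.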
All diacritic marks, hyphens, and apostrophes are eliminated, so only the following characters are used: A to Z, a to z, 0 to 9, blank, comma, and opening and closing square brackets. Although both upper and lower case are used to make identifiers easier for people to read, identifiers are not case sensitive.
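One possible way to apply the character rule and the case-insensitive comparison is sketched below. The function names are hypothetical, and the sketch makes two assumptions not stated above: hyphens and apostrophes are deleted outright rather than replaced by a blank, and diacritics are removed by Unicode decomposition. Letters that do not decompose into a base letter plus a mark (for example "ø") are left unchanged by this approach and would need separate handling.

```python
import unicodedata


def strip_marks(text):
    """Remove diacritic marks, hyphens and apostrophes from text."""
    # NFD splits an accented letter into base letter + combining mark.
    decomposed = unicodedata.normalize("NFD", text)
    # combining() is non-zero for combining marks, so those are dropped.
    cleaned = "".join(ch for ch in decomposed if not unicodedata.combining(ch))
    # Assumption: hyphens and apostrophes (straight or curly) are deleted.
    for ch in ("-", "'", "\u2019"):
        cleaned = cleaned.replace(ch, "")
    return cleaned


def same_identifier(a, b):
    """Compare two identifiers without regard to case."""
    return strip_marks(a).lower() == strip_marks(b).lower()


print(strip_marks("Müller"))
print(strip_marks("d'Orbigny"))
print(same_identifier("Homo sapiens Linnaeus 1758", "HOMO SAPIENS LINNAEUS 1758"))
```

Under these assumptions "Müller" becomes "Muller" and "d'Orbigny" becomes "dOrbigny", and the two spellings of the identifier in the last line compare as equal.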