Documentation

Face markup

Our schema supports minimal face markup in order to avoid ambiguity in certain disciplines, such as genetics, where the same text may be a gene (when italicized) or a protein (when not italicized).

Face markup that appears in the title, subtitle, original_language_title, and unstructured_citation elements should be retained when depositing metadata. Face markup in other elements (such as small caps in author names) must be dropped. Face markup support includes bold (b), italic (i), underline (u), over-line (ovl), superscript (sup), subscript (sub), small caps (scp), and typewriter text (tt).

Examples where inclusion of face markup is especially important include:

  1. Italics in titles for terms such as species names or genes
  2. Superscript and subscript in titles as part of chemical names (for example, H20)
  3. Superscript and subscript in simple inline mathematics (for example, x2 + y2 = z2)

The schema supports nested face markup (for example: This text is bold and italic), which would be tagged as:

This text is <b><i>bold and italic</i></b>

Correspondingly, superscript and subscript may be nested for correct representation of xyz. This expression should be tagged as:

x<sup>y<sup>z</sup></sup>

We also support MathML markup in title elements.

Page owner: Patricia Feeney   |   Last updated 2020-April-08