GSUB — Glyph Substitution Table
Overview
The Glyph Substitution (GSUB) table provides data for substitution of glyphs for appropriate rendering of scripts, such as cursively-connecting forms in Arabic script, or for advanced typographic effects, such as ligatures.
Many language systems require substitution of alternate glyph forms. For example, in the Arabic script, the glyph shape that depicts a particular character varies according to its position in a word or text string (see Figure 1). In other language systems, glyph substitutes are aesthetic options for the user, such as the use of ligature glyphs in the English language (see Figure 2).
OpenType fonts use character encoding standards, such as the Unicode Standard, that assumes a distinction between characters and glyphs: text is encoded as sequences of characters, and the 'cmap' table provides a mapping from that character to a single default glyph. Multiple characters are not directly mapped to a single glyph, as needed for ligatures; and a single character is not mapped directly to multiple glyphs, as may be needed for some complex-script scenarios. The GSUB table provides a way to describe such substitutions, enabling applications to apply such substitutions during text layout and rendering to achieve desired results.
To access substitute glyphs, GSUB maps from the glyph index or indices defined in a 'cmap' subtable to the glyph index or indices of the substitute glyphs. For example, if a font has three alternative forms of an ampersand glyph, the 'cmap' table associates the ampersand’s character code with only one of these glyphs. In GSUB, the indices of the other ampersand glyphs are then referenced from this one default index.
The text-processing client uses the GSUB data to manage glyph substitution actions. GSUB identifies the glyphs that are input to and output from each glyph substitution action, specifies how and where the client uses glyph substitutes, and regulates the order of glyph substitution operations. Any number of substitutions can be defined for each script or language system represented in a font.
The GSUB table supports seven types of glyph substitutions that are widely used in international typography:
A single substitution replaces a single glyph with another single glyph. This is used, for example, to render positional glyph variants in Arabic and vertical text in East Asia (see Figure 3).
A multiple substitution replaces a single glyph with more than one glyph. This is used to specify actions such as ligature decomposition (see Figure 4).
An alternate substitution identifies functionally equivalent but different looking forms of a glyph. These glyphs are often referred to as aesthetic alternatives. For example, a font might have five different glyphs for the ampersand symbol, but one would have a default glyph index in the 'cmap' table. The client could use the default glyph or substitute any of the four alternatives (see Figure 5).
A ligature substitution replaces several glyph indices with a single glyph index, as when an Arabic ligature glyph replaces a string of separate glyphs (see Figure 6). When a string of glyphs can be replaced with a single ligature glyph, the first glyph is substituted with the ligature. The remaining glyphs in the string are deleted, this does not include those glyphs that are skipped as a result of lookup flags.
Contextual substitution is an extension of the above lookup types, describing glyph substitutions in context — that is, a substitution of one or more glyphs within a certain pattern of glyphs. Each substitution describes one or more input glyph sequences and one or more substitutions to be performed on that sequence. Contextual substitutions can be applied to specific glyph sequences, glyph classes, or sets of glyphs.
Chained contexts substitution extends the capabilities of contextual substitution. As with contextual substitution, actions can be performed on one or more glyphs within a pattern of glyphs—the input sequence. But the actions can be constrained by chained glyph sequence contexts: a backtrack sequence that precedes the input sequence, and a lookahead sequence that follows the input sequence. Three formats allow the backtrack, input and lookahead sequence patterns to be described using specific glyphs, glyph classes, or glyph sets.
Reverse Chaining contextual single substitution allows one glyph to be substituted with another by chaining input glyph to a backtrack and/or lookahead sequence. The difference between this and other lookup types is that processing of input glyph sequence goes from end to start.
The GSUB data formats used to implement the different types of substitution include an eighth type, substitution extension. This provides a format extension mechanism, allowing reference to subtables using 32-bit offsets rather than 16-bit offsets. It does not provide an additional type of substitution action, however.
GSUB table and OpenType Font Variations
OpenType Font Variations allow a single font to support many design variations along one or more axes of design variation. For example, a font with weight and width variations might support weights from thin to black, and widths from ultra-condensed to ultra-expanded. For general information on OpenType Font Variations, see the chapter, OpenType Font Variations Overview.
In a variable font, it may be desirable to have different glyph-substitution actions used for different regions within the font’s variation space. For example, for narrow or heavy instances in which counters become small, it may be desirable to make certain glyph substitutions to use alternate glyphs with certain strokes removed or outlines simplified to allow for larger counters. Such effects can be achieved using a FeatureVariations table within the GSUB table. The FeatureVariations table is described in the chapter, OpenType Layout Common Table Formats. See also the Required Variation Alternates ('rvrn') feature in the OpenType Layout tag registry.
GSUB table organization
The GSUB table begins with a header that defines offsets to a ScriptList, a FeatureList, a LookupList, and an optional FeatureVariations table (see Figure 7):
- The ScriptList identifies all the scripts and language systems in the font that use glyph substitutes.
- The FeatureList defines all the glyph substitution features required to render these scripts and language systems.
- The LookupList contains all the lookup data needed to implement each glyph substitution feature.
- The FeatureVariations table can be used to substitute alternate sets of lookup tables to use for any given feature under specified conditions. This is currently used only in variable fonts.
For a detailed discussion of ScriptLists, FeatureLists, LookupLists, and FeatureVariation tables, see the chapter, OpenType Layout Common Table Formats.
This organization helps text-processing clients to easily locate the features and lookups that apply to a particular script or language system. To access GSUB information, clients should use the following procedure:
- Locate the current script in the GSUB ScriptList table.
- If the language system is known, search the script for the correct LangSys table; otherwise, use the script’s default LangSys table.
- The LangSys table provides index numbers into the GSUB FeatureList table to access a required feature and a number of additional features.
- Inspect the featureTag of each feature, and select the feature tables to apply to an input glyph string.
- If a Feature Variations table is present, evaluate conditions in the Feature Variation table to determine if any of the initially-selected feature tables should be substituted by an alternate feature table.
- Each feature table provides an array of index numbers into the GSUB LookupList table. Assemble all lookups from the set of chosen features, and apply the lookups in the order given in the LookupList table.
For a detailed description of the Feature Variations table and how it is processed, see the FeatureVariations table section in the OpenType Layout Common Table Formats chapter.
Lookup data is defined in Lookup tables, which are defined in the OpenType Layout Common Table Formats chapter. A Lookup table contains one or more Lookup subtables that define the specific conditions, type, and results of a substitution action used to implement a feature. Specific Lookup subtable types are used for glyph substitution actions, and are defined in this chapter. All subtables within a Lookup table must be of the same lookup type, as listed in the following table for the GsubLookupType enumeration:
GsubLookupType enumeration
Value | Type | Description |
---|---|---|
1 | Single | Replace one glyph with one glyph |
2 | Multiple | Replace one glyph with more than one glyph |
3 | Alternate | Replace one glyph with one of many glyphs |
4 | Ligature | Replace multiple glyphs with one glyph |
5 | Contextual substitution | Replace one or more glyphs in context |
6 | Chained contexts substitution | Replace one or more glyphs in chained context |
7 | Substitution extension | Extension mechanism for other substitution types |
8 | Reverse chaining context single | Applied in reverse order, replace single glyph in chained contexts |
Each lookup type has one or more subtable formats. The “best” format depends on the type of substitution and the resulting storage efficiency. When glyph information is best presented in more than one format, a single lookup may define more than one subtable, as long as all the subtables are for the same lookup type. For example, within a given lookup, a glyph index array format could best represent one set of target glyphs, whereas a glyph index range format could be better for another set.
A series of substitution operations on the same glyph or string requires multiple lookups, one for each separate action. Each lookup has a different array index in the LookupList table and is applied in the LookupList order. The substitution action of each lookup is applied to the results of previous lookups. Some substitution lookups could “feed” later lookups by producing a glyph sequence that matches the input sequence pattern of a later lookup that would not have matched the original glyph sequence. The opposite is also possible: one substitution lookup produces a glyph sequence that does not match the pattern of a later lookup that would have matched the original glyph sequence. Thus, the ordering of lookups in the LookupList can be very significant.
During text processing, a client applies a lookup to each glyph in the string before moving to the next lookup. A lookup is finished for a glyph after the client locates the target glyph or glyph context and performs a substitution, if specified. To move to the “next” glyph, the client will skip all the glyphs that participated in the lookup operation: glyphs that were substituted as well as any other glyphs that formed an input sequence context for the operation. Only glyphs in the input sequence are skipped; in the case of chained contexts substitution, the glyphs in the lookahead sequence are not skipped.
The next section of this chapter describes the GSUB header and the subtables defined for each GsubLookupType. Examples at the end of this chapter illustrate the GSUB header and six of the eight LookupTypes, including the three formats available for contextual substitutions (LookupType 5).
GSUB table structures
GSUB Header
The GSUB table begins with a header that contains a version number for the table and offsets to three tables: ScriptList, FeatureList, and LookupList. For descriptions of each of these tables, see the chapter, OpenType Layout Common Table Formats. Example 1 at the end of this chapter shows a GSUB Header version 1.0 table definition.
GSUB Header, version 1.0
Type | Name | Description |
---|---|---|
uint16 | majorVersion | Major version of the GSUB table, = 1. |
uint16 | minorVersion | Minor version of the GSUB table, = 0. |
Offset16 | scriptListOffset | Offset to ScriptList table, from beginning of GSUB table. |
Offset16 | featureListOffset | Offset to FeatureList table, from beginning of GSUB table. |
Offset16 | lookupListOffset | Offset to LookupList table, from beginning of GSUB table. |
GSUB Header, version 1.1
Type | Name | Description |
---|---|---|
uint16 | majorVersion | Major version of the GSUB table, = 1. |
uint16 | minorVersion | Minor version of the GSUB table, = 1. |
Offset16 | scriptListOffset | Offset to ScriptList table, from beginning of GSUB table. |
Offset16 | featureListOffset | Offset to FeatureList table, from beginning of GSUB table. |
Offset16 | lookupListOffset | Offset to LookupList table, from beginning of GSUB table. |
Offset32 | featureVariationsOffset | Offset to FeatureVariations table, from beginning of the GSUB table (may be NULL). |
Lookup type 1 subtable: single substitution
Single substitution (SingleSubst) subtables tell a client to replace a single glyph with another glyph. The subtables can be either of two formats. Both formats require two distinct sets of glyph indices: one that defines input glyphs (specified in the Coverage table), and one that defines the output glyphs. Format 1 requires less space than format 2, but it is less flexible.
Single substitution format 1
Format 1 calculates the indices of the output glyphs, which are not explicitly defined in the subtable. To calculate an output glyph index, format 1 adds a constant delta value to the input glyph index. The input and output glyphs do not need to be in continuous glyph ID ranges, but the delta between input glyph IDs and output glyph IDs need to be constant. This format does not use the Coverage index that is returned from the Coverage table.
The SingleSubstFormat1 subtable begins with a format identifier of 1. An offset references a Coverage table that specifies the indices of the input glyphs. The deltaGlyphID is a constant value added to each input glyph index to calculate the index of the corresponding output glyph. Addition of deltaGlyphID is modulo 65536. If the result after adding deltaGlyphID to the input glyph index is less than zero, add 65536 to obtain a valid glyph ID.
Example 2 at the end of this chapter uses format 1 to replace standard numerals with lining numerals.
SingleSubstFormat1 subtable
Type | Name | Description |
---|---|---|
uint16 | format | Format identifier: format = 1. |
Offset16 | coverageOffset | Offset to Coverage table, from beginning of substitution subtable. |
int16 | deltaGlyphID | Add to original glyph ID to get substitute glyph ID. |
Single substitution format 2
Format 2 is more flexible than format 1 but requires more space. It provides an array of output glyph indices (substituteGlyphIDs) explicitly matched to the input glyph indices specified in the Coverage table.
The SingleSubstFormat2 subtable specifies a format identifier, an offset to a Coverage table that defines the input glyph indices, and an array of output glyph indices (substituteGlyphIDs).
The substituteGlyphIDs array must contain the same number of glyph indices as the Coverage table, and the glyphs must be ordered to match the order of corresponding input glyphs in the Coverage table. To locate the corresponding output glyph index in the substituteGlyphIDs array, this format uses the Coverage index returned from the Coverage table.
Example 3 at the end of this chapter uses format 2 to substitute vertically oriented glyphs for horizontally oriented glyphs.
SingleSubstFormat2 subtable
Type | Name | Description |
---|---|---|
uint16 | format | Format identifier: format = 2. |
Offset16 | coverageOffset | Offset to Coverage table, from beginning of substitution subtable. |
uint16 | glyphCount | Number of glyph IDs in the substituteGlyphIDs array. |
uint16 | substituteGlyphIDs[glyphCount] | Array of substitute glyph IDs — ordered by Coverage index. |
Lookup type 2 subtable: multiple substitution
A multiple substitution (MultipleSubst) subtable replaces a single glyph with a sequence of glyphs, as when multiple glyphs replace a single ligature. The subtable has a single format.
Multiple substitution format 1
The MultipleSubstFormat1 subtable specifies a format identifier, an offset to a Coverage table that defines the input glyph indices, and an array of offsets to Sequence tables that define the output glyph indices. The Sequence table offsets are ordered by the Coverage index of the input glyphs.
For each input glyph listed in the Coverage table, a Sequence table defines the output glyphs. Each Sequence table contains a count of the glyphs in the output glyph sequence and an array of output glyph indices.
Note: The order of the output glyph indices depends on the writing direction of the text. For text written left to right, the left-most glyph will be first glyph in the sequence. Conversely, for text written right to left, the right-most glyph will be first.
The use of multiple substitution for deletion of an input glyph is prohibited. The glyphCount value must always be greater than 0.
Example 4 at the end of this chapter shows how to replace a single ligature with three glyphs.
Type | Name | Description |
---|---|---|
uint16 | format | Format identifier: format = 1. |
Offset16 | coverageOffset | Offset to Coverage table, from beginning of substitution subtable. |
uint16 | sequenceCount | Number of Sequence table offsets in the sequenceOffsets array. |
Offset16 | sequenceOffsets[sequenceCount] | Array of offsets to Sequence tables. Offsets are from beginning of substitution subtable, ordered by Coverage index. |
Sequence table
Type | Name | Description |
---|---|---|
uint16 | glyphCount | Number of glyph IDs in the substituteGlyphIDs array. This must always be greater than 0. |
uint16 | substituteGlyphIDs[glyphCount] | String of glyph IDs to substitute. |
Lookup type 3 subtable: alternate substitution
An alternate substitution (AlternateSubst) subtable identifies any number of aesthetic alternatives from which a user can choose a glyph variant to replace the input glyph. For example, if a font contains four variants of the ampersand symbol, the 'cmap' table will specify the index of one of the four glyphs as the default glyph index, and an AlternateSubst subtable will list the indices of the other three glyphs as alternatives. A text-processing client would then have the option of replacing the default glyph with any of the three alternatives.
The subtable has one format.
Alternate substitution format 1
The AlternateSubstFormat1 subtable contains a format identifier, an offset to a Coverage table containing the indices of glyphs with alternative forms, and an array of offsets to AlternateSet tables.
For each glyph in the Coverage table, an AlternateSet subtable contains a count of the alternative glyphs and an array of their glyph indices. Because all the glyphs are functionally equivalent, they can be in any order in the array.
Example 5 at the end of this chapter shows how to replace the default ampersand glyph with alternative glyphs.
AlternateSubstFormat1 subtable
Type | Name | Description |
---|---|---|
uint16 | format | Format identifier: format = 1. |
Offset16 | coverageOffset | Offset to Coverage table, from beginning of substitution subtable. |
uint16 | alternateSetCount | Number of AlternateSet tables |
Offset16 | alternateSetOffsets[alternateSetCount] | Array of offsets to AlternateSet tables. Offsets are from beginning of substitution subtable, ordered by Coverage index. |
AlternateSet table
Type | Name | Description |
---|---|---|
uint16 | glyphCount | Number of glyph IDs in the alternateGlyphIDs array. |
uint16 | alternateGlyphIDs[glyphCount] | Array of alternate glyph IDs, in arbitrary order. |
Lookup type 4 subtable: ligature substitution
A ligature substitution (LigatureSubst) subtable identifies ligature substitutions where a single glyph replaces multiple glyphs. One LigatureSubst subtable can specify any number of ligature substitutions. The subtable has one format.
Ligature substitution format 1
The LigatureSubstFormat1 subtable contains a format identifier, a Coverage table offset, and an array of offsets to LigatureSet tables. The Coverage table specifies only the index of the first glyph component of each ligature set.
Example 6 at the end of this chapter shows how to replace a string of glyphs with a single ligature.
LigatureSubstFormat1 subtable
Type | Name | Description |
---|---|---|
uint16 | format | Format identifier: format = 1. |
Offset16 | coverageOffset | Offset to Coverage table, from beginning of substitution subtable. |
uint16 | ligatureSetCount | Number of LigatureSet tables. |
Offset16 | ligatureSetOffsets[ligatureSetCount] | Array of offsets to LigatureSet tables. Offsets are from beginning of substitution subtable, ordered by Coverage index. |
A LigatureSet table, one for each covered glyph, specifies all the ligature sequences that begin with the covered glyph. For example, if the Coverage table lists the glyph index for a lowercase “f,” then a LigatureSet table will define ligature that begin with “f”, such as the “ffl”, “fl”, “ffi”, “fi” and “ff” ligatures. If the Coverage table also lists the glyph index for a lowercase “e”, then a different LigatureSet table will define ligatures that begin with “e”, such as the “etc” ligature.
A LigatureSet table consists of a count of the ligatures that begin with the covered glyph and an array of offsets to Ligature tables, which define the glyphs in each ligature. The order in the Ligature offset array defines the preference for using the ligatures. For example, if the “ffl” ligature is preferable to the “ff” ligature, then the Ligature array would list the offset to the “ffl” Ligature table before the offset to the “ff” Ligature table.
LigatureSet table
Type | Name | Description |
---|---|---|
uint16 | ligatureCount | Number of Ligature tables. |
Offset16 | ligatureOffsets[LigatureCount] | Array of offsets to Ligature tables. Offsets are from beginning of LigatureSet table, ordered by preference. |
For each ligature in the set, a Ligature table specifies the glyph ID of the output ligature glyph; a count of the total number of component glyphs in the ligature, including the first component; and an array of glyph IDs for the components. The array starts with the second component glyph in the ligature (input glyph sequence index = 1, componentGlyphIDs array index = 0) because the first component glyph is specified in the Coverage table.
Note: The componentGlyphIDs array lists glyph IDs according to the writing direction — that is, the logical order — of the text. For text written right to left, the right-most glyph will be first. Conversely, for text written left to right, the left-most glyph will be first.
Ligature table
Type | Name | Description |
---|---|---|
uint16 | ligatureGlyph | Glyph ID of ligature to substitute. |
uint16 | componentCount | Number of components in the ligature. |
uint16 | componentGlyphIDs[componentCount - 1] | Array of component glyph IDs — start with the second component, ordered in writing direction. |
Lookup type 5 subtable: contextual substitution
A contextual substitution subtable describes glyph substitutions in context that replace one or more glyphs within a certain pattern of glyphs.
Contextual substitution subtables can use any of three formats that are common to the GSUB and GPOS tables. These define input sequence patterns to be matched against the text glyph sequence, and then actions to be applied to glyphs within the input sequence. The actions are specified as “nested” lookups, and each is applied to a particular sequence position within the input sequence.
Each sequence position + nested lookup combination is specified in a SequenceLookupRecord. Examples 7, 8, and 9 at the end of this chapter illustrate use of sequence lookup records within the GSUB table.
While the subtable formats are common between the GSUB and GPOS tables, the lookups referenced by sequence lookup records within the GSUB table are referenced by index into the GSUB LookupList table. In this way, actions specified by a GSUB contextual lookup can only be substitutions.
An input sequence pattern is matched against the current glyph sequence before any substitution actions are performed. The substitutions may change the current glyph sequence, but that has no effect on the initial matching operation. For a given lookup subtable, there may be multiple sequence lookup records, and these are processed in the specified order. Each substitution action on the glyph sequence applies to the results from the preceding sequence lookup records. Note in particular that the sequence position index in each sequence lookup record is relative to the glyph sequence as modified by the actions of preceding SequenceLookupRecords.
For example, consider a contextual lookup specifying an input glyph sequence of four glyphs. Suppose that no substitution is performed on the first glyph, but that the middle two glyphs will be replaced with a ligature, and a single glyph will replace the fourth glyph. Suppose also that the actions are listed in that order.
- The first glyph is at sequence position 0. No SequenceLookupRecord is specified for sequence index 0.
- The first SequenceLookupRecord specifies sequence position 1 and gives a LookupList index referencing a ligature substitution lookup. The first glyph specified in the nested lookup will be the glyph at sequence position 1; the second glyph specified in the nested lookup will be the glyph at sequence position 2. After the nested substitution has been performed, there will be three glyphs in the sequence context, not four.
- The last SequenceLookupRecord is defined in terms of the modified sequence context, specifying sequence position 2, not position 3. The nested single-substitution lookup will specify the glyph at position 2 as its input glyph.
Contextual substitution format 1: simple glyph contexts
Format 1 defines the context for a glyph substitution as a particular sequence of glyphs. For example, a context could be <xyz>, <holiday>, <!?*#@>, or any other glyph sequence.
For example, suppose the glyph string <abc> is to be replaced with its reverse glyph string <cba>. The input context would be defined as the glyph sequence, <abc>. Two single-substitution actions can be specified: the “a” at sequence position 0 is substituted by “c”, and the “c” at sequence position 2 is substituted by “a”.
Format 1 contextual substitutions are implemented using a SequenceContextFormat1 table. See Sequence context format 1: simple glyph contexts in the OpenType Layout Common Table Formats chapter for complete details.
Example 7 at the end of the chapter uses a SequenceContextFormat1 table to replace a sequence of three glyphs with a sequence preferred for the French language system.
Contextual substitution format 2: class-based glyph contexts
Format 2 defines contexts for glyph substitutions as input sequence patterns, with patterns expressed in terms of glyph classes. The glyph classes are defined using a Class Definition table. Several sequence patterns may be specified, with each pattern specifying a class of glyphs for each input sequence position.
For example, suppose that a swash capital glyph should replace each uppercase letter glyph that is preceded by a space glyph and followed by a lowercase letter glyph (a glyph sequence of space - uppercase - lowercase). The set of uppercase glyphs would constitute one glyph class (class 1), the set of lowercase glyphs would constitute a second class (class 2), and the space glyph would constitute a third class (class 3). The input context might be specified as a pattern of one glyph from class 3, followed by one glyph from class 1, followed by one glyph from class 2.
Format 2 contextual substitutions are implemented using a SequenceContextFormat2 table. See Sequence context format 2: class-based glyph contexts in the OpenType Layout Common Table Formats chapter for complete details.
Example 8 at the end of this chapter uses a SequenceContextFormat2 table to substitute Arabic mark glyphs for base glyphs of different heights.
Contextual substitution format 3: coverage-based glyph contexts
Format 3 defines a context for glyph substitutions as an input sequence pattern, with the pattern expressed in terms of Coverage tables. A different Coverage table is defined for each sequence position.
Format 3 is like format 2 in that patterns are defined using sets of glyphs. However, with the glyph classes used in format 2, each glyph is in exactly one class. With format 3, any glyph can occur in multiple Coverage tables.
Unlike Formats 1 and 2, however, this format can define only one context.
For example, consider an input context that contains a lowercase glyph (position 0), followed by an uppercase glyph (position 1), either a lowercase or numeral glyph (position 2), and then either a lowercase or uppercase vowel (position 3). This context requires four Coverage tables, one for each position:
- For position 0, the Coverage table lists the set of lowercase glyphs.
- For position 1, the Coverage table lists the set of uppercase glyphs.
- For position 2, the Coverage table lists the set of lowercase and numeral glyphs, a superset of the glyphs defined in the Coverage table for position 0.
- For position 3, the Coverage table lists the set of lowercase and uppercase vowels, a subset of the glyphs defined in the Coverage tables for both positions 0 and 1.
Format 3 contextual substitutions are implemented using a SequenceContextFormat3 table. See Sequence context format 3: coverage-based glyph contexts in the OpenType Layout Common Table Formats chapter for complete details.
Example 9 at the end of this chapter uses SequenceContextFormat3 to substitute swash glyphs for two out of three glyphs in a sequence.
Lookup type 6 subtable: chained contexts substitution
A chained contexts substitution subtable describes glyph substitutions in context with an ability to look back and/or look ahead in the sequence of glyphs. The design of the chained contexts substitution subtable is parallel to that of the contextual substitution subtable, including the availability of three formats. Each format can describe one or more chained backtrack, input, and lookahead sequence combinations, and one or more substitutions for glyphs in each input sequence.
Note: Substitutions can be specified only for the input sequence context, not for backtrack and lookahead sequences.
See the introduction to the Contextual substitution section for general remarks regarding contextual substitutions, which also apply to chained contexts substitutions.
Note that backtrack sequences are specified in reverse logical order. See the Chained sequence context format 1 section in the OpenType Layout Common Table Formats chapter for details regarding chained backtrack, input, and lookahead sequences.
Chained contexts substitution format 1: simple glyph contexts
Format 1 defines the context for a glyph substitution as a particular sequence of glyphs. For example, a context could be <xyz>, <holiday>, <!?*#@>, or any other glyph sequence. Specific glyph sequences are used for input, backtrack or lookahead contexts.
Format 1 chained context substitutions are implemented using a ChainedSequenceContextFormat1 table. See Chained sequence context format 1: simple glyph contexts in the OpenType Layout Common Table Formats chapter for complete details.
Chained contexts substitution format 2: class-based glyph contexts
Format 2 defines contexts for glyph substitutions as patterns expressed in terms of glyph classes. The glyph classes are defined using a Class Definition table. Several sequence patterns may be specified, with each pattern specifying a class of glyphs for each sequence position.
To chain contexts, three separate Class Definition tables are used for the backtrack sequence, input sequence, and lookahead sequence.
Format 2 contextual substitutions are implemented using a ChainedSequenceContextFormat2 table. See Chained sequence context format 2: class-based glyph contexts in the OpenType Layout Common Table Formats chapter for complete details.
Chained contexts substitution format 3: coverage-based glyph contexts
Format 3 defines contexts for glyph substitutions as patterns expressed in terms of Coverage tables. A different Coverage table is defined for each position in a sequence. To chain contexts, three separate sets of Coverage tables are used for the backtrack sequence, input sequence, and lookahead sequence.
Format 3 is like format 2 in that patterns are defined using sets of glyphs. However, with the glyph classes used in format 2, each glyph is in exactly one class. With format 3, any glyph can occur in multiple Coverage tables.
Format 3 contextual substitutions are implemented using a ChainedSequenceContextFormat3 table. See Chained sequence context format 3: coverage-based glyph contexts in the OpenType Layout Common Table Formats chapter for complete details.
Lookup type 7 subtable: substitution subtable extension
This lookup type provides a way to access lookup subtables within the GSUB table using 32-bit offsets. This is needed if the total size of the subtables exceeds the 16-bit limits of the various other offsets in the GSUB table. In this specification, the subtable stored at the 32-bit offset location is termed the “extension” subtable.
This subtable type uses one format.
Substitution extension format 1
SubstExtensionFormat1 subtable
Type | Name | Description |
---|---|---|
uint16 | format | Format identifier. Set to 1. |
uint16 | extensionLookupType | Lookup type of subtable referenced by extensionOffset (that is, the extension subtable). |
Offset32 | extensionOffset | Offset to the extension subtable, of lookup type extensionLookupType, relative to the start of the ExtensionSubstFormat1 subtable. |
The extensionLookupType field must be set to any lookup type other than 7. If a lookup table uses extension subtables, then all of the extension subtables must have the same extensionLookupType. All offsets to extension subtables are set in the usual way—that is, relative to start of the ExtensionSubstFormat1 subtable.
When a layout engine encounters a GSUB type 7 Lookup table, it shall:
- Proceed as though the Lookup table’s lookupType field were set to the extensionLookupType of the subtables.
- Proceed as though each extension subtable referenced by extensionOffset replaced the type 7 subtable that referenced it.
Lookup type 8 subtable: reverse chained contexts single substitution
The reverse chaining contextual single substitution subtable (ReverseChainSingleSubst) describes single-glyph substitutions in context with an ability to look back and/or look ahead in the sequence of glyphs. The major difference between this and other lookup types is that processing of input glyph sequence goes from end to start.
Compared to chained contexts substitution (lookup subtable type 6), this format is restricted to only a coverage-based subtable format, input sequences can contain only a single glyph, and only single substitutions are allowed on this glyph. This constraint is integrated into the subtable format.
This lookup type is designed specifically for Arabic script writing styles like Nastaliq in which the shape of the glyph is determined by the following glyph, beginning at the last glyph of the “joor”, or set of connected glyphs.
This subtable type uses one format.
Reverse chained contexts single substitution format 1: coverage-based glyph contexts
Format 1 defines a chaining context rule as a sequence of Coverage tables. Each position in the sequence may define a different Coverage table for the set of glyphs that matches the context pattern. With format 1, the glyph sets defined in the different Coverage tables may intersect.
Despite reverse order processing, the order of the Coverage tables listed in the Coverage array must be in logical order (follow the writing direction). The backtrack sequence is as illustrated for the Chained sequence context format 1 table, in the OpenType Layout Common Table Formats chapter. The input sequence is one glyph located at i in the logical string. The backtrack begins at i - 1 and increases in offset value as one moves toward the logical beginning of the string. The lookahead sequence begins at i + 1 and increases in offset value as one moves toward the logical end of the string. In processing a reverse chaining substitution, i begins at the logical end of the string and moves to the beginning.
The subtable contains a Coverage table for the input glyph and Coverage table arrays for backtrack and lookahead sequences. It also contains an array of substitute glyph indices (substituteGlyphIDs), which are substitutions for glyphs in the Coverage table, and a count of glyphs in the substituteGlyphIDs array. The substituteGlyphIDs array must contain the same number of glyph indices as the Coverage table. To locate the corresponding output glyph index in the substituteGlyphIDs array, this format uses the Coverage index returned from the Coverage table.
Example 10 at the end of this chapter uses ReverseChainSingleSubstFormat1 to substitute Arabic glyphs with a correct stroke thickness on the left (exit) to match the stroke thickness on the right (entry) of the following glyph (in logical order).
ReverseChainSingleSubstFormat1 subtable
Type | Name | Description |
---|---|---|
uint16 | format | Format identifier: format = 1. |
Offset16 | coverageOffset | Offset to Coverage table, from beginning of substitution subtable. |
uint16 | backtrackGlyphCount | Number of glyphs in the backtrack sequence. |
Offset16 | backtrackCoverageOffsets[backtrackGlyphCount] | Array of offsets to coverage tables in backtrack sequence, in glyph sequence order. |
uint16 | lookaheadGlyphCount | Number of glyphs in lookahead sequence. |
Offset16 | lookaheadCoverageOffsets[lookaheadGlyphCount] | Array of offsets to coverage tables in lookahead sequence, in glyph sequence order. |
uint16 | glyphCount | Number of glyph IDs in the substituteGlyphIDs array. |
uint16 | substituteGlyphIDs[glyphCount] | Array of substitute glyph IDs — ordered by Coverage index. |
GSUB structure examples
The rest of this chapter describes and illustrates examples of the various GSUB subtables, including each of the three formats available for contextual substitutions. All the examples reflect unique parameters described below, but the samples provide a useful reference for building subtables specific to other situations.
All the examples have three columns showing hex data, source, and comments.
Example 1: GSUB Header
Example 1 shows a typical GSUB Header table definition.
Example 1
Hex Data | Source | Comments |
---|---|---|
GSUBHeader TheGSUBHeader |
GSUBHeader table definition | |
00010000 | 0x00010000 | major/minor version |
000A | TheScriptList | offset to ScriptList table |
001E | TheFeatureList | offset to FeatureList table |
002C | TheLookupList | offset to LookupList table |
Example 2: SingleSubstFormat1 subtable
Example 2 illustrates the SingleSubstFormat1 subtable , which uses ranges to replace single input glyphs with their corresponding output glyphs. The indices of the output glyphs are calculated by adding a constant delta value to the indices of the input glyphs. In this example, the Coverage table has a format identifier of 2 to indicate the range format, which is used because the input glyph indices are in consecutive order in the font. The Coverage table specifies one range that contains a startGlyphID for the “0” (zero) glyph and an endGlyphID for the “9” glyph.
Example 2
Hex Data | Source | Comments |
---|---|---|
SingleSubstFormat1 LiningNumeralSubtable |
SingleSubst subtable definition | |
0001 | 1 | format: calculated output glyph indices |
0006 | LiningNumeralCoverage | offset to Coverage table for input glyphs |
00C0 | 192 | deltaGlyphID = 192: add to each input glyph index to produce output glyph index |
CoverageFormat2 LiningNumeralCoverage |
Coverage table definition | |
0002 | 2 | coverageFormat: ranges |
1 | rangeCount | |
rangeRecords[0] | ||
004E | 78 | Start glyph ID for numeral zero glyph |
0058 | 87 | End glyph ID for numeral nine glyph |
0000 | 0 | startCoverageIndex: first CoverageIndex = 0 |
Example 3: SingleSubstFormat2 subtable
Example 3 uses the SingleSubstFormat2 subtable for lists to substitute punctuation glyphs in Japanese text that is written vertically. Horizontally oriented parentheses and square brackets (the input glyphs) are replaced with vertically oriented parentheses and square brackets (the output glyphs).
The Coverage table, format 1, identifies each input glyph index. The number of input glyph indices listed in the Coverage table matches the number of output glyph indices listed in the subtable. For correct substitution, the order of the glyph indices in the Coverage table (input glyphs) must match the order in the Substitute array (output glyphs).
Example 3
Hex Data | Source | Comments |
---|---|---|
SingleSubstFormat2 VerticalPunctuationSubtable |
SingleSubst subtable definition | |
0002 | 2 | format: lists |
000E | VerticalPunctuationCoverage | offset to Coverage table |
0004 | 4 | glyphCount — equals glyphCount in Coverage table |
0131 | VerticalOpenBracketGlyph | substituteGlyphIDs[0], ordered by Coverage index |
0135 | VerticalClosedBracketGlyph | substituteGlyphIDs[1] |
013E | VerticalOpenParenthesisGlyph | substituteGlyphIDs[2] |
0143 | VerticalClosedParenthesisGlyph | substituteGlyphIDs[3] |
CoverageFormat1 VerticalPunctuationCoverage |
Coverage table definition | |
0001 | 1 | coverageFormat: lists |
0004 | 4 | glyphCount |
003C | HorizontalOpenBracketGlyph | glyphArray[0], ordered by glyph ID |
0040 | HorizontalClosedBracketGlyph | glyphArray[1] |
004B | HorizontalOpenParenthesisGlyph | glyphArray[2] |
004F | HorizontalClosedParenthesisGlyph | glyphArray[3] |
Example 4: MultipleSubstFormat1 subtable
Example 4 uses a MultipleSubstFormat1 subtable to replace a single “ffi” ligature with three individual glyphs that form the string <ffi>. The subtable defines a format identifier of 1, an offset to a Coverage table that specifies the glyph index of the “ffi” ligature (the input glyph), an offset to a Sequence table that specifies the sequence of glyph indices for the <ffi> string in its substitute array (the output glyph sequence), and a count of Sequence table offsets.
Example 4
Hex Data | Source | Comments |
---|---|---|
MultipleSubstFormat1 FfiDecompSubtable |
MultipleSubst subtable definition | |
0001 | 1 | format |
0008 | FfiDecompCoverage | offset to Coverage table |
0001 | 1 | sequenceCount — equals glyphCount in Coverage table |
000E | FfiDecompSequence | sequenceOffsets[0] (offset to Sequence table 0) |
CoverageFormat1 FfiDecompCoverage |
Coverage table definition | |
0001 | 1 | coverageFormat: lists |
0001 | 1 | glyphCount |
00F1 | ffiGlyphID | ligature glyph |
Sequence FfiDecompSequence |
Sequence table definition | |
0003 | 3 | glyphCount |
001A | fGlyphID | first glyph in sequence order |
001A | fGlyphID | second glyph |
001D | iGlyphID | third glyph |
Example 5: AlternateSubstFormat 1 subtable
Example 5 uses the AlternateSubstFormat1 subtable to replace the default ampersand glyph (input glyph) with one of two alternative ampersand glyphs (output glyph).
In this case, the Coverage table specifies the index of a single glyph, the default ampersand, because it is the only glyph covered by this lookup. The AlternateSet table for this covered glyph identifies the alternative glyphs: AltAmpersand1GlyphID and AltAmpersand2GlyphID.
In Example 5, the index position of the AlternateSet table offset in the AlternateSet array is zero (0), which correlates with the index position (also zero) of the default ampersand glyph in the Coverage table.
Example 5
Hex Data | Source | Comments |
---|---|---|
AlternateSubstFormat1 AltAmpersandSubtable |
AlternateSubstFormat1 subtable definition | |
0001 | 1 | format |
0008 | AltAmpersandCoverage | offset to Coverage table |
0001 | 1 | alternateSetCount — equals glyphCount in Coverage table |
000E | AltAmpersandSet | alternateSetOffsets[0] (offset to AlternateSet table 0) |
CoverageFormat1 AltAmpersandCoverage |
Coverage table definition | |
0001 | 1 | coverageFormat: lists |
0001 | 1 | glyphCount |
003A | DefaultAmpersandGlyphID | glyphArray[0] |
AlternateSet AltAmpersandSet |
AlternateSet table definition | |
0002 | 2 | glyphCount |
00C9 | AltAmpersand1GlyphID | alternateGlyphIDs[0] — glyphs in arbitrary order |
00CA | AltAmpersand2GlyphID | alternateGlyphIDs[1] |
Example 6: LigatureSubstFormat1 subtable
Example 6 shows a LigatureSubstFormat1 subtable that defines data to replace a string of glyphs with a single ligature glyph. Because a LigatureSubstFormat1 subtable can specify glyph substitutions for more than one ligature, this subtable defines three ligatures: “etc”, “ffi”, and “fi.”
The sample subtable contains a format identifier (4) and an offset to a Coverage table. The Coverage table, which lists an index for each first glyph in the ligatures, lists indices for the “e” and “f” glyphs. The Coverage table range format is used here because the “e” and “f” glyph indices are numbered consecutively.
In the LigatureSubst subtable, ligatureSetCount specifies two LigatureSet tables, one for each covered glyph, and the ligatureSetOffsets array stores offsets to them. In this array, the “e” LigatureSet precedes the “f” LigatureSet, matching the order of the corresponding first-glyph components in the Coverage table.
Each LigatureSet table identifies all ligatures that begin with a covered glyph. The sample LigatureSet table defined for the “e” glyph contains only one ligature, “etc.” A LigatureSet table defined for the “f” glyph contains two ligatures, “ffi” and “fi.”
The sample FLigaturesSet table has offsets to two Ligature tables, one for “ffi” and one for “fi.” The ligatureOffsets array lists the “ffi” Ligature table first to indicate that the “ffi” ligature is preferred to the “fi” ligature.
Example 6
Hex Data | Source | Comments |
---|---|---|
LigatureSubstFormat1 LigaturesSubtable |
LigatureSubstFormat1 subtable definition | |
0001 | 1 | format |
000A | LigaturesCoverage | offset to Coverage table |
0002 | 2 | ligatureSetCount |
0014 | ELigaturesSet | ligatureSetOffsets[0] (offset to LigatureSet table 0) — LigatureSet tables in Coverage index order |
0020 | FLigaturesSet | ligatureSetOffsets[1] |
CoverageFormat2 LigaturesCoverage |
Coverage table definition | |
0002 | 2 | coverageFormat: ranges |
0001 | 1 | rangeCount |
rangeRecords[0] | ||
0019 | eGlyphID | Start, first glyph ID |
001A | fGlyphID | End, last glyph ID in range |
0000 | 0 | startCoverageIndex: coverage index of start glyph ID = 0 |
LigatureSet ELigaturesSet |
LigatureSet table definition — all ligatures that start with e | |
0001 | 1 | ligatureCount |
0004 | etcLigature | ligatureOffsets[0] (offset to Ligature table 0) |
Ligature etcLigature |
Ligature table definition | |
015B | etcGlyphID | ligatureGlyph — output glyph ID |
0003 | 3 | componentCount |
0028 | tGlyphID | componentGlyphIDs[0] — second component in ligature |
0017 | cGlyphID | componentGlyphIDs[1] — third component in ligature |
LigatureSet FLigaturesSet |
LigatureSet table definition all ligatures start with f | |
0002 | 2 | ligatureCount |
0006 | ffiLigature | ligatureOffsets[0] — listed first because ffi ligature is preferred to fi ligature |
000E | fiLigature | ligatureOffsets[1] |
Ligature ffiLigature |
Ligature table definition | |
00F1 | ffiGlyphID | ligatureGlyph — output glyph ID |
0003 | 3 | componentCount |
001A | fGlyphID | componentGlyphIDs[0] — second component in ligature |
001D | iGlyphID | componentGlyphIDs[1] — third component in ligature |
Ligature fiLigature |
Ligature table definition | |
00F0 | fiGlyphID | ligatureGlyph — output glyph ID |
0002 | 2 | componentCount |
001D | iGlyphID | componentGlyphIDs[0] — second component in ligature |
Example 7: Contextual substitution format 1
Example 7 illustrates format 1 contextual substitution, using a SequenceContextFormat1 subtable to replace a string of three glyphs with another string. For the French language system, the subtable defines a contextual substitution that replaces the input sequence, space-dash-space, with the output sequence, thin space-dash-thin space.
The contextual substitution, called Dash Lookup in this example, contains one SequenceContextFormat1 subtable called the DashSubtable. The subtable specifies two contexts: a SpaceGlyph followed by a DashGlyph, and a DashGlyph followed by a SpaceGlyph. In each sequence, a single substitution replaces the SpaceGlyph with a ThinSpaceGlyph.
The Coverage table, labeled DashCoverage, lists two glyph IDs for the first glyphs in the SpaceGlyph and DashGlyph sequences. One SequenceRuleSet table is defined for each covered glyph.
SpaceAndDashSubRuleSet lists all the contexts that begin with a SpaceGlyph. It contains an offset to one SequenceRule table (SpaceAndDashSubRule), which specifies two glyphs in the context sequence, the second of which is a DashGlyph. The SequenceRule table contains a SequenceLookupRecord that lists the position in the sequence where the glyph substitution should occur (position 0) and the index of the SpaceToThinSpaceLookup applied there to replace the SpaceGlyph with a ThinSpaceGlyph. DashAndSpaceSubRuleSet lists all the contexts that begin with a DashGlyph. An offset points to a SequenceRule table (DashAndSpaceSubRule), which specifies two glyphs in the context sequence, and the second one is a SpaceGlyph. The SequenceRule table contains a SequenceLookupRecord that lists the position in the sequence where the glyph substitution should occur, and an index to the same lookup used in the SpaceAndDashSubRule. The lookup replaces the SpaceGlyph with a ThinSpaceGlyph.
Example 7
Hex Data | Source | Comments |
---|---|---|
SequenceContextFormat1 DashSubtable |
SequenceContextFormat1 subtable definition for Lookup[0], DashLookup | |
0001 | 1 | format |
000A | DashCoverage | offset to Coverage table |
0002 | 2 | seqRuleSetCount |
0012 | SpaceAndDashSubRuleSet | seqRuleSetOffsets[0] (offset to SequenceRuleSet table 0) — SequenceRuleSets ordered by Coverage index |
0020 | DashAndSpaceSubRuleSet | seqRuleSetOffsets[1] |
CoverageFormat1 DashCoverage |
Coverage table definition | |
0001 | 1 | coverageFormat: lists |
0002 | 2 | glyphCount |
0028 | SpaceGlyph | glyphArray[0] — glyphs in numeric order |
005D | DashGlyph | glyphArray[1], dash glyph ID |
SequenceRuleSet SpaceAndDashSubRuleSet |
SequenceRuleSet[0] table definition | |
0001 | 1 | seqRuleCount |
0004 | SpaceAndDashSubRule | seqRuleOffsets[0] (offset to SequenceRule table 0) — SequenceRule tables ordered by preference |
SequenceRule SpaceAndDashSubRule |
SequenceRule[0] table definition | |
0002 | 2 | glyphCount — number in input sequence |
0001 | 1 | seqLookupCount |
005D | DashGlyph | inputSequence[0], starting with second glyph — SpaceGlyph, in Coverage table, is first glyph |
seqLookupRecords[0] | ||
0000 | 0 | sequenceIndex — substitution at first glyph position (0) |
0001 | 1 | lookupListIndex — index for SpaceToThinSpaceLookup in LookupList |
SequenceRuleSet DashAndSpaceSubRuleSet |
SequenceRuleSet[1] table definition | |
0001 | 1 | seqRuleCount |
0004 | DashAndSpaceSubRule | seqRuleOffsets[0] (offset to SequenceRule table 0) — SequenceRule tables ordered by preference |
SequenceRule DashAndSpaceSubRule |
SequenceRule[0] table definition | |
0002 | 2 | glyphCount — number in the input glyph sequence |
0001 | 1 | seqLookupCount |
0028 | SpaceGlyph | inputSequence[0] — starting with second glyph |
seqLookupRecords[0] | ||
0001 | 1 | sequenceIndex — substitution at second glyph position (glyph sequence index = 1) |
0001 | 1 | lookupListIndex — index for SpaceToThinSpaceLookup |
Example 8: Contextual substitution format 2
Example 8 illustrates a format 2 contextual substitution using a SequenceContextFormat2 subtable with glyph classes to replace default mark glyphs with their alternative forms. Glyph alternatives are selected depending upon the height of the base glyph that they combine with; that is, the mark glyph used above a high base glyph differs from the mark glyph above a very high base glyph.
In the example, SetMarksHighSubtable contains a Class Definition table that defines four glyph classes: default mark glyphs (class 1), high base glyphs (class 2), very high base glyphs (class 3), and all remaining glyphs, including medium-height base glyphs. The subtable also contains a Coverage table that lists each base glyph that functions as a first component in a context, ordered by glyph index.
Two ClassSequenceRuleSet tables are defined, one for substituting high marks and one for very high marks. No ClassSequenceRuleSets are specified for class 0 and class 1 glyphs because no contexts begin with glyphs from these classes. The classSeqRuleSetOffsets lists offsets to the ClassSequenceRuleSet tables in class value order, so the offset for ClassSequenceRuleSet for class 2 precedes that for class 3.
Within each ClassSequenceRuleSet, a ClassSequencRule is defined. In SetMarksHighSubClassSet2, corresponding to contexts that begin with a glyph in class 2, the ClassSequenceRule table specifies an input sequence with two glyphs: the first glyph in class 2 (a high glyph), and the second in class 1 (a mark glyph). The SequenceLookupRecord specifies applying SubstituteHighMarkLookup at the second position in the sequence—that is, a high mark glyph will replace the default mark glyph.
In SetMarksVeryHighSubClassSet3, corresponding to contexts that begin with a glyph in class 3, the ClassSequencRule specifies an input sequence with two glyphs: the first in class 3 (a very high glyph), and the second in class 1 (a mark glyph). The SequenceLookupRecord specifies applying SubstituteVeryHighMarkLookup at the second position in the sequence—that is, a very high mark glyph will replace the default mark glyph.
Example 8
Hex Data | Source | Comments |
---|---|---|
SequenceContextFormat2 SetMarksHighSubtable |
SequenceContextFormat2 subtable definition | |
0002 | 2 | format |
0010 | SetMarksHighCoverage | offset to Coverage table |
001C | SetMarksHighClassDef | offset to Class Def table |
0004 | 4 | classSeqRuleSetCount |
0000 | NULL | classSeqRuleSetOffsets[0] — NULL: no contexts that begin with Class 0 glyphs are defined |
0000 | NULL | classSeqRuleSetOffsets[1] — no contexts that begin with Class 1 glyphs are defined |
0032 | SetMarksHighSubClassSet2 | classSeqRuleSetOffsets[2] — offset to ClassSequencRuleSet table for contexts that begin with Class 2 glyphs (high base glyphs) |
0040 | SetMarksVeryHighSubClassSet3 | classSeqRuleSetOffsets[3] — offset to ClassSequencRuleSet table for contexts that begin with Class 3 glyphs (very high base glyphs) |
CoverageFormat1 SetMarksHighCoverage |
Coverage table definition | |
0001 | 1 | coverageFormat: lists |
0004 | 4 | glyphCount |
0030 | tahGlyphID | glyphArray[0], high base glyph |
0031 | dhahGlyphID | glyphArray[1], high base glyph |
0040 | cafGlyphID | glyphArray[2], very high base glyph |
0041 | gafGlyphID | glyphArray[3], very high base glyph |
ClassDefFormat2 SetMarksHighClassDef |
Class table definition | |
0002 | 2 | classFormat: ranges |
0003 | 3 | classRangeCount |
classRangeRecords[0] | ClassRangeRecords ordered by startGlyphID; record for Class 2, high base glyphs | |
0030 | tahGlyphID | Start, first Glyph ID in range |
0031 | dhahGlyphID | End, last Glyph ID in range |
0002 | 2 | class: 2 |
classRangeRecords[1] | ClassRangeRecord for Class 3, very high base glyphs | |
0040 | cafGlyphID | Start, first Glyph ID in the range |
0041 | gafGlyphID | End, last Glyph ID in the range |
0003 | 3 | class: 3 ClassRange[2] for Class 1, mark gyphs |
classRangeRecords[2] | ClassRangeRecord for Class 1, mark glyphs | |
00D2 | fathatanDefaultGlyphID | Start, first Glyph ID in range default fathatan mark |
00D3 | dammatanDefaultGlyphID | End, last Glyph ID in the range default dammatan mark |
0001 | 1 | class: 1 |
ClassSequencRuleSet SetMarksHighSubClassSet2 |
ClassSequencRuleSet[2] table definition— all contexts that begin with Class 2 glyphs | |
0001 | 1 | classSeqRuleCount |
0004 | SetMarksHighSubClassRule2 | classSeqRuleOffsets[0] (offset to ClassSequenceRule table 0) — ClassSequenceRule tables ordered by preference |
ClassSequenceRule SetMarksHighSubClassRule2 |
ClassSequenceRule[0] table definition, Class 2 glyph (high base) glyph followed by a Class 1 glyph (mark) | |
0002 | 2 | glyphCount |
0001 | 1 | seqLookupCount |
0001 | 1 | inputSequence[0] — input sequence beginning with the second Class in the input context sequence; Class 1, mark glyphs |
seqLookupRecords[0] | seqLookupRecords array in design order | |
0001 | 1 | sequenceIndex — apply substitution to position 2, a mark |
0001 | 1 | lookupListIndex |
ClassSequencRuleSet SetMarksVeryHighSubClassSet3 |
ClassSequencRuleSet[3] table definition — all contexts that begin with Class 3 glyphs | |
0001 | 1 | classSeqRuleCount |
0004 | SetMarksVeryHighSubClassRule3 | classSeqRuleOffsets[0] |
ClassSequenceRule SetMarksVeryHighSubClassRule3 |
ClassSequenceRule[0] table definition — Class 3 glyph (very high base glyph) followed by a Class 1 glyph (mark) | |
0002 | 2 | glyphCount |
0001 | 1 | seqLookupCount |
0001 | 1 | inputSequence[0] — input sequence beginning with the second Class in the input context sequence; Class 1, mark glyphs |
seqLookupRecords[0] | seqLookupRecords array in design order | |
0001 | 1 | sequenceIndex — apply substitution to position 2, second glyph class (mark) |
0002 | 2 | lookupListIndex |
Example 9: Contextual substitution format 3
Example 9 illustrates a format 3 contextual substitution, using a SequenceContextFormat3 subtable with Coverage tables to describe a context sequence of three lowercase glyphs in the pattern: any ascender or descender glyph in position 0 (zero), any x-height glyph in position 1, and any descender glyph in position 2. The overlapping sets of covered glyphs for positions 0 and 2 make Format 3 better for this context than the class-based Format 2.
In positions 0 and 2, swash versions of the glyphs replace the default glyphs. The contextual-substitution lookup is SwashLookup (LookupList index = 0), and its subtable is SwashSubtable. The SwashSubtable defines three Coverage tables: AscenderDescenderCoverage, XheightCoverage, and DescenderCoverage-one for each glyph position in the context sequence, respectively.
The SwashSubtable also defines two SequenceLookupRecords: one that applies to position 0, and one for position 2. (No substitutions are applied to position 1.) The record for position 0 uses a single substitution lookup called AscDescSwashLookup to replace the current ascender or descender glyph with a swash ascender or descender glyph. The record for position 2 uses a single substitution lookup called DescSwashLookup to replace the current descender glyph with a swash descender glyph.
Example 9
Hex Data | Source | Comments |
---|---|---|
SequenceContextFormat3 SwashSubtable |
SequenceContextFormat3 subtable definition | |
0003 | 3 | format |
0003 | 3 | glyphCount — number in input glyph sequence |
0002 | 2 | seqLookupCount |
0014 | AscenderDescenderCoverage | coverageOffsets[0] — offsets to Coverage tables, in context sequence order |
0030 | XheightCoverage | coverageOffsets[1] |
0052 | DescenderCoverage | coverageOffsets[2] |
seqLookupRecords[0] | SequenceLookupRecords in glyph position order | |
0000 | 0 | sequenceIndex |
0001 | 1 | lookupListIndex — single substitution to output ascender or descender swash |
seqLookupRecords[1] | ||
0002 | 2 | sequenceIndex |
0002 | 2 | lookupListIndex — single substitution to output descender swash |
CoverageFormat1 AscenderDescenderCoverage |
Coverage table definition | |
0001 | 1 | coverageFormat: lists |
000C | 12 | glyphCount |
0033 | bGlyphID | glyphArray[0] — glyphs in glyph ID order |
0035 | dGlyphID | glyphArray[1] |
0037 | fGlyphID | glyphArray[2] |
0038 | gGlyphID | glyphArray[3] |
0039 | hGlyphID | glyphArray[4] |
003B | jGlyphID | glyphArray[5] |
003C | kGlyphID | glyphArray[6] |
003D | lGlyphID | glyphArray[7] |
0041 | pGlyphID | glyphArray[8] |
0042 | qGlyphID | glyphArray[9] |
0045 | tGlyphID | glyphArray[10] |
004A | yGlyphID | glyphArray[11] |
CoverageFormat1 XheightCoverage |
Coverage table definition | |
0001 | 1 | coverageFormat: lists |
000F | 15 | glyphCount |
0032 | aGlyphID | glyphArray[0] |
0034 | cGlyphID | glyphArray[1] |
0036 | eGlyphID | glyphArray[2] |
003A | iGlyphID | glyphArray[3] |
003E | mGlyphID | glyphArray[4] |
003F | nGlyphID | glyphArray[5] |
0040 | oGlyphID | glyphArray[6] |
0043 | rGlyphID | glyphArray[7] |
0044 | sGlyphID | glyphArray[8] |
0045 | tGlyphID | glyphArray[9] |
0046 | uGlyphID | glyphArray[10] |
0047 | vGlyphID | glyphArray[11] |
0048 | wGlyphID | glyphArray[12] |
0049 | xGlyphID | glyphArray[13] |
004B | zGlyphID | GlyphArray[14] |
CoverageFormat1 DescenderCoverage |
Coverage table definition | |
0001 | 1 | coverageFormat: lists |
0005 | 5 | glyphCount |
0038 | gGlyphID | glyphArray[0] |
003B | jGlyphID | glyphArray[1] |
0041 | pGlyphID | glyphArray[2] |
0042 | qGlyphID | glyphArray[3] |
004A | yGlyphID | glyphArray[4] |
Example 10: ReverseChainSingleSubstFormat1 subtable
Example 10 uses a ReverseChainSingleSubstFormat1 subtable to substitute glyphs with a form that has a thick connection to the left (thick exit). This allows the glyph to correctly connect to the letter form to the left of it.
The ThickExitCoverage table is the listing of glyphs to be matched for substitution.
The LookaheadCoverage table, labeled ThickEntryCoverage, lists four glyph IDs for the glyph following a substitution coverage glyph. This lookahead coverage attempts to match the context that will cause the substitution to take place.
The substituteGlyphIDs array provides the glyphs to replace glyphs that correspond in order in the ThickExitCoverage table.
Example 10
Hex Data | Source | Comments |
---|---|---|
ReverseChainSingleSubstFormat1 ThickConnect |
ReverseChainSingleSubstFormat1 subtable definition | |
0001 | 1 | format |
0068 | ThickExitCoverage | offset to Coverage table |
0000 | 0 | backtrackGlyphCount |
0000 | null - not used | backtrackCoverageOffsets[0] |
0001 | 1 | lookaheadGlyphCount |
0026 | ThickEntryCoverage | lookaheadCoverageOffsets[0] |
000C | 12 | glyphCount |
00A7 | BEm2 | substituteGlyphIDs[0] — substitute glyphs ordered by Coverage index |
00B9 | BEi3 | substituteGlyphIDs[1] |
00C5 | JIMm3 | substituteGlyphIDs[2] |
00D4 | JIMi2 | substituteGlyphIDs[3] |
00EA | SINm2 | substituteGlyphIDs[4] |
00F2 | SINi2 | substituteGlyphIDs[5] |
00FD | SADm2 | substituteGlyphIDs[6] |
010D | SADi2 | substituteGlyphIDs[7] |
011B | TOEm3 | substituteGlyphIDs[8] |
012B | TOEi3 | substituteGlyphIDs[9] |
013B | AINm2 | substituteGlyphIDs[10] |
0141 | AINi2 | substituteGlyphIDs[11] |
CoverageFormat1 ThickEntryCoverage |
Coverage table definition | |
0001 | 1 | coverageFormat: lists |
001F | 31 | glyphCount |
00A5 | ALEFf1 | glyphArray[0] — glyphs in glyph ID order |
00A9 | BEm4 | glyphArray[1] |
00AA | BEm5 | glyphArray[2] |
00E2 | DALf1 | glyphArray[3] |
0167 | KAFf1 | glyphArray[4] |
0168 | KAFfs1 | glyphArray[5] |
0169 | KAFm1 | glyphArray[6] |
016D | KAFm5 | glyphArray[7] |
016E | KAFm6 | glyphArray[8] |
0170 | KAFm8 | glyphArray[9] |
0183 | GAFf1 | glyphArray[10] |
0184 | GAFfs1 | glyphArray[11] |
0185 | GAFm1 | glyphArray[12] |
0189 | GAFm5 | glyphArray[13] |
018A | GAFm6 | glyphArray[14] |
018C | GAFm8 | glyphArray[15] |
019F | LAMf1 | glyphArray[16] |
01A0 | LAMm1 | glyphArray[17] |
01A1 | LAMm2 | glyphArray[18] |
01A2 | LAMm3 | glyphArray[19] |
01A3 | LAMm4 | glyphArray[20] |
01A4 | LAMm5 | glyphArray[21] |
01A5 | LAMm6 | glyphArray[22] |
01A6 | LAMm7 | glyphArray[23] |
01A7 | LAMm8 | glyphArray[24] |
01A8 | LAMm9 | glyphArray[25] |
01A9 | LAMm10 | glyphArray[26] |
01AA | LAMm11 | glyphArray[27] |
01AB | LAMm12 | glyphArray[28] |
01AC | LAMm13 | glyphArray[29] |
01EC | HAYf2 | glyphArray[30] |
CoverageFormat1 ThickExitCoverage |
Coverage table definition | |
0001 | 1 | coverageFormat: lists |
000C | 12 | glyphCount |
00A6 | BEm1 | glyphArray[0] |
00B7 | BEi1 | glyphArray[1] |
00C3 | JIMm1 | glyphArray[2] |
00D2 | JIMi1 | glyphArray[3] |
00E9 | SINm1 | glyphArray[4] |
00F1 | SINi1 | glyphArray[5] |
00FC | SADm1 | glyphArray[6] |
010C | SADi1 | glyphArray[7] |
0119 | TOEm1 | glyphArray[8] |
0129 | TOEi1 | glyphArray[9] |
013A | AINm1 | glyphArray[10] |
0140 | AINi1 | glyphArray[11] |