Diacritic
From Wikipedia, the free encyclopedia
For the academic journal, see Diacritics (journal).
This article contains special characters. Without proper rendering support, you may see question marks, boxes, or
other symbols.
This page uses orthographic and related notations. For the notations ⟨ ⟩ , / / and [ ] used in this article,
see IPA Brackets and transcription delimiters.
Contents
• 1 Types
• 2 Diacritics specific to non-Latin alphabets
o 2.1 Arabic
o 2.2 Greek
o 2.3 Hebrew
o 2.4 Korean
o 2.5 Sanskrit and Indic
o 2.6 Syriac
• 3 Non-alphabetic scripts
• 4 Alphabetization or collation
• 5 Generation with computers
• 6 Languages with letters containing diacritics
o 6.1 Latin/Roman letters
o 6.2 Cyrillic letters
• 7 Diacritics that do not produce new letters
o 7.1 English
o 7.2 Other languages
• 8 Transliteration
• 9 Limits
o 9.1 Orthographic
o 9.2 Unorthographic/ornamental
• 10 List of diacritics in Unicode
• 11 See also
• 12 Notes
• 13 References
• 14 External links
Types
Larger renditions of these glyphs are given at § List of diacritics in Unicode, below
Among the types of diacritic used in alphabets based on the Latin script are:
Hebrew
• Niqqud
o ◌ּ – Dagesh
o ◌ּ – Mappiq
o ◌ֿ – Rafe
o ◌ׁ – Shin dot (at top right corner)
o ◌ׂ – Sin dot (at top left corner)
o ◌ְ – Shva
o ◌ֻ – Kubutz
o ◌ֹ – Holam
o ◌ָ – Kamatz
o ◌ַ – Patakh
o ◌ֶ – Segol
o ◌ֵ – Tzeire
o ◌ִ – Hiriq
• Cantillation marks do not generally render correctly; refer to Hebrew cantillation#Names
and shapes of the ta'amim for a complete table together with instructions for how to
maximize the possibility of viewing them in a web browser.
• Other
o ׳ – Geresh
o ״ – Gershayim
Korean
The diacritics 〮 and 〯 , known as Bangjeom (방점; 傍點), were used to mark pitch
accents in Hangul for Middle Korean. They were written to the left of a syllable in
vertical writing and above a syllable in horizontal writing.
Sanskrit and Indic
Further information: Brahmic scripts
In the Devanagari script (of the Brahmic family), compound letters, which are vowels combined
with consonants, are written with diacritics. For example, क (ka) can carry vowel diacritics.
Syriac
Further information: Syriac alphabet
Non-alphabetic scripts
Some non-alphabetic scripts also employ symbols that function essentially as diacritics.
Alphabetization or collation
Main article: Collation
Different languages use different rules to put diacritic characters in alphabetical order.
French and Portuguese treat letters with diacritical marks the same as the underlying
letter for purposes of ordering and dictionaries.
The Scandinavian languages and the Finnish language, by contrast, treat the
characters with diacritics å, ä, and ö as distinct letters of the alphabet, and sort them
after z. Usually ä (a-umlaut) and ö (o-umlaut) [used in Swedish and Finnish] are sorted
as equivalent to æ (ash) and ø (o-slash) [used in Danish and Norwegian]. Also, aa,
when used as an alternative spelling to å, is sorted as such. Other letters modified by
diacritics are treated as variants of the underlying letter, with the exception that ü is
frequently sorted as y.
Languages that treat accented letters as variants of the underlying letter usually
alphabetize words with such symbols immediately after similar unmarked words. For
instance, in German where two words differ only by an umlaut, the word without it is
sorted first in German dictionaries (e.g. schon and then schön, or fallen and then fällen).
However, when names are concerned (e.g. in phone books or in author catalogues in
libraries), umlauts are often treated as combinations of the vowel with a suffixed e;
Austrian phone books now treat characters with umlauts as separate letters
(immediately following the underlying vowel).
In Spanish, the grapheme ñ is considered a new letter different from n and collated
between n and o, as it denotes a different sound from that of a plain n. But the accented
vowels á, é, í, ó, ú are not separated from the unaccented vowels a, e, i, o, u, as the
acute accent in Spanish only modifies stress within the word or denotes a distinction
between homonyms, and does not modify the sound of a letter.
For a comprehensive list of the collating orders in various languages, see Collating
sequence.
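The two approaches described above (diacritics as variants of the base letter, as in German dictionaries, versus diacritics as separate letters sorted after z, as in Swedish) can be sketched with plain sort keys. This is a minimal illustration using only Python's standard library, not a full collation algorithm; the names `german_dictionary_key`, `swedish_key` and `SWEDISH_ALPHABET` are made up for this example, and real applications would normally rely on a locale-aware collation library.

```python
import unicodedata


def strip_marks(text: str) -> str:
    """Remove combining marks after canonical decomposition (NFD)."""
    decomposed = unicodedata.normalize("NFD", text)
    return "".join(ch for ch in decomposed if not unicodedata.combining(ch))


def german_dictionary_key(word: str) -> tuple[str, str]:
    """Compare base letters first; the original spelling only breaks ties,
    so the unmarked word sorts before the umlauted one."""
    folded = unicodedata.normalize("NFC", word).lower()
    return (strip_marks(folded), folded)


# Assumed alphabet for this sketch: a-z followed by å, ä, ö.
SWEDISH_ALPHABET = "abcdefghijklmnopqrstuvwxyz\u00e5\u00e4\u00f6"


def swedish_key(word: str) -> list[int]:
    """Rank each letter by its position in SWEDISH_ALPHABET.
    Characters outside that alphabet are simply ignored here."""
    folded = unicodedata.normalize("NFC", word).lower()
    return [SWEDISH_ALPHABET.index(ch) for ch in folded if ch in SWEDISH_ALPHABET]


german = sorted(["schön", "fällen", "schon", "fallen"], key=german_dictionary_key)
swedish = sorted(["öl", "zebra", "ål", "apa", "är"], key=swedish_key)
print(german)
print(swedish)
```

The German key keeps schon directly before schön, while the Swedish key places every word beginning with å, ä or ö after zebra.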
Diacritics that do not produce new letters
English
Main article: English terms with diacritical marks
English is one of the few European languages that does not have many words that contain
diacritical marks. Instead, digraphs are the main way the Modern English alphabet adapts the
Latin script to its phonemes. Exceptions are unassimilated foreign loanwords, including
borrowings from French (and, increasingly, Spanish, like jalapeño and piñata); however, the
diacritic is also sometimes omitted from such words. Loanwords that frequently appear with the
diacritic in English include café, résumé or resumé (a usage that helps distinguish it from the
verb resume), soufflé, and naïveté (see English terms with diacritical marks). In older
practice (and even among some orthographically conservative modern writers), one may see
examples such as élite, mêlée and rôle.
English speakers and writers once used the diaeresis more often than now in words such as
coöperation (from Fr. coopération), zoölogy (from Grk. zoologia), and seeër (now more commonly
see-er or simply seer) as a way of indicating that adjacent vowels belonged to separate
syllables, but this practice has become far less common. The New Yorker magazine is a major
publication that continues to use the diaeresis in place of a hyphen for clarity and economy
of space.[10]
A few English words, out of context, can only be distinguished from others by a diacritic or
modified letter, including exposé, lamé, maté, öre, øre, pâté, and rosé. The same is true of
résumé, alternatively resumé, but, nevertheless, it is regularly spelled resume. In a few
words, diacritics that did not exist in the original have been added for disambiguation, as in
maté (from Sp. and Port. mate), saké (the standard Romanization of the Japanese has no accent
mark), and Malé (from Dhivehi މާލެ), to clearly distinguish them from the English words mate,
sake, and male.
The acute and grave accents are occasionally used in poetry and lyrics: the acute to indicate
stress overtly where it might be ambiguous (rébel vs. rebél) or nonstandard for metrical
reasons (caléndar), the grave to indicate that an ordinarily silent or elided syllable is
pronounced (warnèd, parlìament).
In certain personal names such as Renée and Zoë, two spellings often exist, and the preference
will be known only to those close to the person. Even when the name of a person is spelled
with a diacritic, like Charlotte Brontë, this may be dropped in English-language articles, and
even in official documents such as passports, due either to carelessness, the typist not
knowing how to enter letters with diacritical marks, or technical reasons (California, for
example, does not allow names with diacritics, as the computer system cannot process such
characters). They also appear in some worldwide company names and/or trademarks, such as
Nestlé or Citroën.
Other languages
The following languages have letter-diacritic combinations that are not considered independent
letters.
• Afrikaans uses a diaeresis to mark vowels that are pronounced separately and not as one
would expect where they occur together, for example voel (to feel) as opposed to voël (bird).
The circumflex is used in ê, î, ô and û generally to indicate long close-mid, as opposed to
open-mid vowels, for example in the words wêreld (world) and môre (morning, tomorrow). The
acute accent is used to add emphasis in the same way as underlining or writing in bold or
italics in English, for example Dit is jóú boek (It is your book). The grave accent is used to
distinguish between words that are different only in placement of the stress, for example
appel (apple) and appèl (appeal), and in a few cases where it makes no difference to the
pronunciation but distinguishes between homophones. The two most usual cases of the latter are
in the sayings òf... òf (either... or) and nòg... nòg (neither... nor), to distinguish them
from of (or) and nog (again, still).
• Aymara uses a diacritical horn over p, q, t, k, ch.
• Catalan has the following composite characters: à, ç, é, è, í, ï, ó, ò, ú, ü, l·l. The acute
and the grave indicate stress and vowel height, the cedilla marks the result of a historical
palatalization, the diaeresis indicates either a hiatus, or that the letter u is pronounced
when the graphemes gü, qü are followed by e or i, and the interpunct (·) distinguishes the
different values of ll/l·l.
• Some orthographies of Cornish such as Kernowek Standard and Unified Cornish use diacritics,
while others such as Kernewek Kemmyn and the Standard Written Form do not (or only use them
optionally in teaching materials).
• Dutch uses the diaeresis. For example, in ruïne it means that the u and the i are separately
pronounced in their usual way, and not in the way that the combination ui is normally
pronounced. Thus it works as a separation sign and not as an indication for an alternative
version of the i. Diacritics can be used for emphasis (érg koud for very cold) or for
disambiguation between a number of words that are spelled the same when context doesn't
indicate the correct meaning (één appel = one apple, een appel = an apple; vóórkomen = to
occur, voorkómen = to prevent). Grave and acute accents are used on a very small number of
words, mostly loanwords. The ç also appears in some loanwords.[11]
• Faroese. Non-Faroese accented letters are not added to the Faroese alphabet. These include
é, ö, ü, å and recently also letters like š, ł, and ć.
• Filipino has the following composite characters: á, à, â, é, è, ê, í, ì, î, ó, ò, ô, ú, ù,
û. The actual use of diacritics for Filipino is, however, uncommon, and is meant only to
distinguish between homonyms with different stresses and meanings that occur near each other
in a text, or to aid the reader in ascertaining an otherwise ambiguous meaning. The letter eñe
comes from the Spanish alphabet and is likewise considered a separate letter. Diacritics
appear in Spanish loanwords and names if Spanish orthography is observed.
• Finnish. Carons in š and ž appear only in foreign proper names and loanwords, but may be
substituted with sh or zh if and only if it is technically impossible to produce accented
letters in the medium. Unlike in Estonian, š and ž are not considered distinct letters in
Finnish.
• French uses five diacritics. The grave (accent grave) marks the sound /ɛ/ when over an e, as
in père ("father"), or is used to distinguish words that are otherwise homographs, such as a/à
("has"/"to") or ou/où ("or"/"where"). The acute (accent aigu) is only used in "é", modifying
the "e" to make the sound /e/, as in étoile ("star"). The circumflex (accent circonflexe)
generally denotes that an S once followed the vowel in Old French or Latin, as in fête
("party"), the Old French being feste and the Latin being festum. Whether the circumflex
modifies the vowel's pronunciation depends on the dialect and the vowel. The cedilla (cédille)
indicates that a normally hard "c" (before the vowels "a", "o", and "u") is to be pronounced
/s/, as in ça ("that"). The diaeresis (French: tréma) indicates that two adjacent vowels that
would normally be pronounced as one are to be pronounced separately, as in Noël ("Christmas").
• Galician vowels can bear an acute (á, é, í, ó, ú) to indicate stress or to distinguish two
words that are otherwise spelled the same (é, 'is' vs. e, 'and'), but the diaeresis is only
used with ï and ü to show two separate vowel sounds in pronunciation. Only in foreign words
may Galician use other diacritics such as ç (common during the Middle Ages), ê, or à.
• German uses the three umlauted characters ä, ö and ü. These diacritics indicate vowel
changes. For instance, the word Ofen [ˈoːfən] "oven" has the plural Öfen [ˈøːfən]. The mark
originated as a superscript e; a handwritten blackletter e resembles two parallel vertical
lines, like a diaeresis. Due to this history, "ä", "ö" and "ü" can be written as "ae", "oe"
and "ue" respectively, if the umlaut letters are not available.
• Hebrew has many diacritic marks known as niqqud that are used above and below script to
represent vowels. These must be distinguished from cantillation marks, which are keys to
pronunciation and syntax.
• The International Phonetic Alphabet uses diacritic symbols and characters to indicate
phonetic features or secondary articulations.
• Irish uses the acute to indicate that a vowel is long: á, é, í, ó, ú. It is known as síneadh
fada "long sign" or simply fada "long" in Irish. In the older Gaelic type, overdots are used
to indicate lenition of a consonant: ḃ, ċ, ḋ, ḟ, ġ, ṁ, ṗ, ṡ, ṫ.
• Italian mainly has the acute and the grave (à, è/é, ì, ò/ó, ù), typically to indicate a
stressed syllable that would not be stressed under the normal rules of pronunciation but
sometimes also to distinguish between words that are otherwise spelled the same way (e.g. "e",
and; "è", is). Despite its rare use, Italian orthography allows the circumflex (î) too, in two
cases: it can be found in old literary contexts (roughly up to the 19th century) to signal a
syncope (fêro→fecero, they did), or in modern Italian to signal the contraction of "-ii" due
to the plural ending -i where the root ends with another -i; e.g., s. demonio, p.
demonii→demonî; in this case the circumflex also signals that the word intended is not demoni,
plural of "demone", by shifting the accent (demònî, "devils"; dèmoni, "demons").
• Lithuanian uses the acute, grave and tilde in dictionaries to indicate stress types in the
language's pitch accent system.
• Maltese also uses the grave on its vowels to indicate stress at the end of a word with two
syllables or more: lowercase letters à, è, ì, ò, ù; capital letters À, È, Ì, Ò, Ù.
• Māori makes use of macrons to mark long vowels.
• Occitan has the following composite characters: á, à, ç, é, è, í, ï, ó, ò, ú, ü, n·h, s·h.
The acute and the grave indicate stress and vowel height, the cedilla marks the result of a
historical palatalization, the diaeresis indicates either a hiatus, or that the letter u is
pronounced when the graphemes gü, qü are followed by e or i, and the interpunct (·)
distinguishes the different values of nh/n·h and sh/s·h (i.e., that the letters are supposed
to be pronounced separately, not combined into "ny" and "sh").
• Portuguese has the following composite characters: à, á, â, ã, ç, é, ê, í, ó, ô, õ, ú. The
acute and the circumflex indicate stress and vowel height, the grave indicates crasis, the
tilde represents nasalization, and the cedilla marks the result of a historical lenition.
• Acutes are also used in Slavic language dictionaries and textbooks to indicate lexical
stress, placed over the vowel of the stressed syllable. This can also serve to disambiguate
meaning: in Russian, писа́ть (pisáť) means "to write" but пи́сать (písať) means "to piss", and
"бо́льшая часть" (the biggest part) contrasts with "больша́я часть" (the big part).
• Spanish uses the acute and the diaeresis. The acute is used on a vowel in a stressed
syllable in words with irregular stress patterns. It can also be used to "break up" a
diphthong as in tío (pronounced [ˈti.o], rather than [ˈtjo] as it would be without the
accent). Moreover, the acute can be used to distinguish words that otherwise are spelled
alike, such as si ("if") and sí ("yes"), and also to distinguish interrogative and exclamatory
pronouns from homophones with a different grammatical function, such as donde/¿dónde?
("where"/"where?") or como/¿cómo? ("as"/"how?"). The acute may also be used to avoid
typographical ambiguity, as in 1 ó 2 ("1 or 2"; without the acute this might be interpreted as
"1 0 2"). The diaeresis is used only over u (ü) for it to be pronounced [w] in the
combinations gue and gui, where u is normally silent, for example ambigüedad. In poetry, the
diaeresis may be used on i and u as a way to force a hiatus. As foreshadowed above, in nasal ñ
the tilde (squiggle) is not considered a diacritic sign at all, but a composite part of a
distinct glyph, with its own chapter in the dictionary: a glyph that denotes the 15th letter
of the Spanish alphabet.
• Swedish uses the acute to show non-standard stress, for example in kafé (café) and resumé
(résumé). This occasionally helps resolve ambiguities, such as ide (hibernation) versus idé
(idea). In these words, the acute is not optional. Some proper names use non-standard
diacritics, such as Carolina Klüft and Staël von Holstein. For foreign loanwords the original
accents are strongly recommended, unless the word has been infused into the language, in which
case they are optional. Hence crème fraîche but ampere. Swedish also has the letters å, ä, and
ö, but these are considered distinct letters, not a and o with diacritics.
• Tamil does not have any diacritics in itself, but uses the Arabic numerals 2, 3 and 4 as
diacritics to represent aspirated, voiced, and voiced-aspirated consonants when Tamil script
is used to write long passages in Sanskrit.
• Thai has its own system of diacritics derived from Indian numerals, which denote different
tones.
• Vietnamese uses the acute (dấu sắc), the grave (dấu huyền), the tilde (dấu ngã), the
underdot (dấu nặng) and the hook above (dấu hỏi) on vowels as tone indicators.
• Welsh uses the circumflex, diaeresis, acute, and grave on its seven vowels a, e, i, o, u, w,
y. The most common is the circumflex (which it calls to bach, meaning "little roof", or acen
grom "crooked accent", or hirnod "long sign") to denote a long vowel, usually to disambiguate
it from a similar word with a short vowel. The rarer grave accent has the opposite effect,
shortening vowel sounds that would usually be pronounced long. The acute accent and diaeresis
are also occasionally used, to denote stress and vowel separation respectively. The
w-circumflex and the y-circumflex are among the most commonly accented characters in Welsh,
but unusual in languages generally, and were until recently very hard to obtain in
word-processed and HTML documents.
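The German fallback mentioned in the list above, where ä, ö and ü are written as ae, oe and ue when the umlaut letters are unavailable, can be sketched as a simple character translation. This is only an illustration: the function name `expand_umlauts` is made up, and rendering capitals as "Ae", "Oe", "Ue" (rather than "AE" in all-caps text) is an assumption of this sketch.

```python
import unicodedata

# Fallback spellings for the three umlauted vowels (lowercase and capital).
UMLAUT_FALLBACK = str.maketrans({
    "\u00e4": "ae", "\u00f6": "oe", "\u00fc": "ue",  # ä, ö, ü
    "\u00c4": "Ae", "\u00d6": "Oe", "\u00dc": "Ue",  # Ä, Ö, Ü (assumed casing)
})


def expand_umlauts(text: str) -> str:
    """Replace umlauted vowels with their two-letter fallback spellings."""
    # Compose first so that 'o' + combining diaeresis is also recognised.
    composed = unicodedata.normalize("NFC", text)
    return composed.translate(UMLAUT_FALLBACK)


print(expand_umlauts("\u00d6fen"))
```

Normalizing to NFC first means the replacement works whether the input stores ö as one code point or as o followed by a combining diaeresis.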
Transliteration
Several languages that are not written with the Roman alphabet are transliterated, or
romanized, using diacritics. Examples:
• Arabic has several romanisations, depending on the type of the application, region,
intended audience, country, etc. Many of them use diacritics extensively; e.g., some methods
use an underdot for rendering emphatic consonants (ṣ, ṭ, ḍ, ẓ, ḥ). The macron is often used to
render long vowels. š is often used for /ʃ/, ġ for /ɣ/.
• Chinese has several romanizations that use the umlaut, but only on u (ü). In Hanyu Pinyin,
the four tones of Mandarin Chinese are denoted by the macron (first tone), acute (second
tone), caron (third tone) and grave (fourth tone) diacritics. Example: ā, á, ǎ, à.
• Romanized Japanese (Rōmaji) occasionally uses macrons to mark long vowels. The Hepburn
romanization system uses macrons to mark long vowels, and the Kunrei-shiki and Nihon-shiki
systems use a circumflex.
• Sanskrit, as well as many of its descendants, like Hindi and Bengali, uses a lossless
romanization system, IAST. This includes several letters with diacritical markings, such as
the macron (ā, ī, ū), over- and underdots (ṛ, ḥ, ṃ, ṇ, ṣ, ṭ, ḍ) as well as a few others (ś, ñ).
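The Hanyu Pinyin tone marks listed above map naturally onto Unicode combining characters. The following sketch, with the hypothetical helper `add_tone`, attaches the macron, acute, caron or grave to a vowel and composes the result; it illustrates the mapping only and does not decide which vowel of a syllable should carry the mark.

```python
import unicodedata

# Tone number -> combining mark, following the four tones listed above.
TONE_MARKS = {
    1: "\u0304",  # COMBINING MACRON
    2: "\u0301",  # COMBINING ACUTE ACCENT
    3: "\u030c",  # COMBINING CARON
    4: "\u0300",  # COMBINING GRAVE ACCENT
}


def add_tone(vowel: str, tone: int) -> str:
    """Return the vowel carrying the diacritic for the given tone (1-4)."""
    # NFC composes the pair into a single precomposed letter where one exists.
    return unicodedata.normalize("NFC", vowel + TONE_MARKS[tone])


print([add_tone("a", tone) for tone in (1, 2, 3, 4)])
```

Called with "a" and tones 1 to 4, this yields the same four letters given in the example above.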
Limits
Orthographic
ཧྐྵྨླྺྼྻྂ
Possibly the greatest number of combining diacritics required to compose a valid character in
any Unicode language is 8, for the "well-known grapheme cluster in Tibetan and Ranjana
scripts" or HAKṢHMALAWARAYAṀ.[12]
It is U+0F67 ཧ TIBETAN LETTER HA combined with TIBETAN SUBJOINED LETTER KA + TIBETAN SUBJOINED
LETTER SSA + TIBETAN SUBJOINED LETTER MA + TIBETAN SUBJOINED LETTER LA + TIBETAN SUBJOINED
LETTER FIXED-FORM WA + TIBETAN SUBJOINED LETTER FIXED-FORM RA + TIBETAN SUBJOINED LETTER
FIXED-FORM YA + TIBETAN SIGN NYI ZLA NAA DA (U+0F67 with U+0F90 U+0FB5 U+0FA8 U+0FB3 U+0FBA
U+0FBC U+0FBB U+0F82).
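As a rough check of the count given above, the cluster can be rebuilt from the listed code points and its combining marks counted with Python's standard `unicodedata` module. The helper `count_marks` is a made-up name, and treating every character whose general category starts with "M" as a combining diacritic is an assumption of this sketch.

```python
import unicodedata

# Code points exactly as listed in the text: the base letter HA plus eight marks.
CLUSTER_CODE_POINTS = [
    0x0F67, 0x0F90, 0x0FB5, 0x0FA8, 0x0FB3, 0x0FBA, 0x0FBC, 0x0FBB, 0x0F82,
]
cluster = "".join(chr(cp) for cp in CLUSTER_CODE_POINTS)


def count_marks(text: str) -> int:
    """Count characters whose general category is a Mark (Mn, Mc or Me)."""
    return sum(1 for ch in text if unicodedata.category(ch).startswith("M"))


for ch in cluster:
    print(f"U+{ord(ch):04X}  {unicodedata.category(ch)}  {unicodedata.name(ch)}")
print("combining marks:", count_marks(cluster))
```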
́
Unorthographic/ornamental
c̳̻͚̻̩̻͉̯̄̏͑̋͆̎͐ͬ͑͌́͢h̵ǎ̡ō̳̻͚̻̩̻͉̯̏͑̋͆̎͐ͬ͑͌́͢͜s̵̵
Some users have explored the limits of rendering in web browsers and other software by
"decorating" words with multiple nonsensical diacritics per character. The result is called
"Zalgo text". The composed bogus characters and words can be copied and pasted normally via
the system clipboard.
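Software that displays user-supplied text sometimes guards against such stacks of marks. One possible approach, sketched below under the assumption that a small fixed cap per base character is acceptable, is to drop combining marks beyond that cap. The function `limit_marks` and its default of two marks are hypothetical, and a cap this low would damage legitimate text such as the Tibetan cluster described earlier.

```python
import unicodedata


def limit_marks(text: str, max_marks: int = 2) -> str:
    """Keep at most `max_marks` consecutive combining marks after each base."""
    kept = []
    run = 0  # number of marks seen since the last non-mark character
    for ch in text:
        if unicodedata.category(ch).startswith("M"):
            run += 1
            if run > max_marks:
                continue  # drop the excess mark
        else:
            run = 0
        kept.append(ch)
    return "".join(kept)


# A small hand-made sample: 'c' with four marks, 'h' with two.
zalgo = "c" + "\u0304\u033b\u0333\u035a" + "h" + "\u0335\u030c"
print(len(zalgo), len(limit_marks(zalgo, 1)))
```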
List of diacritics in Unicode
Diacritics for Latin script in
Unicode:
hide
• v
• t
• e
Diacritics in Unicode for Latin script
Gener
Character Name
Chara Unicode code al Scri
Mark
cter point Categ pt
ory
• COMBINING Mn:
◌̀ GRAVE
ACCENT
Grave
Mark,
nonspaci
Inherit
ed
• U+0300 ng
• COMBINING Mn:
◌́ ACUTE
ACCENT
Acute
Mark,
nonspaci
Inherit
ed
• U+0301 ng
• COMBINING
Mn:
CIRCUMF
◌̂ LEX
ACCENT
Circumfle
x
Mark,
nonspaci
Inherit
ed
ng
• U+0302
Mn:
• COMBINING
◌̃ TILDE
• U+0303
Tilde
Mark,
nonspaci
Inherit
ed
ng
• COMBINING
◌̄ MACRON
• U+0304
Macron
Mn:
Mark,
Inherit
ed
hide
• v
• t
• e
Diacritics in Unicode for Latin script
Gener
Character Name
Chara Unicode code al Scri
Mark
cter point Categ pt
ory
nonspaci
ng
• COMBINING Mn:
◌̅ OVERLIN
E
Overline
Mark,
nonspaci
Inherit
ed
• U+0305 ng
Mn:
• COMBINING
◌̆ BREVE
• U+0306
Breve
Mark,
nonspaci
Inherit
ed
ng
• COMBINING Mn:
◌̇ DOT
ABOVE
Dot
Mark,
nonspaci
Inherit
ed
• U+0307 ng
• COMBINING Mn:
◌̈ DIAERESI
S
Diaeresis
Mark,
nonspaci
Inherit
ed
• U+0308 ng
• COMBINING Mn:
◌̉ HOOK
ABOVE
Hook
Mark,
nonspaci
Inherit
ed
• U+0309 ng
• COMBINING Mn:
◌̊ RING
ABOVE
Ring
Mark,
nonspaci
Inherit
ed
• U+030A ng
• COMBINING
Mn:
DOUBLE
◌̋ ACUTE
ACCENT
Double
acute
Mark,
nonspaci
Inherit
ed
ng
• U+030B
Mn:
• COMBINING
◌̌ CARON
• U+030C
Caron
Mark,
nonspaci
Inherit
ed
ng
• COMBINING Mn:
◌̍ VERTICAL Vertical
LINE line
Mark,
nonspaci
Inherit
ed
ABOVE ng
hide
• v
• t
• e
Diacritics in Unicode for Latin script
Gener
Character Name
Chara Unicode code al Scri
Mark
cter point Categ pt
ory
• U+030D
• COMBINING
DOUBLE Mn:
Double
◌̎ VERTICAL
LINE
vertical
line
Mark,
nonspaci
Inherit
ed
ABOVE ng
• U+030E
• COMBINING
Mn:
DOUBLE
◌̏ GRAVE
ACCENT
Double
grave
Mark,
nonspaci
Inherit
ed
ng
• U+030F
• COMBINING Mn:
◌̐ CANDRAB
INDU
Candrabin
du
Mark,
nonspaci
Inherit
ed
• U+0310 ng
• COMBINING Mn:
◌̑ INVERTE
D BREVE
Inverted
breve
Mark,
nonspaci
Inherit
ed
• U+0311 ng
• COMBINING
Mn:
TURNED
◌̒ COMMA
ABOVE
Turned
comma
Mark,
nonspaci
Inherit
ed
ng
• U+0312
• COMBINING Mn:
◌̓ COMMA
ABOVE
Comma
Mark,
nonspaci
Inherit
ed
• U+0313 ng
• COMBINING
Mn:
REVERSE
◌̔ D COMMA
ABOVE
Reversed
comma
Mark,
nonspaci
Inherit
ed
ng
• U+0314
• COMBINING
Mn:
COMMA
◌̕ ABOVE
RIGHT
Comma
right
Mark,
nonspaci
Inherit
ed
ng
• U+0315
hide
• v
• t
• e
Diacritics in Unicode for Latin script
Gener
Character Name
Chara Unicode code al Scri
Mark
cter point Categ pt
ory
• COMBINING
Mn:
GRAVE
◌̖ ACCENT
BELOW
Grave
Mark,
nonspaci
Inherit
ed
ng
• U+0316
• COMBINING
Mn:
ACUTE
◌̗ ACCENT
BELOW
Acute
Mark,
nonspaci
Inherit
ed
ng
• U+0317
• COMBINING
Mn:
LEFT
◌̘ TACK
BELOW
Left tack
Mark,
nonspaci
Inherit
ed
ng
• U+0318
• COMBINING
Mn:
RIGHT
◌̙ TACK
BELOW
Right tack
Mark,
nonspaci
Inherit
ed
ng
• U+0319
• COMBINING
Mn:
LEFT
◌̚ ANGLE
ABOVE
Left angle
Mark,
nonspaci
Inherit
ed
ng
• U+031A
Mn:
• COMBINING
◌̛ HORN
• U+031B
Horn
Mark,
nonspaci
Inherit
ed
ng
• COMBINING
LEFT Mn:
◌̜ HALF
RING
Left half
ring
Mark,
nonspaci
Inherit
ed
BELOW ng
• U+031C
• COMBINING Mn:
◌̝ UP TACK
BELOW
Up tack
Mark,
nonspaci
Inherit
ed
• U+031D ng
hide
• v
• t
• e
Diacritics in Unicode for Latin script
Gener
Character Name
Chara Unicode code al Scri
Mark
cter point Categ pt
ory
• COMBINING
Mn:
DOWN
◌̞ TACK
BELOW
Down tack
Mark,
nonspaci
Inherit
ed
ng
• U+031E
• COMBINING
Mn:
PLUS
◌̟ SIGN
BELOW
Plus sign
Mark,
nonspaci
Inherit
ed
ng
• U+031F
• COMBINING
Mn:
MINUS
◌̠ SIGN
BELOW
Minus
sign
Mark,
nonspaci
Inherit
ed
ng
• U+0320
• COMBINING
PALATALI Mn:
◌̡ ZED
HOOK
Palatalized
hook
Mark,
nonspaci
Inherit
ed
BELOW ng
• U+0321
• COMBINING
Mn:
RETROFL
◌̢ EX HOOK
BELOW
Retroflex
hook
Mark,
nonspaci
Inherit
ed
ng
• U+0322
• COMBINING Mn:
◌̣ DOT
BELOW
Dot
Mark,
nonspaci
Inherit
ed
• U+0323 ng
• COMBINING Mn:
◌̤ DIAERESI
S BELOW
Diaeresis
Mark,
nonspaci
Inherit
ed
• U+0324 ng
• COMBINING Mn:
◌̥ RING
BELOW
Ring
Mark,
nonspaci
Inherit
ed
• U+0325 ng
hide
• v
• t
• e
Diacritics in Unicode for Latin script
Gener
Character Name
Chara Unicode code al Scri
Mark
cter point Categ pt
ory
• COMBINING Mn:
◌̦ COMMA
BELOW
Comma
Mark,
nonspaci
Inherit
ed
• U+0326 ng
Mn:
• COMBINING
◌̧ CEDILLA
• U+0327
Cedilla
Mark,
nonspaci
Inherit
ed
ng
Mn:
• COMBINING
◌̨ OGONEK
• U+0328
Ogonek
Mark,
nonspaci
Inherit
ed
ng
• COMBINING
Mn:
VERTICAL
◌̩ LINE
BELOW
Vertical
line
Mark,
nonspaci
Inherit
ed
ng
• U+0329
• COMBINING Mn:
◌̪ BRIDGE
BELOW
Bridge
Mark,
nonspaci
Inherit
ed
• U+032A ng
• COMBINING
INVERTE
Mn:
D
◌̫ DOUBLE
ARCH
Double
arch
Mark,
nonspaci
Inherit
ed
BELOW ng
• U+032B
• COMBINING Mn:
◌̬ CARON
BELOW
Caron
Mark,
nonspaci
Inherit
ed
• U+032C ng
• COMBINING
CIRCUMF Mn:
◌̭ LEX
ACCENT
Circumfle
x
Mark,
nonspaci
Inherit
ed
BELOW ng
• U+032D
hide
• v
• t
• e
Diacritics in Unicode for Latin script
Gener
Character Name
Chara Unicode code al Scri
Mark
cter point Categ pt
ory
• COMBINING Mn:
◌̮ BREVE
BELOW
Breve
Mark,
nonspaci
Inherit
ed
• U+032E ng
• COMBINING
Mn:
INVERTE
◌̯ D BREVE
BELOW
Inverted
breve
Mark,
nonspaci
Inherit
ed
ng
• U+032F
• COMBINING Mn:
◌̰ TILDE
BELOW
Tilde
Mark,
nonspaci
Inherit
ed
• U+0330 ng
• COMBINING Mn:
◌̱ MACRON
BELOW
Macron
Mark,
nonspaci
Inherit
ed
• U+0331 ng
Mn:
• COMBINING
◌̲ LOW LINE
• U+0332
Low line
Mark,
nonspaci
Inherit
ed
ng
• COMBINING Mn:
◌̳ DOUBLE
LOW LINE
Double
low line
Mark,
nonspaci
Inherit
ed
• U+0333 ng
• COMBINING Mn:
◌̴ TILDE
OVERLAY
Tilde
overlay
Mark,
nonspaci
Inherit
ed
• U+0334 ng
• COMBINING
Mn:
SHORT Short
◌̵ STROKE
OVERLAY
stroke
overlay
Mark,
nonspaci
Inherit
ed
ng
• U+0335
• COMBINING
Mn:
LONG Long
◌̶ STROKE
OVERLAY
stroke
overlay
Mark,
nonspaci
Inherit
ed
ng
• U+0336
hide
• v
• t
• e
Diacritics in Unicode for Latin script
Gener
Character Name
Chara Unicode code al Scri
Mark
cter point Categ pt
ory
• COMBINING
Mn:
SHORT Short
◌̷ SOLIDUS
OVERLAY
solidus
overlay
Mark,
nonspaci
Inherit
ed
ng
• U+0337
• COMBINING
Mn:
LONG Long
◌̸ SOLIDUS
OVERLAY
solidus
overlay
Mark,
nonspaci
Inherit
ed
ng
• U+0338
• COMBINING
RIGHT Mn:
◌̹ HALF
RING
Right half
ring
Mark,
nonspaci
Inherit
ed
BELOW ng
• U+0339
• COMBINING
Mn:
INVERTE
◌̺ D BRIDGE
BELOW
Inverted
bridge
Mark,
nonspaci
Inherit
ed
ng
• U+033A
• COMBINING Mn:
◌̻ SQUARE
BELOW
Square
Mark,
nonspaci
Inherit
ed
• U+033B ng
• COMBINING Mn:
◌̼ SEAGULL
BELOW
Seagull
Mark,
nonspaci
Inherit
ed
• U+033C ng
Mn:
• COMBINING
◌̽ X ABOVE
• U+033D
X
Mark,
nonspaci
Inherit
ed
ng
• COMBINING Mn:
◌̾ VERTICAL Vertical
TILDE tilde
Mark,
nonspaci
Inherit
ed
• U+033E ng
◌̿ • COMBINING
DOUBLE
Double
overline
Mn:
Mark,
Inherit
ed
hide
• v
• t
• e
Diacritics in Unicode for Latin script
Gener
Character Name
Chara Unicode code al Scri
Mark
cter point Categ pt
ory
OVERLIN nonspaci
E ng
• U+033F
• COMBINING
Mn:
GRAVE
◌̀ TONE
MARK
Grave
tone
Mark,
nonspaci
Inherit
ed
ng
• U+0340
• COMBINING
Mn:
ACUTE
◌́ TONE
MARK
Acute tone
Mark,
nonspaci
Inherit
ed
ng
• U+0341
• COMBINING Mn:
◌͆ BRIDGE
ABOVE
Bridge
Mark,
nonspaci
Inherit
ed
• U+0346 ng
• COMBINING
Mn:
EQUALS
◌͇ SIGN
BELOW
Equals
sign
Mark,
nonspaci
Inherit
ed
ng
• U+0347
• COMBINING
DOUBLE Mn:
Double
◌͈ VERTICAL
LINE
vertical
line
Mark,
nonspaci
Inherit
ed
BELOW ng
• U+0348
• COMBINING
Mn:
LEFT
◌͉ ANGLE
BELOW
Left angle
Mark,
nonspaci
Inherit
ed
ng
• U+0349
• COMBINING
Mn:
NOT
◌͊ TILDE
ABOVE
Not tilde
Mark,
nonspaci
Inherit
ed
ng
• U+034A
hide
• v
• t
• e
Diacritics in Unicode for Latin script
Gener
Character Name
Chara Unicode code al Scri
Mark
cter point Categ pt
ory
• COMBINING
Mn:
HOMOTH
◌͋ ETIC
ABOVE
Homotheti
c
Mark,
nonspaci
Inherit
ed
ng
• U+034B
• COMBINING
ALMOST Mn:
◌͌ EQUAL
TO
Almost
equal to
Mark,
nonspaci
Inherit
ed
ABOVE ng
• U+034C
• COMBINING
LEFT Mn:
◌͍ RIGHT
ARROW
Left right
arrow
Mark,
nonspaci
Inherit
ed
BELOW ng
• U+034D
• COMBINING
Mn:
UPWARDS
◌͎ ARROW
BELOW
Upwards
arrow
Mark,
nonspaci
Inherit
ed
ng
• U+034E
• COMBINING
RIGHT Mn:
◌͐ ARROWH
EAD
Right
arrowhead
Mark,
nonspaci
Inherit
ed
ABOVE ng
• U+0350
• COMBINING
LEFT Mn:
◌͑ HALF
RING
Left half
ring
Mark,
nonspaci
Inherit
ed
ABOVE ng
• U+0351
Mn:
• COMBINING
◌͒ FERMATA
• U+0352
Fermata
Mark,
nonspaci
Inherit
ed
ng
◌͓ • COMBINING
X BELOW
X
Mn:
Mark,
Inherit
ed
hide
• v
• t
• e
Diacritics in Unicode for Latin script
Gener
Character Name
Chara Unicode code al Scri
Mark
cter point Categ pt
ory
• U+0353 nonspaci
ng
• COMBINING
LEFT Mn:
◌͔ ARROWH
EAD
Left
arrowhead
Mark,
nonspaci
Inherit
ed
BELOW ng
• U+0354
• COMBINING
RIGHT Mn:
◌͕ ARROWH
EAD
Right
arrowhead
Mark,
nonspaci
Inherit
ed
BELOW ng
• U+0355
• COMBINING
RIGHT
ARROWH
Right Mn:
EAD AND
◌͖ UP
ARROWH
arrowhead
and up
Mark,
nonspaci
Inherit
ed
arrowhead ng
EAD
BELOW
• U+0356
• COMBINING
RIGHT Mn:
◌͗ HALF
RING
Right half
ring
Mark,
nonspaci
Inherit
ed
ABOVE ng
• U+0357
• COMBINING
Mn:
DOT
◌͘ ABOVE
RIGHT
Dot right
Mark,
nonspaci
Inherit
ed
ng
• U+0358
• COMBINING Mn:
◌͙ ASTERISK
BELOW
Asterisk
Mark,
nonspaci
Inherit
ed
• U+0359 ng
◌͚ • COMBINING
DOUBLE
Double
ring
Mn:
Mark,
Inherit
ed
hide
• v
• t
• e
Diacritics in Unicode for Latin script
Gener
Character Name
Chara Unicode code al Scri
Mark
cter point Categ pt
ory
RING nonspaci
BELOW ng
• U+035A
• COMBINING Mn:
◌͛ ZIGZAG
ABOVE
Zigzag
Mark,
nonspaci
Inherit
ed
• U+035B ng
• COMBINING
Mn:
DOUBLE
◌͜◌ BREVE
BELOW
Double
breve
Mark,
nonspaci
Inherit
ed
ng
• U+035C
• COMBINING Mn:
◌͝◌ DOUBLE
BREVE
Double
breve
Mark,
nonspaci
Inherit
ed
• U+035D ng
• COMBINING Mn:
◌͞◌ DOUBLE
MACRON
Double
macron
Mark,
nonspaci
Inherit
ed
• U+035E ng
• COMBINING
Mn:
DOUBLE
◌͟◌ MACRON
BELOW
Double
macron
Mark,
nonspaci
Inherit
ed
ng
• U+035F
• COMBINING Mn:
◌͠◌ DOUBLE
TILDE
Double
tilde
Mark,
nonspaci
Inherit
ed
• U+0360 ng
• COMBINING
Mn:
◌͡◌
DOUBLE Double
Mark, Inherit
INVERTE inverted
nonspaci ed
D BREVE breve
ng
• U+0361
• COMBINING Double
Mn:
◌͢◌ DOUBLE
RIGHTWA
rightwards
arrow
Mark,
nonspaci
Inherit
ed
RDS ng
hide
• v
• t
• e
Diacritics in Unicode for Latin script
Gener
Character Name
Chara Unicode code al Scri
Mark
cter point Categ pt
ory
ARROW
BELOW
• U+0362
• COMBINING
Mn:
◌ͣ
LATIN Latin
Mark, Inherit
SMALL small
nonspaci ed
LETTER A letter a
ng
• U+0363
• COMBINING
Mn:
◌ͤ
LATIN Latin
Mark, Inherit
SMALL small
nonspaci ed
LETTER E letter e
ng
• U+0364
• COMBINING
Mn:
◌ͥ
LATIN Latin
Mark, Inherit
SMALL small
nonspaci ed
LETTER I letter i
ng
• U+0365
• COMBINING
Mn:
◌ͦ
LATIN Latin
Mark, Inherit
SMALL small
nonspaci ed
LETTER O letter o
ng
• U+0366
• COMBINING
Mn:
◌ͧ
LATIN Latin
Mark, Inherit
SMALL small
nonspaci ed
LETTER U letter u
ng
• U+0367
• COMBINING
Mn:
◌ͨ
LATIN Latin
Mark, Inherit
SMALL small
nonspaci ed
LETTER C letter c
ng
• U+0368
• COMBINING
Mn:
◌ͩ
LATIN Latin
Mark, Inherit
SMALL small
nonspaci ed
LETTER D letter d
ng
• U+0369
hide
• v
• t
• e
Diacritics in Unicode for Latin script
Gener
Character Name
Chara Unicode code al Scri
Mark
cter point Categ pt
ory
◌ͪ | COMBINING LATIN SMALL LETTER H, U+036A | Latin small letter h | Mn: Mark, nonspacing | Inherited
◌ͫ | COMBINING LATIN SMALL LETTER M, U+036B | Latin small letter m | Mn: Mark, nonspacing | Inherited
◌ͬ | COMBINING LATIN SMALL LETTER R, U+036C | Latin small letter r | Mn: Mark, nonspacing | Inherited
◌ͭ | COMBINING LATIN SMALL LETTER T, U+036D | Latin small letter t | Mn: Mark, nonspacing | Inherited
◌ͮ | COMBINING LATIN SMALL LETTER V, U+036E | Latin small letter v | Mn: Mark, nonspacing | Inherited
◌ͯ | COMBINING LATIN SMALL LETTER X, U+036F | Latin small letter x | Mn: Mark, nonspacing | Inherited
◌᪰ | COMBINING DOUBLED CIRCUMFLEX ACCENT, U+1AB0 | Doubled circumflex | Mn: Mark, nonspacing | Inherited
◌᪱ | COMBINING DIAERESIS-RING, U+1AB1 | Diaeresis-ring | Mn: Mark, nonspacing | Inherited
◌᪲ | COMBINING INFINITY, U+1AB2 | Infinity | Mn: Mark, nonspacing | Inherited
◌᪳ | COMBINING DOWNWARDS ARROW, U+1AB3 | Downwards arrow | Mn: Mark, nonspacing | Inherited
◌᪴ | COMBINING TRIPLE DOT, U+1AB4 | Triple dot | Mn: Mark, nonspacing | Inherited
◌᪵ | COMBINING X-X BELOW, U+1AB5 | X-x | Mn: Mark, nonspacing | Inherited
◌᪶ | COMBINING WIGGLY LINE BELOW, U+1AB6 | Wiggly line | Mn: Mark, nonspacing | Inherited
◌᪷ | COMBINING OPEN MARK BELOW, U+1AB7 | Open mark | Mn: Mark, nonspacing | Inherited
◌᪸ | COMBINING DOUBLE OPEN MARK BELOW, U+1AB8 | Double open mark | Mn: Mark, nonspacing | Inherited
◌᪹ | COMBINING LIGHT CENTRALIZATION STROKE BELOW, U+1AB9 | Light centralization stroke | Mn: Mark, nonspacing | Inherited
◌᪺ | COMBINING STRONG CENTRALIZATION STROKE BELOW, U+1ABA | Strong centralization stroke | Mn: Mark, nonspacing | Inherited
◌᪻ | COMBINING PARENTHESES ABOVE, U+1ABB | Parentheses | Mn: Mark, nonspacing | Inherited
◌᪼ | COMBINING DOUBLE PARENTHESES ABOVE, U+1ABC | Double parentheses | Mn: Mark, nonspacing | Inherited
◌᪽ | COMBINING PARENTHESES BELOW, U+1ABD | Parentheses | Mn: Mark, nonspacing | Inherited
◌ᪿ | COMBINING LATIN SMALL LETTER W BELOW, U+1ABF | Latin small letter w | Mn: Mark, nonspacing | Inherited
◌ᫀ | COMBINING LATIN SMALL LETTER TURNED W BELOW, U+1AC0 | Latin small letter turned w | Mn: Mark, nonspacing | Inherited
◌᷀ | COMBINING DOTTED GRAVE ACCENT, U+1DC0 | Dotted grave | Mn: Mark, nonspacing | Inherited
◌᷁ | COMBINING DOTTED ACUTE ACCENT, U+1DC1 | Dotted acute | Mn: Mark, nonspacing | Inherited
◌᷂ | COMBINING SNAKE BELOW, U+1DC2 | Snake | Mn: Mark, nonspacing | Inherited
◌᷃ | COMBINING SUSPENSION MARK, U+1DC3 | Suspension mark | Mn: Mark, nonspacing | Inherited
◌᷄ | COMBINING MACRON-ACUTE, U+1DC4 | Macron-acute | Mn: Mark, nonspacing | Inherited
◌᷅ | COMBINING GRAVE-MACRON, U+1DC5 | Grave-macron | Mn: Mark, nonspacing | Inherited
◌᷆ | COMBINING MACRON-GRAVE, U+1DC6 | Macron-grave | Mn: Mark, nonspacing | Inherited
◌᷇ | COMBINING ACUTE-MACRON, U+1DC7 | Acute-macron | Mn: Mark, nonspacing | Inherited
◌᷈ | COMBINING GRAVE-ACUTE-GRAVE, U+1DC8 | Grave-acute-grave | Mn: Mark, nonspacing | Inherited
◌᷉ | COMBINING ACUTE-GRAVE-ACUTE, U+1DC9 | Acute-grave-acute | Mn: Mark, nonspacing | Inherited
◌᷊ | COMBINING LATIN SMALL LETTER R BELOW, U+1DCA | Latin small letter r | Mn: Mark, nonspacing | Inherited
◌᷋ | COMBINING BREVE-MACRON, U+1DCB | Breve-macron | Mn: Mark, nonspacing | Inherited
◌᷌ | COMBINING MACRON-BREVE, U+1DCC | Macron-breve | Mn: Mark, nonspacing | Inherited
◌᷍◌ | COMBINING DOUBLE CIRCUMFLEX ABOVE, U+1DCD | Double circumflex | Mn: Mark, nonspacing | Inherited
◌᷎ | COMBINING OGONEK ABOVE, U+1DCE | Ogonek | Mn: Mark, nonspacing | Inherited
◌᷏ | COMBINING ZIGZAG BELOW, U+1DCF | Zigzag | Mn: Mark, nonspacing | Inherited
◌᷐ | COMBINING IS BELOW, U+1DD0 | Is | Mn: Mark, nonspacing | Inherited
◌᷑ | COMBINING UR ABOVE, U+1DD1 | Ur | Mn: Mark, nonspacing | Inherited
◌᷒ | COMBINING US ABOVE, U+1DD2 | Us | Mn: Mark, nonspacing | Inherited
◌ᷓ | COMBINING LATIN SMALL LETTER FLATTENED OPEN A ABOVE, U+1DD3 | Latin small letter flattened open a | Mn: Mark, nonspacing | Inherited
◌ᷔ | COMBINING LATIN SMALL LETTER AE, U+1DD4 | Latin small letter ae | Mn: Mark, nonspacing | Inherited
◌ᷕ | COMBINING LATIN SMALL LETTER AO, U+1DD5 | Latin small letter ao | Mn: Mark, nonspacing | Inherited
◌ᷖ | COMBINING LATIN SMALL LETTER AV, U+1DD6 | Latin small letter av | Mn: Mark, nonspacing | Inherited
◌ᷗ | COMBINING LATIN SMALL LETTER C CEDILLA, U+1DD7 | Latin small letter c cedilla | Mn: Mark, nonspacing | Inherited
◌ᷘ | COMBINING LATIN SMALL LETTER INSULAR D, U+1DD8 | Latin small letter insular d | Mn: Mark, nonspacing | Inherited
◌ᷙ | COMBINING LATIN SMALL LETTER ETH, U+1DD9 | Latin small letter eth | Mn: Mark, nonspacing | Inherited
◌ᷚ | COMBINING LATIN SMALL LETTER G, U+1DDA | Latin small letter g | Mn: Mark, nonspacing | Inherited
◌ᷛ | COMBINING LATIN LETTER SMALL CAPITAL G, U+1DDB | Latin letter small capital g | Mn: Mark, nonspacing | Inherited
◌ᷜ | COMBINING LATIN SMALL LETTER K, U+1DDC | Latin small letter k | Mn: Mark, nonspacing | Inherited
◌ᷝ | COMBINING LATIN SMALL LETTER L, U+1DDD | Latin small letter l | Mn: Mark, nonspacing | Inherited
◌ᷞ | COMBINING LATIN LETTER SMALL CAPITAL L, U+1DDE | Latin letter small capital l | Mn: Mark, nonspacing | Inherited
◌ᷟ | COMBINING LATIN LETTER SMALL CAPITAL M, U+1DDF | Latin letter small capital m | Mn: Mark, nonspacing | Inherited
◌ᷠ | COMBINING LATIN SMALL LETTER N, U+1DE0 | Latin small letter n | Mn: Mark, nonspacing | Inherited
◌ᷡ | COMBINING LATIN LETTER SMALL CAPITAL N, U+1DE1 | Latin letter small capital n | Mn: Mark, nonspacing | Inherited
◌ᷢ | COMBINING LATIN LETTER SMALL CAPITAL R, U+1DE2 | Latin letter small capital r | Mn: Mark, nonspacing | Inherited
◌ᷣ | COMBINING LATIN SMALL LETTER R ROTUNDA, U+1DE3 | Latin small letter r rotunda | Mn: Mark, nonspacing | Inherited
◌ᷤ | COMBINING LATIN SMALL LETTER S, U+1DE4 | Latin small letter s | Mn: Mark, nonspacing | Inherited
◌ᷥ | COMBINING LATIN SMALL LETTER LONG S, U+1DE5 | Latin small letter long s | Mn: Mark, nonspacing | Inherited
◌ᷦ | COMBINING LATIN SMALL LETTER Z, U+1DE6 | Latin small letter z | Mn: Mark, nonspacing | Inherited
◌ᷧ | COMBINING LATIN SMALL LETTER ALPHA, U+1DE7 | Latin small letter alpha | Mn: Mark, nonspacing | Inherited
◌ᷨ | COMBINING LATIN SMALL LETTER B, U+1DE8 | Latin small letter b | Mn: Mark, nonspacing | Inherited
◌ᷩ | COMBINING LATIN SMALL LETTER BETA, U+1DE9 | Latin small letter beta | Mn: Mark, nonspacing | Inherited
◌ᷪ | COMBINING LATIN SMALL LETTER SCHWA, U+1DEA | Latin small letter schwa | Mn: Mark, nonspacing | Inherited
◌ᷫ | COMBINING LATIN SMALL LETTER F, U+1DEB | Latin small letter f | Mn: Mark, nonspacing | Inherited
◌ᷬ | COMBINING LATIN SMALL LETTER L WITH DOUBLE MIDDLE TILDE, U+1DEC | Latin small letter l with double middle tilde | Mn: Mark, nonspacing | Inherited
◌ᷭ | COMBINING LATIN SMALL LETTER O WITH LIGHT CENTRALIZATION STROKE, U+1DED | Latin small letter o with light centralization stroke | Mn: Mark, nonspacing | Inherited
◌ᷮ | COMBINING LATIN SMALL LETTER P, U+1DEE | Latin small letter p | Mn: Mark, nonspacing | Inherited
◌ᷯ | COMBINING LATIN SMALL LETTER ESH, U+1DEF | Latin small letter esh | Mn: Mark, nonspacing | Inherited
◌ᷰ | COMBINING LATIN SMALL LETTER U WITH LIGHT CENTRALIZATION STROKE, U+1DF0 | Latin small letter u with light centralization stroke | Mn: Mark, nonspacing | Inherited
◌ᷱ | COMBINING LATIN SMALL LETTER W, U+1DF1 | Latin small letter w | Mn: Mark, nonspacing | Inherited
◌ᷲ | COMBINING LATIN SMALL LETTER A WITH DIAERESIS, U+1DF2 | Latin small letter a with diaeresis | Mn: Mark, nonspacing | Inherited
◌ᷳ | COMBINING LATIN SMALL LETTER O WITH DIAERESIS, U+1DF3 | Latin small letter o with diaeresis | Mn: Mark, nonspacing | Inherited
◌ᷴ | COMBINING LATIN SMALL LETTER U WITH DIAERESIS, U+1DF4 | Latin small letter u with diaeresis | Mn: Mark, nonspacing | Inherited
◌᷵ | COMBINING UP TACK ABOVE, U+1DF5 | Up tack | Mn: Mark, nonspacing | Inherited
◌᷸ | COMBINING DOT ABOVE LEFT, U+1DF8 | Dot left | Mn: Mark, nonspacing | Inherited
◌᷹ | COMBINING WIDE INVERTED BRIDGE BELOW, U+1DF9 | Wide inverted bridge | Mn: Mark, nonspacing | Inherited
◌᷻ | COMBINING DELETION MARK, U+1DFB | Deletion mark | Mn: Mark, nonspacing | Inherited
◌᷼◌ | COMBINING DOUBLE INVERTED BREVE BELOW, U+1DFC | Double inverted breve | Mn: Mark, nonspacing | Inherited
◌᷽ | COMBINING ALMOST EQUAL TO BELOW, U+1DFD | Almost equal to | Mn: Mark, nonspacing | Inherited
◌᷾ | COMBINING LEFT ARROWHEAD ABOVE, U+1DFE | Left arrowhead | Mn: Mark, nonspacing | Inherited
◌᷿ | COMBINING RIGHT ARROWHEAD AND DOWN ARROWHEAD BELOW, U+1DFF | Right arrowhead and down arrowhead | Mn: Mark, nonspacing | Inherited
◌⃐◌ | COMBINING LEFT HARPOON ABOVE, U+20D0 | Left harpoon | Mn: Mark, nonspacing | Inherited
◌⃑◌ | COMBINING RIGHT HARPOON ABOVE, U+20D1 | Right harpoon | Mn: Mark, nonspacing | Inherited
◌⃒ | COMBINING LONG VERTICAL LINE OVERLAY, U+20D2 | Long vertical line overlay | Mn: Mark, nonspacing | Inherited
◌⃓ | COMBINING SHORT VERTICAL LINE OVERLAY, U+20D3 | Short vertical line overlay | Mn: Mark, nonspacing | Inherited
◌⃔◌ | COMBINING ANTICLOCKWISE ARROW ABOVE, U+20D4 | Anticlockwise arrow | Mn: Mark, nonspacing | Inherited
◌⃕◌ | COMBINING CLOCKWISE ARROW ABOVE, U+20D5 | Clockwise arrow | Mn: Mark, nonspacing | Inherited
◌⃖◌ | COMBINING LEFT ARROW ABOVE, U+20D6 | Left arrow | Mn: Mark, nonspacing | Inherited
◌⃗◌ | COMBINING RIGHT ARROW ABOVE, U+20D7 | Right arrow | Mn: Mark, nonspacing | Inherited
◌⃘ | COMBINING RING OVERLAY, U+20D8 | Ring overlay | Mn: Mark, nonspacing | Inherited
◌⃙ | COMBINING CLOCKWISE RING OVERLAY, U+20D9 | Clockwise ring overlay | Mn: Mark, nonspacing | Inherited
◌⃚ | COMBINING ANTICLOCKWISE RING OVERLAY, U+20DA | Anticlockwise ring overlay | Mn: Mark, nonspacing | Inherited
◌⃛◌ | COMBINING THREE DOTS ABOVE, U+20DB | Three dots | Mn: Mark, nonspacing | Inherited
◌⃜◌ | COMBINING FOUR DOTS ABOVE, U+20DC | Four dots | Mn: Mark, nonspacing | Inherited
◌⃡◌ | COMBINING LEFT RIGHT ARROW ABOVE, U+20E1 | Left right arrow | Mn: Mark, nonspacing | Inherited
◌⃥ | COMBINING REVERSE SOLIDUS OVERLAY, U+20E5 | Reverse solidus overlay | Mn: Mark, nonspacing | Inherited
◌⃦ | COMBINING DOUBLE VERTICAL STROKE OVERLAY, U+20E6 | Double vertical stroke overlay | Mn: Mark, nonspacing | Inherited
◌⃧ | COMBINING ANNUITY SYMBOL, U+20E7 | Annuity symbol | Mn: Mark, nonspacing | Inherited
◌⃨ | COMBINING TRIPLE UNDERDOT, U+20E8 | Triple underdot | Mn: Mark, nonspacing | Inherited
◌⃩◌ | COMBINING WIDE BRIDGE ABOVE, U+20E9 | Wide bridge | Mn: Mark, nonspacing | Inherited
◌⃪ | COMBINING LEFTWARDS ARROW OVERLAY, U+20EA | Leftwards arrow overlay | Mn: Mark, nonspacing | Inherited
◌⃫ | COMBINING LONG DOUBLE SOLIDUS OVERLAY, U+20EB | Long double solidus overlay | Mn: Mark, nonspacing | Inherited
◌⃬ | COMBINING RIGHTWARDS HARPOON WITH BARB DOWNWARDS, U+20EC | Rightwards harpoon with barb downwards | Mn: Mark, nonspacing | Inherited
◌⃭ | COMBINING LEFTWARDS HARPOON WITH BARB DOWNWARDS, U+20ED | Leftwards harpoon with barb downwards | Mn: Mark, nonspacing | Inherited
◌⃮ | COMBINING LEFT ARROW BELOW, U+20EE | Left arrow | Mn: Mark, nonspacing | Inherited
◌⃯ | COMBINING RIGHT ARROW BELOW, U+20EF | Right arrow | Mn: Mark, nonspacing | Inherited
◌⃰◌ | COMBINING ASTERISK ABOVE, U+20F0 | Asterisk | Mn: Mark, nonspacing | Inherited
◌︠ | COMBINING LIGATURE LEFT HALF, U+FE20 | Ligature left half | Mn: Mark, nonspacing | Inherited
◌︡ | COMBINING LIGATURE RIGHT HALF, U+FE21 | Ligature right half | Mn: Mark, nonspacing | Inherited
◌︢ | COMBINING DOUBLE TILDE LEFT HALF, U+FE22 | Double tilde left half | Mn: Mark, nonspacing | Inherited
◌︣ | COMBINING DOUBLE TILDE RIGHT HALF, U+FE23 | Double tilde right half | Mn: Mark, nonspacing | Inherited
◌︤ | COMBINING MACRON LEFT HALF, U+FE24 | Macron left half | Mn: Mark, nonspacing | Inherited
◌︥ | COMBINING MACRON RIGHT HALF, U+FE25 | Macron right half | Mn: Mark, nonspacing | Inherited
◌︦◌ | COMBINING CONJOINING MACRON, U+FE26 | Conjoining macron | Mn: Mark, nonspacing | Inherited
◌︧ | COMBINING LIGATURE LEFT HALF BELOW, U+FE27 | Ligature left half | Mn: Mark, nonspacing | Inherited
◌︨ | COMBINING LIGATURE RIGHT HALF BELOW, U+FE28 | Ligature right half | Mn: Mark, nonspacing | Inherited
◌︩ | COMBINING TILDE LEFT HALF BELOW, U+FE29 | Tilde left half | Mn: Mark, nonspacing | Inherited
◌︪ | COMBINING TILDE RIGHT HALF BELOW, U+FE2A | Tilde right half | Mn: Mark, nonspacing | Inherited
◌︫ | COMBINING MACRON LEFT HALF BELOW, U+FE2B | Macron left half | Mn: Mark, nonspacing | Inherited
◌︬ | COMBINING MACRON RIGHT HALF BELOW, U+FE2C | Macron right half | Mn: Mark, nonspacing | Inherited
◌︭ | COMBINING CONJOINING MACRON BELOW, U+FE2D | Conjoining macron | Mn: Mark, nonspacing | Inherited
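The name and General Category columns of the table can be checked against the Unicode Character Database. The following is a minimal sketch, assuming Python's standard `unicodedata` module; the helper `describe` and the list `SAMPLE_CODE_POINTS` are hypothetical names introduced here for illustration, and the code points are a small sample taken from the table rows. The Script column ("Inherited") is not exposed by `unicodedata`, so it is not checked.

```python
import unicodedata

# A few code points copied from the table (illustrative sample, not the full list).
SAMPLE_CODE_POINTS = [0x035F, 0x0361, 0x0363, 0x1DC4, 0x20D7, 0xFE20]


def describe(code_point):
    """Return (code point label, character name, general category, combining class)."""
    ch = chr(code_point)
    return (
        f"U+{code_point:04X}",
        unicodedata.name(ch, "<unnamed>"),  # fallback if this Python build lacks the name
        unicodedata.category(ch),           # "Mn" means Mark, nonspacing
        unicodedata.combining(ch),          # canonical combining class; nonzero for these marks
    )


for cp in SAMPLE_CODE_POINTS:
    print(" | ".join(str(field) for field in describe(cp)))

# A double diacritic such as U+0361 is placed between the two base letters it spans.
tie = "t" + chr(0x0361) + "s"
print(tie, len(tie))  # three code points, normally rendered as one tied pair
```

Whether the marks display correctly still depends on font and renderer support, which this sketch does not test.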
See also
• Latin-script alphabets
• Alt code
• Category:Letters with diacritics
• Collating sequence
• Combining character
• Compose key
• English terms with diacritical marks
• Heavy metal umlaut
• ISO/IEC 8859 8-bit extended-Latin-alphabet European character encodings
• Latin alphabet
• List of Latin letters
• List of precomposed Latin characters in Unicode
• List of U.S. cities with diacritics
• Romanization
• wikt:Appendix:English words with diacritics
Notes
1. ^ The New Yorker is reported as being unique in its continuing usage of them. [1]
References
1. ^ "The New Yorker's odd mark — the diaeresis". 16 December 2010. Archived from the original on 16 December 2010. Among the many mysteries of The New Yorker is that funny little umlaut over words like coöperate and reëlect. The New Yorker seems to be the only publication on the planet that uses it, and I always found it a little pretentious until I did some research. Turns out, it's not an umlaut. It's a diaeresis.
2. ^ Sweet, Henry (1877). A Handbook of Phonetics. Cambridge: Cambridge University Press. pp. 174–175. Even letters with accents and diacritics [...] being only cast for a few founts, act practically as new letters. [...] We may consider the h in sh and th simply as a diacritic written for convenience on a line with the letter it modifies.
3. ^ Oxford English Dictionary
4. ^ Nestle, Eberhard (1888). Syrische Grammatik mit Litteratur, Chrestomathie und Glossar. Berlin: H. Reuther's Verlagsbuchhandlung. [translated to English as Syriac grammar with bibliography, chrestomathy and glossary, by R. S. Kennedy. London: Williams & Norgate 1889].
5. ^ Coakley, J. F. (2002). Robinson's Paradigms and Exercises in Syriac Grammar (5th ed.). Oxford University Press. ISBN 978-0-19-926129-1.
6. ^ Michaelis, Ioannis Davidis (1784). Grammatica Syriaca.
7. ^ Gramática de la Llingua Asturiana (PDF) (3rd ed.). Academia de la Llingua Asturiana. 2001. section 1.2. ISBN 84-8168-310-8. Archived from the original (PDF) on 2011-05-25. Retrieved 2011-06-07.
8. ^ http://www.juls.savba.sk/ediela/psp2000/psp.pdf page 12, section I.2
9. ^ S.P. Brock, "An Introduction to Syriac Studies", in J.H. Eaton (Ed.), Horizons in Semitic Studies (1980)
10. ^ Norris, Mary (26 April 2012). "The Curse of the Diaeresis". The New Yorker. Retrieved 18 April 2014.
11. ^ van Geloven, Sander (2012). Diakritische tekens in het Nederlands (in Dutch). Utrecht: Hellebaard. Archived from the original on 2013-10-29.
12. ^ Steele, Shawn (2010-01-25). "Most combining characters in a Unicode glyph/character/whatever". Microsoft. Archived from the original on 2019-05-16. Retrieved 2019-11-25.
External links
• Context of Diacritics | A research project. Archived 2014-10-12 at the Wayback Machine
• Diacritics Project
• Unicode
• Orthographic diacritics and multilingual computing, by J. C. Wells
• Notes on the use of the diacritics, by Markus Lång
• Entering International Characters (in Linux, KDE)
• Standard Character Set for Macintosh PDF at Adobe.com
Categories:
• Diacritics
• Punctuation
• Typography
• This page was last edited on 28 October 2022, at 18:05 (UTC).
• Text is available under the Creative Commons Attribution-
ShareAlike License 3.0; additional terms may apply. By using
this site, you agree to the Terms of Use and Privacy Policy.
Wikipedia® is a registered trademark of the Wikimedia
Foundation, Inc., a non-profit organization.