1 Internet Engineering Task Force (IETF) P. Faltstrom, Ed.
2 Request for Comments: 5892 Cisco
3 Category: Standards Track August 2010
4 ISSN: 2070-1721
5
6
7 The Unicode Code Points and
8 Internationalized Domain Names for Applications (IDNA)
9
10 Abstract
11
12 This document specifies rules for deciding whether a code point,
13 considered in isolation or in context, is a candidate for inclusion
14 in an Internationalized Domain Name (IDN).
15
16 It is part of the specification of Internationalizing Domain Names in
17 Applications 2008 (IDNA2008).
18
19 Status of This Memo
20
21 This is an Internet Standards Track document.
22
23 This document is a product of the Internet Engineering Task Force
24 (IETF). It represents the consensus of the IETF community. It has
25 received public review and has been approved for publication by the
26 Internet Engineering Steering Group (IESG). Further information on
27 Internet Standards is available in Section 2 of RFC 5741.
28
29 Information about the current status of this document, any errata,
30 and how to provide feedback on it may be obtained at
31 http://www.rfc-editor.org/info/rfc5892.
32
33 Copyright Notice
34
35 Copyright (c) 2010 IETF Trust and the persons identified as the
36 document authors. All rights reserved.
37
38 This document is subject to BCP 78 and the IETF Trust's Legal
39 Provisions Relating to IETF Documents
40 (http://trustee.ietf.org/license-info) in effect on the date of
41 publication of this document. Please review these documents
42 carefully, as they describe your rights and restrictions with respect
43 to this document. Code Components extracted from this document must
44 include Simplified BSD License text as described in Section 4.e of
45 the Trust Legal Provisions and are provided without warranty as
46 described in the Simplified BSD License.
47
48
49
50
51
52 Faltstrom Standards Track [Page 1]
53 RFC 5892 IDNA Code Points August 2010
54
55
56 Table of Contents
57
58 1. Introduction . . . . . . . . . . . . . . . . . . . . . . . . . 3
59 2. Category Definitions Used to Calculate Derived Property
60 Value . . . . . . . . . . . . . . . . . . . . . . . . . . . . 5
61 2.1. LetterDigits (A) . . . . . . . . . . . . . . . . . . . . . 5
62 2.2. Unstable (B) . . . . . . . . . . . . . . . . . . . . . . . 6
63 2.3. IgnorableProperties (C) . . . . . . . . . . . . . . . . . 6
64 2.4. IgnorableBlocks (D) . . . . . . . . . . . . . . . . . . . 7
65 2.5. LDH (E) . . . . . . . . . . . . . . . . . . . . . . . . . 7
66 2.6. Exceptions (F) . . . . . . . . . . . . . . . . . . . . . . 7
67 2.7. BackwardCompatible (G) . . . . . . . . . . . . . . . . . . 9
68 2.8. JoinControl (H) . . . . . . . . . . . . . . . . . . . . . 9
69 2.9. OldHangulJamo (I) . . . . . . . . . . . . . . . . . . . . 9
70 2.10. Unassigned (J) . . . . . . . . . . . . . . . . . . . . . . 9
71 3. Calculation of the Derived Property . . . . . . . . . . . . . 10
72 4. Code Points . . . . . . . . . . . . . . . . . . . . . . . . . 10
73 5. IANA Considerations . . . . . . . . . . . . . . . . . . . . . 11
74 5.1. IDNA-Derived Property Value Registry . . . . . . . . . . . 11
75 5.2. IDNA Context Registry . . . . . . . . . . . . . . . . . . 11
76 5.2.1. Template for Context Registry . . . . . . . . . . . . 11
77 6. Security Considerations . . . . . . . . . . . . . . . . . . . 12
78 7. Acknowledgements . . . . . . . . . . . . . . . . . . . . . . . 12
79 Appendix A. Contextual Rules Registry . . . . . . . . . . . . . 13
80 Appendix A.1. ZERO WIDTH NON-JOINER . . . . . . . . . . . . . . . 15
81 Appendix A.2. ZERO WIDTH JOINER . . . . . . . . . . . . . . . . . 16
82 Appendix A.3. MIDDLE DOT . . . . . . . . . . . . . . . . . . . . . 16
83 Appendix A.4. GREEK LOWER NUMERAL SIGN (KERAIA) . . . . . . . . . 17
84 Appendix A.5. HEBREW PUNCTUATION GERESH . . . . . . . . . . . . . 17
85 Appendix A.6. HEBREW PUNCTUATION GERSHAYIM . . . . . . . . . . . . 18
86 Appendix A.7. KATAKANA MIDDLE DOT . . . . . . . . . . . . . . . . 18
87 Appendix A.8. ARABIC-INDIC DIGITS . . . . . . . . . . . . . . . . 19
88 Appendix A.9. EXTENDED ARABIC-INDIC DIGITS . . . . . . . . . . . . 19
89 Appendix B. Code Points 0x0000 - 0x10FFFF . . . . . . . . . . . 20
90 Appendix B.1. Code Points in Unicode Character Database (UCD)
91 Format . . . . . . . . . . . . . . . . . . . . . . . 20
92 8. References . . . . . . . . . . . . . . . . . . . . . . . . . . 69
93 8.1. Normative References . . . . . . . . . . . . . . . . . . . 69
94 8.2. Informative References . . . . . . . . . . . . . . . . . . 69
95
96
97
98
99
100
101
102
103
104
105
106
107 Faltstrom Standards Track [Page 2]
108 RFC 5892 IDNA Code Points August 2010
109
110
111 1. Introduction
112
113 RFC 4690 [RFC4690] suggests an inclusion-based approach for selecting
114 the code points from The Unicode Standard [Unicode52] that should be
115 included in the list of code points that may be used in
116 Internationalized Domain Names.
117
118 Specifically, RFC 4690 [RFC4690] says the following:
119
120 The IAB has concluded that there is a consensus within the broader
121 community that lists of code points should be specified by the use
122 of an inclusion-based mechanism (i.e., identifying the characters
123 that are permitted), rather than by excluding a small number of
124 characters from the total Unicode set as Stringprep [RFC3454] and
125 Nameprep [RFC3491] do today. That conclusion should be reviewed
126 by the IETF community and action taken as appropriate.
127
128 This document reviews and classifies the collections of code points
129 in the Unicode character set by examining various properties of the
130 code points. It then defines an algorithm for determining a derived
131 property value. It specifies a procedure, and not a table, of code
132 points so that the algorithm can be used to determine code point sets
133 independent of the version of Unicode that is in use.
134
135 This document is not intended to specify precisely how these property
136 values are to be applied in IDN labels. That information appears in
137 the Protocol document [RFC5891], but it is important to understand
138 that the assignment of a value of this property to a particular
139 character is not sufficient to determine whether it can be used in a
140 given label. In particular, some combinations of allowed code points
141 are not advisable for use in IDNs due to rules specific to a script
142 or class of characters. The requirement for such rules is linked to
143 the operations in the Protocol document and especially to the
144 characters designated as requiring contextual rules.
145
146 The value of the property is to be interpreted as follows.
147
148 o PROTOCOL VALID: Those that are allowed to be used in IDNs. Code
149 points with this property value are permitted for general use in
150 IDNs. However, that a label consists only of code points that
151 have this property value does not imply that the label can be used
152 in DNS. See the Protocol document for algorithms to make
153 decisions about labels in domain names. The abbreviated term
154 PVALID is used to refer to this value in the rest of this
155 document.
156
157
158
159
160
161
162 Faltstrom Standards Track [Page 3]
163 RFC 5892 IDNA Code Points August 2010
164
165
166 o CONTEXTUAL RULE REQUIRED: Some characteristics of the character,
167 such as it being invisible in certain contexts or problematic in
168 others, require that it not be used in labels unless specific
169 other characters or properties are present. The abbreviated term
170 CONTEXT is used to refer to this value in the rest of this
171 document. There are two subdivisions of CONTEXTUAL RULE REQUIRED,
172 one for Join_controls (called CONTEXTJ) and for other characters
173 (called CONTEXTO). These are discussed in more detail below and
174 in the Protocol document.
175
176 o DISALLOWED: Those that should clearly not be included in IDNs.
177 Code points with this property value are not permitted in IDNs.
178
179 o UNASSIGNED: Those code points that are not designated (i.e., are
180 unassigned) in the Unicode Standard.
181
182 The mechanisms described here allow determination of the value of the
183 property for future versions of Unicode (including characters added
184 after Unicode 5.2). Changes in Unicode properties that do not affect
185 the outcome of this process do not affect IDN. For example, a
186 character can have its Unicode General_Category value (see
187 [Unicode52]) change from So to Sm or from Lo to Ll, without affecting
188 the algorithm results. Moreover, even if such changes were the
189 result, the BackwardCompatible list (Section 2.7) can be adjusted to
190 ensure the stability of the results.
191
192 Some code points need to be allowed in exceptional circumstances but
193 should be excluded in all other cases; these rules are also described
194 in other documents. The most notable of these are the Join Control
195 characters, U+200D ZERO WIDTH JOINER and U+200C ZERO WIDTH
196 NON-JOINER. Both of them have the derived property value CONTEXTJ.
197 A character with the derived property value CONTEXTJ or CONTEXTO
198 (CONTEXTUAL RULE REQUIRED) is not to be used unless an appropriate
199 rule has been established and the context of the character is
200 consistent with that rule. It is invalid to either register a string
201 containing these characters or even to look one up unless such a
202 contextual rule is found and satisfied. Please see Appendix A, "The
203 Contextual Rules Registry", for more information.
204
205 This document is part of a series that, together, constitute a
206 proposal for updating the IDNA standards to resolve issues uncovered
207 in recent years, cover a broader range of scripts, and provide for
208 migration to newer versions of Unicode. See the Rationale document
209 [RFC5894] for a broader discussion.
210
211 The key words "MUST", "MUST NOT", "REQUIRED", "SHALL", "SHALL NOT",
212 "SHOULD", "SHOULD NOT", "RECOMMENDED", "MAY", and "OPTIONAL" in this
213 document are to be interpreted as described in RFC 2119 [RFC2119].
214
215
216
217 Faltstrom Standards Track [Page 4]
218 RFC 5892 IDNA Code Points August 2010
219
220
221 2. Category Definitions Used to Calculate Derived Property Value
222
223 The derived property obtains its value based on a two-step procedure.
224 First, characters are placed in one or more character categories
225 based on either core properties defined by the Unicode Standard or by
226 treating the code point as an exception and addressing the code point
227 by its code point value. These categories are not mutually
228 exclusive.
229
230 In the second step, set operations are used with these categories to
231 determine the values for an IDN-specific property. Those operations
232 are specified in Section 3.
233
234 Unicode property names and property value names may have short
235 abbreviations, such as gc for the General_Category property, and Ll
236 for the Lowercase_Letter property value of the gc property.
237
238 In the following specification of categories, the operation that
239 returns the value of a particular Unicode character property for a
240 code point is designated by using the formal name of that property
241 (from PropertyAliases.txt) followed by '(cp)'. For example, the
242 value of the General_Category property for a code point is indicated
243 by General_Category(cp).
244
245 2.1. LetterDigits (A)
246
247 A: General_Category(cp) is in {Ll, Lu, Lo, Nd, Lm, Mn, Mc}
248
249 These rules identify characters commonly used in mnemonics and often
250 informally described as "language characters". In general, only code
251 points assigned to this category are suitable for use in IDN.
252
253 For more information, see Section 4.5 of The Unicode Standard
254 [Unicode].
255
256 The categories used in this rule are:
257
258 o Ll - Lowercase_Letter
259
260 o Lu - Uppercase_Letter
261
262 o Lo - Other_Letter
263
264 o Nd - Decimal_Number
265
266 o Lm - Modifier_Letter
267
268
269
270
271
272 Faltstrom Standards Track [Page 5]
273 RFC 5892 IDNA Code Points August 2010
274
275
276 o Mn - Nonspacing_Mark
277
278 o Mc - Spacing_Mark
279
280 2.2. Unstable (B)
281
282 B: toNFKC(toCaseFold(toNFKC(cp))) != cp
283
284 This category is used to group the characters that are not stable
285 under Normalization Form K (NFKC) and case folding. In general,
286 these code points are not suitable for use for IDN.
287
288 The toCaseFold() operation is defined in Section 3.13 of The Unicode
289 Standard [Unicode].
290
291 The toNFKC() operation returns the code point in normalization form
292 KC. For more information, see Section 5 of Unicode Standard Annex
293 #15 [TR15].
294
295 It should be noted that NFKC is used, although Normalization Form C
296 (NFC) is used in the "IDNA Protocol" document [RFC5891].
297
298 2.3. IgnorableProperties (C)
299
300 C: Default_Ignorable_Code_Point(cp) = True or
301 White_Space(cp) = True or
302 Noncharacter_Code_Point(cp) = True
303
304 This category is used to group code points that are not recommended
305 for use in identifiers. In general, these code points are not
306 suitable for use in an IDN.
307
308 The definition for Default_Ignorable_Code_Point can be found in
309 DerivedCoreProperties.txt [DerivedCoreProperties] and is at the time
310 of Unicode 5.2:
311
312 Other_Default_Ignorable_Code_Point + Cf (Format characters)
313 + Variation_Selector - White_Space - FFF9..FFFB (Annotation
314 Characters) - 0600..0603, 06DD, 070F (exceptional Cf characters
315 that should be visible)
316
317
318
319
320
321
322
323
324
325
326
327 Faltstrom Standards Track [Page 6]
328 RFC 5892 IDNA Code Points August 2010
329
330
331 2.4. IgnorableBlocks (D)
332
333 D: Block(cp) is in {Combining Diacritical Marks for Symbols,
334 Musical Symbols, Ancient Greek Musical Notation}
335
336 This category is used to identify code points that are not useful in
337 mnemonics or that are otherwise impractical for IDN use. In general,
338 these code points are not suitable for use for IDN.
339
340 The definition of blocks can be found in Blocks.txt [BlockNames].
341
342 2.5. LDH (E)
343
344 E: cp is in {002D, 0030..0039, 0061..007A}
345
346 This category is used in the second step to preserve the traditional
347 "hostname" (LDH -- as described in the Definitions document
348 [RFC5890]) characters ('-', 0-9, and a-z). In general, these code
349 points are suitable for use for IDN. Note that there are other rules
350 regarding the code point U+002D HYPHEN-MINUS that are specified in
351 the IDNA Protocol Specification [RFC5891].
352
353 2.6. Exceptions (F)
354
355 F: cp is in {00B7, 00DF, 0375, 03C2, 05F3, 05F4, 0640, 0660,
356 0661, 0662, 0663, 0664, 0665, 0666, 0667, 0668,
357 0669, 06F0, 06F1, 06F2, 06F3, 06F4, 06F5, 06F6,
358 06F7, 06F8, 06F9, 06FD, 06FE, 07FA, 0F0B, 3007,
359 302E, 302F, 3031, 3032, 3033, 3034, 3035, 303B,
360 30FB}
361
362 This category explicitly lists code points for which the category
363 cannot be assigned using only the core property values that exist in
364 the Unicode standard. The values are according to the table below:
365
366 PVALID -- Would otherwise have been DISALLOWED
367
368 00DF; PVALID # LATIN SMALL LETTER SHARP S
369 03C2; PVALID # GREEK SMALL LETTER FINAL SIGMA
370 06FD; PVALID # ARABIC SIGN SINDHI AMPERSAND
371 06FE; PVALID # ARABIC SIGN SINDHI POSTPOSITION MEN
372 0F0B; PVALID # TIBETAN MARK INTERSYLLABIC TSHEG
373 3007; PVALID # IDEOGRAPHIC NUMBER ZERO
374
375
376
377
378
379
380
381
382 Faltstrom Standards Track [Page 7]
383 RFC 5892 IDNA Code Points August 2010
384
385
386 CONTEXTO -- Would otherwise have been DISALLOWED
387
388 00B7; CONTEXTO # MIDDLE DOT
389 0375; CONTEXTO # GREEK LOWER NUMERAL SIGN (KERAIA)
390 05F3; CONTEXTO # HEBREW PUNCTUATION GERESH
391 05F4; CONTEXTO # HEBREW PUNCTUATION GERSHAYIM
392 30FB; CONTEXTO # KATAKANA MIDDLE DOT
393
394 CONTEXTO -- Would otherwise have been PVALID
395
396 0660; CONTEXTO # ARABIC-INDIC DIGIT ZERO
397 0661; CONTEXTO # ARABIC-INDIC DIGIT ONE
398 0662; CONTEXTO # ARABIC-INDIC DIGIT TWO
399 0663; CONTEXTO # ARABIC-INDIC DIGIT THREE
400 0664; CONTEXTO # ARABIC-INDIC DIGIT FOUR
401 0665; CONTEXTO # ARABIC-INDIC DIGIT FIVE
402 0666; CONTEXTO # ARABIC-INDIC DIGIT SIX
403 0667; CONTEXTO # ARABIC-INDIC DIGIT SEVEN
404 0668; CONTEXTO # ARABIC-INDIC DIGIT EIGHT
405 0669; CONTEXTO # ARABIC-INDIC DIGIT NINE
406 06F0; CONTEXTO # EXTENDED ARABIC-INDIC DIGIT ZERO
407 06F1; CONTEXTO # EXTENDED ARABIC-INDIC DIGIT ONE
408 06F2; CONTEXTO # EXTENDED ARABIC-INDIC DIGIT TWO
409 06F3; CONTEXTO # EXTENDED ARABIC-INDIC DIGIT THREE
410 06F4; CONTEXTO # EXTENDED ARABIC-INDIC DIGIT FOUR
411 06F5; CONTEXTO # EXTENDED ARABIC-INDIC DIGIT FIVE
412 06F6; CONTEXTO # EXTENDED ARABIC-INDIC DIGIT SIX
413 06F7; CONTEXTO # EXTENDED ARABIC-INDIC DIGIT SEVEN
414 06F8; CONTEXTO # EXTENDED ARABIC-INDIC DIGIT EIGHT
415 06F9; CONTEXTO # EXTENDED ARABIC-INDIC DIGIT NINE
416
417 DISALLOWED -- Would otherwise have been PVALID
418
419 0640; DISALLOWED # ARABIC TATWEEL
420 07FA; DISALLOWED # NKO LAJANYALAN
421 302E; DISALLOWED # HANGUL SINGLE DOT TONE MARK
422 302F; DISALLOWED # HANGUL DOUBLE DOT TONE MARK
423 3031; DISALLOWED # VERTICAL KANA REPEAT MARK
424 3032; DISALLOWED # VERTICAL KANA REPEAT WITH VOICED SOUND MARK
425 3033; DISALLOWED # VERTICAL KANA REPEAT MARK UPPER HALF
426 3034; DISALLOWED # VERTICAL KANA REPEAT WITH VOICED SOUND MARK UPPER HA
427 3035; DISALLOWED # VERTICAL KANA REPEAT MARK LOWER HALF
428 303B; DISALLOWED # VERTICAL IDEOGRAPHIC ITERATION MARK
429
430
431
432
433
434
435
436
437 Faltstrom Standards Track [Page 8]
438 RFC 5892 IDNA Code Points August 2010
439
440
441 2.7. BackwardCompatible (G)
442
443 G: cp is in {}
444
445 This category includes the code points that property values in
446 versions of Unicode after 5.2 have changed in such a way that the
447 derived property value would no longer be PVALID or DISALLOWED. If
448 changes are made to future versions of Unicode so that code points
449 might change the property value from PVALID or DISALLOWED, then this
450 table can be updated and keep special exception values so that the
451 property values for code points stay stable.
452
453 2.8. JoinControl (H)
454
455 H: Join_Control(cp) = True
456
457 This category consists of Join Control characters (i.e., they are not
458 in LetterDigits (Section 2.1) but are still required in IDN labels
459 under some circumstances).
460
461 2.9. OldHangulJamo (I)
462
463 I: Hangul_Syllable_Type(cp) is in {L, V, T}
464
465 This category consists of all conjoining Hangul Jamo (Leading Jamo,
466 Vowel Jamo, and Trailing Jamo).
467
468 Elimination of conjoining Hangul Jamo from the set of PVALID
469 characters results in restricting the set of Korean PVALID characters
470 just to preformed, modern Hangul syllable characters. Old Hangul
471 syllables, which must be spelled with sequences of conjoining Hangul
472 Jamo, are not PVALID for IDNs.
473
474 2.10. Unassigned (J)
475
476 J: General_Category(cp) is in {Cn} and
477 Noncharacter_Code_Point(cp) = False
478
479 This category consists of code points in the Unicode character set
480 that are not (yet) assigned. It should be noted that Unicode
481 distinguishes between "unassigned code points" and "unassigned
482 characters". The unassigned code points are all but (Cn -
483 Noncharacters), while the unassigned *characters* are all but (Cn +
484 Cs).
485
486
487
488
489
490
491
492 Faltstrom Standards Track [Page 9]
493 RFC 5892 IDNA Code Points August 2010
494
495
496 3. Calculation of the Derived Property
497
498 As described above (Section 1) and in more detail in the IDNA
499 Protocol document [RFC5891], possible values of the IDN property are:
500
501 o PVALID
502
503 o CONTEXTJ
504
505 o CONTEXTO
506
507 o DISALLOWED
508
509 o UNASSIGNED
510
511 The algorithm to calculate the value of the derived property is as
512 follows. If the name of a rule (such as Exception) is used, that
513 implies the set of code points that the rule defines, while the same
514 name as a function call (such as Exception(cp)) implies the value cp
515 has in the Exceptions table.
516
517 If .cp. .in. Exceptions Then Exceptions(cp);
518 Else If .cp. .in. BackwardCompatible Then BackwardCompatible(cp);
519 Else If .cp. .in. Unassigned Then UNASSIGNED;
520 Else If .cp. .in. LDH Then PVALID;
521 Else If .cp. .in. JoinControl Then CONTEXTJ;
522 Else If .cp. .in. Unstable Then DISALLOWED;
523 Else If .cp. .in. IgnorableProperties Then DISALLOWED;
524 Else If .cp. .in. IgnorableBlocks Then DISALLOWED;
525 Else If .cp. .in. OldHangulJamo Then DISALLOWED;
526 Else If .cp. .in. LetterDigits Then PVALID;
527 Else DISALLOWED;
528
529 4. Code Points
530
531 The categories and rules defined in Sections 2 and 3 apply to all
532 Unicode code points. The table in Appendix B shows, for illustrative
533 purposes, the consequences of the categories and classification
534 rules, and the resulting property values.
535
536 The list of code points that can be found in Appendix B is
537 non-normative. Sections 2 and 3 are normative.
538
539
540
541
542
543
544
545
546
547 Faltstrom Standards Track [Page 10]
548 RFC 5892 IDNA Code Points August 2010
549
550
551 5. IANA Considerations
552
553 5.1. IDNA-Derived Property Value Registry
554
555 IANA has created a registry with the derived properties for the
556 versions of Unicode released after (and including) version 5.2. The
557 derived property value is to be calculated in cooperation with a
558 designated expert [RFC5226] according to the specifications in
559 Sections 2 and 3 and not by copying the non-normative table found in
560 Appendix B.
561
562 If non-backward-compatible changes or other problems arise during the
563 creation or designated expert review of the table of derived property
564 values, they should be flagged for the IESG. Changes to the rules
565 (as specified in Sections 2 and 3), including BackwardCompatible
566 (Section 2.7) (a set that is at release of this document is empty)
567 require IETF Review, as described in RFC 5226 [RFC5226].
568
569 5.2. IDNA Context Registry
570
571 For characters that are defined in the IDNA derived property value
572 registry (Section 5.1) as CONTEXTO or CONTEXTJ and that therefore
573 require a contextual rule, IANA has created and now maintains a list
574 of approved contextual rules. Additions or changes to these rules
575 require IETF Review, as described in [RFC5226].
576
577 Appendix A contains further discussion and a table from which that
578 registry can be initialized.
579
580 5.2.1. Template for Context Registry
581
582 The following information is to be given when a new rule is created.
583
584 Name: Unique name of the rule
585
586 Code point: Rule that should be applied when this code point
587 exists in the label
588
589 Overview: Description in plain English on what the rule verifies
590
591 Lookup: Should the rule be applied at time of lookup?
592
593 Rule Set: The set of rules, with a reference to the defining
594 document.
595
596
597
598
599
600
601
602 Faltstrom Standards Track [Page 11]
603 RFC 5892 IDNA Code Points August 2010
604
605
606 6. Security Considerations
607
608 Security Considerations for this version of IDNA, except for the
609 special issues associated with right-to-left scripts and characters,
610 are described in the Definitions document [RFC5890]. Specific issues
611 for labels containing characters associated with scripts written
612 right to left appear in the Bidi document [RFC5893].
613
614 7. Acknowledgements
615
616 This document would not have been possible to produce without input
617 from many people. The main contributors are (in alphabetical order)
618 Harald Alvestrand, Vint Cerf, Tina Dam, Mark Davis, Gihan Dias,
619 Mouhammet Diop, Michael Everson, Asmus Freytag, Debbie Garside, Paul
620 Hoffman, Kent Karlsson, Cary Karp, Jaeyoun Kim, John Klensin, Olaf
621 Kolkman, Gervase Markham, Ram Mohan, Lisa Moore, Yngve Pettersen,
622 Erik van der Poel, Hualin Qian, Rick Reed, Pete Resnick, Lakmal
623 Silva, Michel Suignard, Andrew Sullivan, Wil Tan, Kenneth Whistler,
624 Chris Wright, and Yoshiro Yoneya.
625
626
627
628
629
630
631
632
633
634
635
636
637
638
639
640
641
642
643
644
645
646
647
648
649
650
651
652
653
654
655
656
657 Faltstrom Standards Track [Page 12]
658 RFC 5892 IDNA Code Points August 2010
659
660
661 Appendix A. Contextual Rules Registry
662
663 As discussed in Section 5.2 and in the IANA Considerations section of
664 the Rationale document [RFC5894], a registry of rules that define the
665 contexts in which particular PROTOCOL-VALID characters, characters
666 associated with a requirement for Contextual Information, are
667 permitted. These rules are expressed as tests on the label in which
668 the characters appear (all, or any part of, the label may be tested).
669
670 The grammatical rules are expressed in pseudo-code. The conventions
671 used for that pseudo-code are explained here.
672
673 Each rule is constructed as a Boolean expression that evaluates to
674 either True or False. A simple "True;" or "False;" rule sets the
675 default result value for the rule set. Subsequent conditional rules
676 that evaluate to True or False may re-set the result value.
677
678 A special value "Undefined" is used to deal with any error
679 conditions, such as an attempt to test a character before the start
680 of a label or after the end of a label. If any term of a rule
681 evaluates to Undefined, further evaluation of the rule immediately
682 terminates, as the result value of the rule will itself be Undefined.
683
684 cp represents the code point to be tested.
685
686 FirstChar is a special term that denotes the first code point in a
687 label.
688
689 LastChar is a special term that denotes the last code point in a
690 label.
691
692 .eq. represents the equality relation.
693
694 A .eq. B evaluates to True if A equals B.
695
696 .is. represents checking the position in a label.
697
698 A .is. B evaluates to True if A and B have same position in
699 the same label.
700
701 .ne. represents the non-equality relation.
702
703 A .ne. B evaluates to True if A is not equal to B.
704
705 .in. represents the set inclusion relation.
706
707 A .in. B evaluates to True if A is a member of the set B.
708
709
710
711
712 Faltstrom Standards Track [Page 13]
713 RFC 5892 IDNA Code Points August 2010
714
715
716 A functional notation, Function_Name(cp), is used to express either
717 string positions within a label, Boolean character property tests of
718 a code point, or a regular expression match. When such function
719 names refer to Boolean character property tests, the function names
720 use the exact Unicode character property name for the property in
721 question, and "cp" is evaluated as the Unicode value of the code
722 point to be tested, rather than as its position in the label. When
723 such function names refer to string positions within a label, "cp" is
724 evaluated as its position in the label.
725
726 RegExpMatch(X) takes as its parameter X a schematic regular
727 expression consisting of a mix of Unicode character property values
728 and literal Unicode code points.
729
730 Script(cp) returns the value of the Unicode Script property, as
731 defined in Scripts.txt in the Unicode Character Database.
732
733 Canonical_Combining_Class(cp) returns the value of the Unicode
734 Canonical_Combining_Class property, as defined in UnicodeData.txt in
735 the Unicode Character Database.
736
737 Before(cp) returns the code point of the character immediately
738 preceding cp in logical order in the string representing the label.
739 Before(FirstChar) evaluates to Undefined.
740
741 After(cp) returns the code point of the character immediately
742 following cp in logical order in the string representing the label.
743 After(LastChar) evaluates to Undefined.
744
745 Note that "Before" and "After" do not refer to the visual display
746 order of the character in a label, which may be reversed or otherwise
747 modified by the bidirectional algorithm for labels including
748 characters from scripts written right to left. Instead, "Before" and
749 "After" refer to the network order of the character in the label.
750
751 The clauses "Then True" and "Then False" imply exit from the
752 pseudo-code routine with the corresponding result.
753
754 Repeated evaluation for all characters in a label makes use of the
755 special construct:
756
757 For All Characters:
758
759 Expression;
760
761 End For;
762
763
764
765
766
767 Faltstrom Standards Track [Page 14]
768 RFC 5892 IDNA Code Points August 2010
769
770
771 This construct requires repeated evaluation of "Expression" for each
772 code point in the label, starting from FirstChar and proceeding to
773 LastChar.
774
775 The different fields in the rules are to be interpreted as follows:
776
The IETF is responsible for the creation and maintenance of the DNS RFCs. The ICANN DNS RFC annotation project provides a forum for collecting community annotations on these RFCs as an aid to understanding for implementers and any interested parties. The annotations displayed here are not the result of the IETF consensus process.
This RFC is included in the DNS RFCs annotation project whose home page is here.
777 Code point:
778 The code point, or code points, to which this rule is to be
779 applied. Normally, this implies that if any of the code points in
780 a label is as defined, then the rules should be applied. If
781 evaluated to True, the code point is OK as used; if evaluated to
782 False, it is not OK.
783
784 Overview:
785 A description of the goal with the rule, in plain English.
786
787 Lookup:
788 True if application of this rule is recommended at lookup time;
789 False otherwise.
790
791 Rule Set:
792 The rule set itself, as described above.
793
794 Appendix A.1. ZERO WIDTH NON-JOINER
795
796 Code point:
797 U+200C
798
799 Overview:
800 This may occur in a formally cursive script (such as Arabic) in a
801 context where it breaks a cursive connection as required for
802 orthographic rules, as in the Persian language, for example. It
803 also may occur in Indic scripts in a consonant-conjunct context
804 (immediately following a virama), to control required display of
805 such conjuncts.
806
807 Lookup:
808 True
809
In A: Code point: The code point, or code points, to which this rule is to be applied. Normally, this implies that if any of the code points in a label is as defined, then the rules should be applied. If evaluated to True, the code point is OK as used; if evaluated to False, it is not OK.
In A: Code point: The code point, or code points, to which this rule is to be applied. Normally, this implies that if any of the code points in a label is as defined, then the rules should be applied. If evaluated to True, the code point is OK as used; if evaluated to False, it is not OK. For the rule to be evaluated to True for the label, it MUST be evaluated separately for every occurrence of the Code point in the label; each of those evaluations must result in True.
810 Rule Set:
811
812 False;
813
814 If Canonical_Combining_Class(Before(cp)) .eq. Virama Then True;
815
816 If RegExpMatch((Joining_Type:{L,D})(Joining_Type:T)*\u200C
817
818 (Joining_Type:T)*(Joining_Type:{R,D})) Then True;
819
820
821
822 Faltstrom Standards Track [Page 15]
823 RFC 5892 IDNA Code Points August 2010
824
825
826 Appendix A.2. ZERO WIDTH JOINER
827
828 Code point:
829 U+200D
830
831 Overview:
832 This may occur in Indic scripts in a consonant-conjunct context
833 (immediately following a virama), to control required display of
834 such conjuncts.
835
836 Lookup:
837 True
838
839 Rule Set:
840
841 False;
842
843 If Canonical_Combining_Class(Before(cp)) .eq. Virama Then True;
844
845 Appendix A.3. MIDDLE DOT
846
847 Code point:
848 U+00B7
849
850 Overview:
851 Between 'l' (U+006C) characters only, used to permit the Catalan
852 character ela geminada to be expressed.
853
854 Lookup:
855 False
856
857 Rule Set:
858
859 False;
860
861 If Before(cp) .eq. U+006C And
862
863 After(cp) .eq. U+006C Then True;
864
865
866
867
868
869
870
871
872
873
874
875
876
877 Faltstrom Standards Track [Page 16]
878 RFC 5892 IDNA Code Points August 2010
879
880
881 Appendix A.4. GREEK LOWER NUMERAL SIGN (KERAIA)
882
883 Code point:
884 U+0375
885
886 Overview:
887 The script of the following character MUST be Greek.
888
889 Lookup:
890 False
891
892 Rule Set:
893
894 False;
895
896 If Script(After(cp)) .eq. Greek Then True;
897
898 Appendix A.5. HEBREW PUNCTUATION GERESH
899
900 Code point:
901 U+05F3
902
903 Overview:
904 The script of the preceding character MUST be Hebrew.
905
906 Lookup:
907 False
908
909 Rule Set:
910
911 False;
912
913 If Script(Before(cp)) .eq. Hebrew Then True;
914
915
916
917
918
919
920
921
922
923
924
925
926
927
928
929
930
931
932 Faltstrom Standards Track [Page 17]
933 RFC 5892 IDNA Code Points August 2010
934
935
936 Appendix A.6. HEBREW PUNCTUATION GERSHAYIM
937
938 Code point:
939 U+05F4
940
941 Overview:
942 The script of the preceding character MUST be Hebrew.
943
944 Lookup:
945 False
946
947 Rule Set:
948
949 False;
950
951 If Script(Before(cp)) .eq. Hebrew Then True;
952
953 Appendix A.7. KATAKANA MIDDLE DOT
954
955 Code point:
956 U+30FB
957
958 Overview:
959 Note that the Script of Katakana Middle Dot is not any of
960 "Hiragana", "Katakana", or "Han". The effect of this rule is to
961 require at least one character in the label to be in one of those
962 scripts.
963
964 Lookup:
965 False
966
967 Rule Set:
968
969 False;
970
971 For All Characters:
972
973 If Script(cp) .in. {Hiragana, Katakana, Han} Then True;
974
975 End For;
976
977
978
979
980
981
982
983
984
985
986
987 Faltstrom Standards Track [Page 18]
988 RFC 5892 IDNA Code Points August 2010
989
990
991 Appendix A.8. ARABIC-INDIC DIGITS
992
993 Code point:
994 0660..0669
995
996 Overview:
997 Can not be mixed with Extended Arabic-Indic Digits.
998
999 Lookup:
1000 False
1001
1002 Rule Set:
1003
1004 True;
1005
1006 For All Characters:
1007
1008 If cp .in. 06F0..06F9 Then False;
1009
1010 End For;
1011
1012 Appendix A.9. EXTENDED ARABIC-INDIC DIGITS
1013
1014 Code point:
1015 06F0..06F9
1016
1017 Overview:
1018 Can not be mixed with Arabic-Indic Digits.
1019
1020 Lookup:
1021 False
1022
1023 Rule Set:
1024
1025 True;
1026
1027 For All Characters:
1028
1029 If cp .in. 0660..0669 Then False;
1030
1031 End For;
1032
1033
1034
1035
1036
1037
1038
1039
1040
1041
1042 Faltstrom Standards Track [Page 19]
1043 RFC 5892 IDNA Code Points August 2010
1044
1045
1046 Appendix B. Code Points 0x0000 - 0x10FFFF
1047
1048 If one applies the rules (Section 3) to the code points 0x0000 to
1049 0x10FFFF to Unicode 5.2, the result is as follows.
1050
1051 This list is non-normative, and only included for illustrative
1052 purposes. Specifically, what is displayed in the third column is not
1053 the formal name of the code point (as defined in Section 4.8 of The
1054 Unicode Standard [Unicode52]). The differences exist, for example,
1055 for the code points that have the code point value as part of the
1056 name (for example, CJK UNIFIED IDEOGRAPH-4E00) and the naming of
1057 Hangul syllables. For many code points, what you see is the official
1058 name.
1059
1060 Appendix B.1. Code Points in Unicode Character Database (UCD) Format
1061
1062 0000..002C ; DISALLOWED # <control>..COMMA
1063 002D ; PVALID # HYPHEN-MINUS
1064 002E..002F ; DISALLOWED # FULL STOP..SOLIDUS
1065 0030..0039 ; PVALID # DIGIT ZERO..DIGIT NINE
1066 003A..0060 ; DISALLOWED # COLON..GRAVE ACCENT
1067 0061..007A ; PVALID # LATIN SMALL LETTER A..LATIN SMALL LETTER Z
1068 007B..00B6 ; DISALLOWED # LEFT CURLY BRACKET..PILCROW SIGN
1069 00B7 ; CONTEXTO # MIDDLE DOT
1070 00B8..00DE ; DISALLOWED # CEDILLA..LATIN CAPITAL LETTER THORN
1071 00DF..00F6 ; PVALID # LATIN SMALL LETTER SHARP S..LATIN SMALL LETT
1072 00F7 ; DISALLOWED # DIVISION SIGN
1073 00F8..00FF ; PVALID # LATIN SMALL LETTER O WITH STROKE..LATIN SMAL
1074 0100 ; DISALLOWED # LATIN CAPITAL LETTER A WITH MACRON
1075 0101 ; PVALID # LATIN SMALL LETTER A WITH MACRON
1076 0102 ; DISALLOWED # LATIN CAPITAL LETTER A WITH BREVE
1077 0103 ; PVALID # LATIN SMALL LETTER A WITH BREVE
1078 0104 ; DISALLOWED # LATIN CAPITAL LETTER A WITH OGONEK
1079 0105 ; PVALID # LATIN SMALL LETTER A WITH OGONEK
1080 0106 ; DISALLOWED # LATIN CAPITAL LETTER C WITH ACUTE
1081 0107 ; PVALID # LATIN SMALL LETTER C WITH ACUTE
1082 0108 ; DISALLOWED # LATIN CAPITAL LETTER C WITH CIRCUMFLEX
1083 0109 ; PVALID # LATIN SMALL LETTER C WITH CIRCUMFLEX
1084 010A ; DISALLOWED # LATIN CAPITAL LETTER C WITH DOT ABOVE
1085 010B ; PVALID # LATIN SMALL LETTER C WITH DOT ABOVE
1086 010C ; DISALLOWED # LATIN CAPITAL LETTER C WITH CARON
1087 010D ; PVALID # LATIN SMALL LETTER C WITH CARON
1088 010E ; DISALLOWED # LATIN CAPITAL LETTER D WITH CARON
1089 010F ; PVALID # LATIN SMALL LETTER D WITH CARON
1090 0110 ; DISALLOWED # LATIN CAPITAL LETTER D WITH STROKE
1091 0111 ; PVALID # LATIN SMALL LETTER D WITH STROKE
1092 0112 ; DISALLOWED # LATIN CAPITAL LETTER E WITH MACRON
1093 0113 ; PVALID # LATIN SMALL LETTER E WITH MACRON
1094
1095
1096
1097 Faltstrom Standards Track [Page 20]
1098 RFC 5892 IDNA Code Points August 2010
1099
1100
1101 0114 ; DISALLOWED # LATIN CAPITAL LETTER E WITH BREVE
1102 0115 ; PVALID # LATIN SMALL LETTER E WITH BREVE
1103 0116 ; DISALLOWED # LATIN CAPITAL LETTER E WITH DOT ABOVE
1104 0117 ; PVALID # LATIN SMALL LETTER E WITH DOT ABOVE
1105 0118 ; DISALLOWED # LATIN CAPITAL LETTER E WITH OGONEK
1106 0119 ; PVALID # LATIN SMALL LETTER E WITH OGONEK
1107 011A ; DISALLOWED # LATIN CAPITAL LETTER E WITH CARON
1108 011B ; PVALID # LATIN SMALL LETTER E WITH CARON
1109 011C ; DISALLOWED # LATIN CAPITAL LETTER G WITH CIRCUMFLEX
1110 011D ; PVALID # LATIN SMALL LETTER G WITH CIRCUMFLEX
1111 011E ; DISALLOWED # LATIN CAPITAL LETTER G WITH BREVE
1112 011F ; PVALID # LATIN SMALL LETTER G WITH BREVE
1113 0120 ; DISALLOWED # LATIN CAPITAL LETTER G WITH DOT ABOVE
1114 0121 ; PVALID # LATIN SMALL LETTER G WITH DOT ABOVE
1115 0122 ; DISALLOWED # LATIN CAPITAL LETTER G WITH CEDILLA
1116 0123 ; PVALID # LATIN SMALL LETTER G WITH CEDILLA
1117 0124 ; DISALLOWED # LATIN CAPITAL LETTER H WITH CIRCUMFLEX
1118 0125 ; PVALID # LATIN SMALL LETTER H WITH CIRCUMFLEX
1119 0126 ; DISALLOWED # LATIN CAPITAL LETTER H WITH STROKE
1120 0127 ; PVALID # LATIN SMALL LETTER H WITH STROKE
1121 0128 ; DISALLOWED # LATIN CAPITAL LETTER I WITH TILDE
1122 0129 ; PVALID # LATIN SMALL LETTER I WITH TILDE
1123 012A ; DISALLOWED # LATIN CAPITAL LETTER I WITH MACRON
1124 012B ; PVALID # LATIN SMALL LETTER I WITH MACRON
1125 012C ; DISALLOWED # LATIN CAPITAL LETTER I WITH BREVE
1126 012D ; PVALID # LATIN SMALL LETTER I WITH BREVE
1127 012E ; DISALLOWED # LATIN CAPITAL LETTER I WITH OGONEK
1128 012F ; PVALID # LATIN SMALL LETTER I WITH OGONEK
1129 0130 ; DISALLOWED # LATIN CAPITAL LETTER I WITH DOT ABOVE
1130 0131 ; PVALID # LATIN SMALL LETTER DOTLESS I
1131 0132..0134 ; DISALLOWED # LATIN CAPITAL LIGATURE IJ..LATIN CAPITAL LET
1132 0135 ; PVALID # LATIN SMALL LETTER J WITH CIRCUMFLEX
1133 0136 ; DISALLOWED # LATIN CAPITAL LETTER K WITH CEDILLA
1134 0137..0138 ; PVALID # LATIN SMALL LETTER K WITH CEDILLA..LATIN SMA
1135 0139 ; DISALLOWED # LATIN CAPITAL LETTER L WITH ACUTE
1136 013A ; PVALID # LATIN SMALL LETTER L WITH ACUTE
1137 013B ; DISALLOWED # LATIN CAPITAL LETTER L WITH CEDILLA
1138 013C ; PVALID # LATIN SMALL LETTER L WITH CEDILLA
1139 013D ; DISALLOWED # LATIN CAPITAL LETTER L WITH CARON
1140 013E ; PVALID # LATIN SMALL LETTER L WITH CARON
1141 013F..0141 ; DISALLOWED # LATIN CAPITAL LETTER L WITH MIDDLE DOT..LATI
1142 0142 ; PVALID # LATIN SMALL LETTER L WITH STROKE
1143 0143 ; DISALLOWED # LATIN CAPITAL LETTER N WITH ACUTE
1144 0144 ; PVALID # LATIN SMALL LETTER N WITH ACUTE
1145 0145 ; DISALLOWED # LATIN CAPITAL LETTER N WITH CEDILLA
1146 0146 ; PVALID # LATIN SMALL LETTER N WITH CEDILLA
1147 0147 ; DISALLOWED # LATIN CAPITAL LETTER N WITH CARON
1148 0148 ; PVALID # LATIN SMALL LETTER N WITH CARON
1149
1150
1151
1152 Faltstrom Standards Track [Page 21]
1153 RFC 5892 IDNA Code Points August 2010
1154
1155
1156 0149..014A ; DISALLOWED # LATIN SMALL LETTER N PRECEDED BY APOSTROPHE.
1157 014B ; PVALID # LATIN SMALL LETTER ENG
1158 014C ; DISALLOWED # LATIN CAPITAL LETTER O WITH MACRON
1159 014D ; PVALID # LATIN SMALL LETTER O WITH MACRON
1160 014E ; DISALLOWED # LATIN CAPITAL LETTER O WITH BREVE
1161 014F ; PVALID # LATIN SMALL LETTER O WITH BREVE
1162 0150 ; DISALLOWED # LATIN CAPITAL LETTER O WITH DOUBLE ACUTE
1163 0151 ; PVALID # LATIN SMALL LETTER O WITH DOUBLE ACUTE
1164 0152 ; DISALLOWED # LATIN CAPITAL LIGATURE OE
1165 0153 ; PVALID # LATIN SMALL LIGATURE OE
1166 0154 ; DISALLOWED # LATIN CAPITAL LETTER R WITH ACUTE
1167 0155 ; PVALID # LATIN SMALL LETTER R WITH ACUTE
1168 0156 ; DISALLOWED # LATIN CAPITAL LETTER R WITH CEDILLA
1169 0157 ; PVALID # LATIN SMALL LETTER R WITH CEDILLA
1170 0158 ; DISALLOWED # LATIN CAPITAL LETTER R WITH CARON
1171 0159 ; PVALID # LATIN SMALL LETTER R WITH CARON
1172 015A ; DISALLOWED # LATIN CAPITAL LETTER S WITH ACUTE
1173 015B ; PVALID # LATIN SMALL LETTER S WITH ACUTE
1174 015C ; DISALLOWED # LATIN CAPITAL LETTER S WITH CIRCUMFLEX
1175 015D ; PVALID # LATIN SMALL LETTER S WITH CIRCUMFLEX
1176 015E ; DISALLOWED # LATIN CAPITAL LETTER S WITH CEDILLA
1177 015F ; PVALID # LATIN SMALL LETTER S WITH CEDILLA
1178 0160 ; DISALLOWED # LATIN CAPITAL LETTER S WITH CARON
1179 0161 ; PVALID # LATIN SMALL LETTER S WITH CARON
1180 0162 ; DISALLOWED # LATIN CAPITAL LETTER T WITH CEDILLA
1181 0163 ; PVALID # LATIN SMALL LETTER T WITH CEDILLA
1182 0164 ; DISALLOWED # LATIN CAPITAL LETTER T WITH CARON
1183 0165 ; PVALID # LATIN SMALL LETTER T WITH CARON
1184 0166 ; DISALLOWED # LATIN CAPITAL LETTER T WITH STROKE
1185 0167 ; PVALID # LATIN SMALL LETTER T WITH STROKE
1186 0168 ; DISALLOWED # LATIN CAPITAL LETTER U WITH TILDE
1187 0169 ; PVALID # LATIN SMALL LETTER U WITH TILDE
1188 016A ; DISALLOWED # LATIN CAPITAL LETTER U WITH MACRON
1189 016B ; PVALID # LATIN SMALL LETTER U WITH MACRON
1190 016C ; DISALLOWED # LATIN CAPITAL LETTER U WITH BREVE
1191 016D ; PVALID # LATIN SMALL LETTER U WITH BREVE
1192 016E ; DISALLOWED # LATIN CAPITAL LETTER U WITH RING ABOVE
1193 016F ; PVALID # LATIN SMALL LETTER U WITH RING ABOVE
1194 0170 ; DISALLOWED # LATIN CAPITAL LETTER U WITH DOUBLE ACUTE
1195 0171 ; PVALID # LATIN SMALL LETTER U WITH DOUBLE ACUTE
1196 0172 ; DISALLOWED # LATIN CAPITAL LETTER U WITH OGONEK
1197 0173 ; PVALID # LATIN SMALL LETTER U WITH OGONEK
1198 0174 ; DISALLOWED # LATIN CAPITAL LETTER W WITH CIRCUMFLEX
1199 0175 ; PVALID # LATIN SMALL LETTER W WITH CIRCUMFLEX
1200 0176 ; DISALLOWED # LATIN CAPITAL LETTER Y WITH CIRCUMFLEX
1201 0177 ; PVALID # LATIN SMALL LETTER Y WITH CIRCUMFLEX
1202 0178..0179 ; DISALLOWED # LATIN CAPITAL LETTER Y WITH DIAERESIS..LATIN
1203 017A ; PVALID # LATIN SMALL LETTER Z WITH ACUTE
1204
1205
1206
1207 Faltstrom Standards Track [Page 22]
1208 RFC 5892 IDNA Code Points August 2010
1209
1210
1211 017B ; DISALLOWED # LATIN CAPITAL LETTER Z WITH DOT ABOVE
1212 017C ; PVALID # LATIN SMALL LETTER Z WITH DOT ABOVE
1213 017D ; DISALLOWED # LATIN CAPITAL LETTER Z WITH CARON
1214 017E ; PVALID # LATIN SMALL LETTER Z WITH CARON
1215 017F ; DISALLOWED # LATIN SMALL LETTER LONG S
1216 0180 ; PVALID # LATIN SMALL LETTER B WITH STROKE
1217 0181..0182 ; DISALLOWED # LATIN CAPITAL LETTER B WITH HOOK..LATIN CAPI
1218 0183 ; PVALID # LATIN SMALL LETTER B WITH TOPBAR
1219 0184 ; DISALLOWED # LATIN CAPITAL LETTER TONE SIX
1220 0185 ; PVALID # LATIN SMALL LETTER TONE SIX
1221 0186..0187 ; DISALLOWED # LATIN CAPITAL LETTER OPEN O..LATIN CAPITAL L
1222 0188 ; PVALID # LATIN SMALL LETTER C WITH HOOK
1223 0189..018B ; DISALLOWED # LATIN CAPITAL LETTER AFRICAN D..LATIN CAPITA
1224 018C..018D ; PVALID # LATIN SMALL LETTER D WITH TOPBAR..LATIN SMAL
1225 018E..0191 ; DISALLOWED # LATIN CAPITAL LETTER REVERSED E..LATIN CAPIT
1226 0192 ; PVALID # LATIN SMALL LETTER F WITH HOOK
1227 0193..0194 ; DISALLOWED # LATIN CAPITAL LETTER G WITH HOOK..LATIN CAPI
1228 0195 ; PVALID # LATIN SMALL LETTER HV
1229 0196..0198 ; DISALLOWED # LATIN CAPITAL LETTER IOTA..LATIN CAPITAL LET
1230 0199..019B ; PVALID # LATIN SMALL LETTER K WITH HOOK..LATIN SMALL
1231 019C..019D ; DISALLOWED # LATIN CAPITAL LETTER TURNED M..LATIN CAPITAL
1232 019E ; PVALID # LATIN SMALL LETTER N WITH LONG RIGHT LEG
1233 019F..01A0 ; DISALLOWED # LATIN CAPITAL LETTER O WITH MIDDLE TILDE..LA
1234 01A1 ; PVALID # LATIN SMALL LETTER O WITH HORN
1235 01A2 ; DISALLOWED # LATIN CAPITAL LETTER OI
1236 01A3 ; PVALID # LATIN SMALL LETTER OI
1237 01A4 ; DISALLOWED # LATIN CAPITAL LETTER P WITH HOOK
1238 01A5 ; PVALID # LATIN SMALL LETTER P WITH HOOK
1239 01A6..01A7 ; DISALLOWED # LATIN LETTER YR..LATIN CAPITAL LETTER TONE T
1240 01A8 ; PVALID # LATIN SMALL LETTER TONE TWO
1241 01A9 ; DISALLOWED # LATIN CAPITAL LETTER ESH
1242 01AA..01AB ; PVALID # LATIN LETTER REVERSED ESH LOOP..LATIN SMALL
1243 01AC ; DISALLOWED # LATIN CAPITAL LETTER T WITH HOOK
1244 01AD ; PVALID # LATIN SMALL LETTER T WITH HOOK
1245 01AE..01AF ; DISALLOWED # LATIN CAPITAL LETTER T WITH RETROFLEX HOOK..
1246 01B0 ; PVALID # LATIN SMALL LETTER U WITH HORN
1247 01B1..01B3 ; DISALLOWED # LATIN CAPITAL LETTER UPSILON..LATIN CAPITAL
1248 01B4 ; PVALID # LATIN SMALL LETTER Y WITH HOOK
1249 01B5 ; DISALLOWED # LATIN CAPITAL LETTER Z WITH STROKE
1250 01B6 ; PVALID # LATIN SMALL LETTER Z WITH STROKE
1251 01B7..01B8 ; DISALLOWED # LATIN CAPITAL LETTER EZH..LATIN CAPITAL LETT
1252 01B9..01BB ; PVALID # LATIN SMALL LETTER EZH REVERSED..LATIN LETTE
1253 01BC ; DISALLOWED # LATIN CAPITAL LETTER TONE FIVE
1254 01BD..01C3 ; PVALID # LATIN SMALL LETTER TONE FIVE..LATIN LETTER R
1255 01C4..01CD ; DISALLOWED # LATIN CAPITAL LETTER DZ WITH CARON..LATIN CA
1256 01CE ; PVALID # LATIN SMALL LETTER A WITH CARON
1257 01CF ; DISALLOWED # LATIN CAPITAL LETTER I WITH CARON
1258 01D0 ; PVALID # LATIN SMALL LETTER I WITH CARON
1259
1260
1261
1262 Faltstrom Standards Track [Page 23]
1263 RFC 5892 IDNA Code Points August 2010
1264
1265
1266 01D1 ; DISALLOWED # LATIN CAPITAL LETTER O WITH CARON
1267 01D2 ; PVALID # LATIN SMALL LETTER O WITH CARON
1268 01D3 ; DISALLOWED # LATIN CAPITAL LETTER U WITH CARON
1269 01D4 ; PVALID # LATIN SMALL LETTER U WITH CARON
1270 01D5 ; DISALLOWED # LATIN CAPITAL LETTER U WITH DIAERESIS AND MA
1271 01D6 ; PVALID # LATIN SMALL LETTER U WITH DIAERESIS AND MACR
1272 01D7 ; DISALLOWED # LATIN CAPITAL LETTER U WITH DIAERESIS AND AC
1273 01D8 ; PVALID # LATIN SMALL LETTER U WITH DIAERESIS AND ACUT
1274 01D9 ; DISALLOWED # LATIN CAPITAL LETTER U WITH DIAERESIS AND CA
1275 01DA ; PVALID # LATIN SMALL LETTER U WITH DIAERESIS AND CARO
1276 01DB ; DISALLOWED # LATIN CAPITAL LETTER U WITH DIAERESIS AND GR
1277 01DC..01DD ; PVALID # LATIN SMALL LETTER U WITH DIAERESIS AND GRAV
1278 01DE ; DISALLOWED # LATIN CAPITAL LETTER A WITH DIAERESIS AND MA
1279 01DF ; PVALID # LATIN SMALL LETTER A WITH DIAERESIS AND MACR
1280 01E0 ; DISALLOWED # LATIN CAPITAL LETTER A WITH DOT ABOVE AND MA
1281 01E1 ; PVALID # LATIN SMALL LETTER A WITH DOT ABOVE AND MACR
1282 01E2 ; DISALLOWED # LATIN CAPITAL LETTER AE WITH MACRON
1283 01E3 ; PVALID # LATIN SMALL LETTER AE WITH MACRON
1284 01E4 ; DISALLOWED # LATIN CAPITAL LETTER G WITH STROKE
1285 01E5 ; PVALID # LATIN SMALL LETTER G WITH STROKE
1286 01E6 ; DISALLOWED # LATIN CAPITAL LETTER G WITH CARON
1287 01E7 ; PVALID # LATIN SMALL LETTER G WITH CARON
1288 01E8 ; DISALLOWED # LATIN CAPITAL LETTER K WITH CARON
1289 01E9 ; PVALID # LATIN SMALL LETTER K WITH CARON
1290 01EA ; DISALLOWED # LATIN CAPITAL LETTER O WITH OGONEK
1291 01EB ; PVALID # LATIN SMALL LETTER O WITH OGONEK
1292 01EC ; DISALLOWED # LATIN CAPITAL LETTER O WITH OGONEK AND MACRO
1293 01ED ; PVALID # LATIN SMALL LETTER O WITH OGONEK AND MACRON
1294 01EE ; DISALLOWED # LATIN CAPITAL LETTER EZH WITH CARON
1295 01EF..01F0 ; PVALID # LATIN SMALL LETTER EZH WITH CARON..LATIN SMA
1296 01F1..01F4 ; DISALLOWED # LATIN CAPITAL LETTER DZ..LATIN CAPITAL LETTE
1297 01F5 ; PVALID # LATIN SMALL LETTER G WITH ACUTE
1298 01F6..01F8 ; DISALLOWED # LATIN CAPITAL LETTER HWAIR..LATIN CAPITAL LE
1299 01F9 ; PVALID # LATIN SMALL LETTER N WITH GRAVE
1300 01FA ; DISALLOWED # LATIN CAPITAL LETTER A WITH RING ABOVE AND A
1301 01FB ; PVALID # LATIN SMALL LETTER A WITH RING ABOVE AND ACU
1302 01FC ; DISALLOWED # LATIN CAPITAL LETTER AE WITH ACUTE
1303 01FD ; PVALID # LATIN SMALL LETTER AE WITH ACUTE
1304 01FE ; DISALLOWED # LATIN CAPITAL LETTER O WITH STROKE AND ACUTE
1305 01FF ; PVALID # LATIN SMALL LETTER O WITH STROKE AND ACUTE
1306 0200 ; DISALLOWED # LATIN CAPITAL LETTER A WITH DOUBLE GRAVE
1307 0201 ; PVALID # LATIN SMALL LETTER A WITH DOUBLE GRAVE
1308 0202 ; DISALLOWED # LATIN CAPITAL LETTER A WITH INVERTED BREVE
1309 0203 ; PVALID # LATIN SMALL LETTER A WITH INVERTED BREVE
1310 0204 ; DISALLOWED # LATIN CAPITAL LETTER E WITH DOUBLE GRAVE
1311 0205 ; PVALID # LATIN SMALL LETTER E WITH DOUBLE GRAVE
1312 0206 ; DISALLOWED # LATIN CAPITAL LETTER E WITH INVERTED BREVE
1313 0207 ; PVALID # LATIN SMALL LETTER E WITH INVERTED BREVE
1314
1315
1316
1317 Faltstrom Standards Track [Page 24]
1318 RFC 5892 IDNA Code Points August 2010
1319
1320
1321 0208 ; DISALLOWED # LATIN CAPITAL LETTER I WITH DOUBLE GRAVE
1322 0209 ; PVALID # LATIN SMALL LETTER I WITH DOUBLE GRAVE
1323 020A ; DISALLOWED # LATIN CAPITAL LETTER I WITH INVERTED BREVE
1324 020B ; PVALID # LATIN SMALL LETTER I WITH INVERTED BREVE
1325 020C ; DISALLOWED # LATIN CAPITAL LETTER O WITH DOUBLE GRAVE
1326 020D ; PVALID # LATIN SMALL LETTER O WITH DOUBLE GRAVE
1327 020E ; DISALLOWED # LATIN CAPITAL LETTER O WITH INVERTED BREVE
1328 020F ; PVALID # LATIN SMALL LETTER O WITH INVERTED BREVE
1329 0210 ; DISALLOWED # LATIN CAPITAL LETTER R WITH DOUBLE GRAVE
1330 0211 ; PVALID # LATIN SMALL LETTER R WITH DOUBLE GRAVE
1331 0212 ; DISALLOWED # LATIN CAPITAL LETTER R WITH INVERTED BREVE
1332 0213 ; PVALID # LATIN SMALL LETTER R WITH INVERTED BREVE
1333 0214 ; DISALLOWED # LATIN CAPITAL LETTER U WITH DOUBLE GRAVE
1334 0215 ; PVALID # LATIN SMALL LETTER U WITH DOUBLE GRAVE
1335 0216 ; DISALLOWED # LATIN CAPITAL LETTER U WITH INVERTED BREVE
1336 0217 ; PVALID # LATIN SMALL LETTER U WITH INVERTED BREVE
1337 0218 ; DISALLOWED # LATIN CAPITAL LETTER S WITH COMMA BELOW
1338 0219 ; PVALID # LATIN SMALL LETTER S WITH COMMA BELOW
1339 021A ; DISALLOWED # LATIN CAPITAL LETTER T WITH COMMA BELOW
1340 021B ; PVALID # LATIN SMALL LETTER T WITH COMMA BELOW
1341 021C ; DISALLOWED # LATIN CAPITAL LETTER YOGH
1342 021D ; PVALID # LATIN SMALL LETTER YOGH
1343 021E ; DISALLOWED # LATIN CAPITAL LETTER H WITH CARON
1344 021F ; PVALID # LATIN SMALL LETTER H WITH CARON
1345 0220 ; DISALLOWED # LATIN CAPITAL LETTER N WITH LONG RIGHT LEG
1346 0221 ; PVALID # LATIN SMALL LETTER D WITH CURL
1347 0222 ; DISALLOWED # LATIN CAPITAL LETTER OU
1348 0223 ; PVALID # LATIN SMALL LETTER OU
1349 0224 ; DISALLOWED # LATIN CAPITAL LETTER Z WITH HOOK
1350 0225 ; PVALID # LATIN SMALL LETTER Z WITH HOOK
1351 0226 ; DISALLOWED # LATIN CAPITAL LETTER A WITH DOT ABOVE
1352 0227 ; PVALID # LATIN SMALL LETTER A WITH DOT ABOVE
1353 0228 ; DISALLOWED # LATIN CAPITAL LETTER E WITH CEDILLA
1354 0229 ; PVALID # LATIN SMALL LETTER E WITH CEDILLA
1355 022A ; DISALLOWED # LATIN CAPITAL LETTER O WITH DIAERESIS AND MA
1356 022B ; PVALID # LATIN SMALL LETTER O WITH DIAERESIS AND MACR
1357 022C ; DISALLOWED # LATIN CAPITAL LETTER O WITH TILDE AND MACRON
1358 022D ; PVALID # LATIN SMALL LETTER O WITH TILDE AND MACRON
1359 022E ; DISALLOWED # LATIN CAPITAL LETTER O WITH DOT ABOVE
1360 022F ; PVALID # LATIN SMALL LETTER O WITH DOT ABOVE
1361 0230 ; DISALLOWED # LATIN CAPITAL LETTER O WITH DOT ABOVE AND MA
1362 0231 ; PVALID # LATIN SMALL LETTER O WITH DOT ABOVE AND MACR
1363 0232 ; DISALLOWED # LATIN CAPITAL LETTER Y WITH MACRON
1364 0233..0239 ; PVALID # LATIN SMALL LETTER Y WITH MACRON..LATIN SMAL
1365 023A..023B ; DISALLOWED # LATIN CAPITAL LETTER A WITH STROKE..LATIN CA
1366 023C ; PVALID # LATIN SMALL LETTER C WITH STROKE
1367 023D..023E ; DISALLOWED # LATIN CAPITAL LETTER L WITH BAR..LATIN CAPIT
1368 023F..0240 ; PVALID # LATIN SMALL LETTER S WITH SWASH TAIL..LATIN
1369
1370
1371
1372 Faltstrom Standards Track [Page 25]
1373 RFC 5892 IDNA Code Points August 2010
1374
1375
1376 0241 ; DISALLOWED # LATIN CAPITAL LETTER GLOTTAL STOP
1377 0242 ; PVALID # LATIN SMALL LETTER GLOTTAL STOP
1378 0243..0246 ; DISALLOWED # LATIN CAPITAL LETTER B WITH STROKE..LATIN CA
1379 0247 ; PVALID # LATIN SMALL LETTER E WITH STROKE
1380 0248 ; DISALLOWED # LATIN CAPITAL LETTER J WITH STROKE
1381 0249 ; PVALID # LATIN SMALL LETTER J WITH STROKE
1382 024A ; DISALLOWED # LATIN CAPITAL LETTER SMALL Q WITH HOOK TAIL
1383 024B ; PVALID # LATIN SMALL LETTER Q WITH HOOK TAIL
1384 024C ; DISALLOWED # LATIN CAPITAL LETTER R WITH STROKE
1385 024D ; PVALID # LATIN SMALL LETTER R WITH STROKE
1386 024E ; DISALLOWED # LATIN CAPITAL LETTER Y WITH STROKE
1387 024F..02AF ; PVALID # LATIN SMALL LETTER Y WITH STROKE..LATIN SMAL
1388 02B0..02B8 ; DISALLOWED # MODIFIER LETTER SMALL H..MODIFIER LETTER SMA
1389 02B9..02C1 ; PVALID # MODIFIER LETTER PRIME..MODIFIER LETTER REVER
1390 02C2..02C5 ; DISALLOWED # MODIFIER LETTER LEFT ARROWHEAD..MODIFIER LET
1391 02C6..02D1 ; PVALID # MODIFIER LETTER CIRCUMFLEX ACCENT..MODIFIER
1392 02D2..02EB ; DISALLOWED # MODIFIER LETTER CENTRED RIGHT HALF RING..MOD
1393 02EC ; PVALID # MODIFIER LETTER VOICING
1394 02ED ; DISALLOWED # MODIFIER LETTER UNASPIRATED
1395 02EE ; PVALID # MODIFIER LETTER DOUBLE APOSTROPHE
1396 02EF..02FF ; DISALLOWED # MODIFIER LETTER LOW DOWN ARROWHEAD..MODIFIER
1397 0300..033F ; PVALID # COMBINING GRAVE ACCENT..COMBINING DOUBLE OVE
1398 0340..0341 ; DISALLOWED # COMBINING GRAVE TONE MARK..COMBINING ACUTE T
1399 0342 ; PVALID # COMBINING GREEK PERISPOMENI
1400 0343..0345 ; DISALLOWED # COMBINING GREEK KORONIS..COMBINING GREEK YPO
1401 0346..034E ; PVALID # COMBINING BRIDGE ABOVE..COMBINING UPWARDS AR
1402 034F ; DISALLOWED # COMBINING GRAPHEME JOINER
1403 0350..036F ; PVALID # COMBINING RIGHT ARROWHEAD ABOVE..COMBINING L
1404 0370 ; DISALLOWED # GREEK CAPITAL LETTER HETA
1405 0371 ; PVALID # GREEK SMALL LETTER HETA
1406 0372 ; DISALLOWED # GREEK CAPITAL LETTER ARCHAIC SAMPI
1407 0373 ; PVALID # GREEK SMALL LETTER ARCHAIC SAMPI
1408 0374 ; DISALLOWED # GREEK NUMERAL SIGN
1409 0375 ; CONTEXTO # GREEK LOWER NUMERAL SIGN
1410 0376 ; DISALLOWED # GREEK CAPITAL LETTER PAMPHYLIAN DIGAMMA
1411 0377 ; PVALID # GREEK SMALL LETTER PAMPHYLIAN DIGAMMA
1412 0378..0379 ; UNASSIGNED # <reserved>..<reserved>
1413 037A ; DISALLOWED # GREEK YPOGEGRAMMENI
1414 037B..037D ; PVALID # GREEK SMALL REVERSED LUNATE SIGMA SYMBOL..GR
1415 037E ; DISALLOWED # GREEK QUESTION MARK
1416 037F..0383 ; UNASSIGNED # <reserved>..<reserved>
1417 0384..038A ; DISALLOWED # GREEK TONOS..GREEK CAPITAL LETTER IOTA WITH
1418 038B ; UNASSIGNED # <reserved>
1419 038C ; DISALLOWED # GREEK CAPITAL LETTER OMICRON WITH TONOS
1420 038D ; UNASSIGNED # <reserved>
1421 038E..038F ; DISALLOWED # GREEK CAPITAL LETTER UPSILON WITH TONOS..GRE
1422 0390 ; PVALID # GREEK SMALL LETTER IOTA WITH DIALYTIKA AND T
1423 0391..03A1 ; DISALLOWED # GREEK CAPITAL LETTER ALPHA..GREEK CAPITAL LE
1424
1425
1426
1427 Faltstrom Standards Track [Page 26]
1428 RFC 5892 IDNA Code Points August 2010
1429
1430
1431 03A2 ; UNASSIGNED # <reserved>
1432 03A3..03AB ; DISALLOWED # GREEK CAPITAL LETTER SIGMA..GREEK CAPITAL LE
1433 03AC..03CE ; PVALID # GREEK SMALL LETTER ALPHA WITH TONOS..GREEK S
1434 03CF..03D6 ; DISALLOWED # GREEK CAPITAL KAI SYMBOL..GREEK PI SYMBOL
1435 03D7 ; PVALID # GREEK KAI SYMBOL
1436 03D8 ; DISALLOWED # GREEK LETTER ARCHAIC KOPPA
1437 03D9 ; PVALID # GREEK SMALL LETTER ARCHAIC KOPPA
1438 03DA ; DISALLOWED # GREEK LETTER STIGMA
1439 03DB ; PVALID # GREEK SMALL LETTER STIGMA
1440 03DC ; DISALLOWED # GREEK LETTER DIGAMMA
1441 03DD ; PVALID # GREEK SMALL LETTER DIGAMMA
1442 03DE ; DISALLOWED # GREEK LETTER KOPPA
1443 03DF ; PVALID # GREEK SMALL LETTER KOPPA
1444 03E0 ; DISALLOWED # GREEK LETTER SAMPI
1445 03E1 ; PVALID # GREEK SMALL LETTER SAMPI
1446 03E2 ; DISALLOWED # COPTIC CAPITAL LETTER SHEI
1447 03E3 ; PVALID # COPTIC SMALL LETTER SHEI
1448 03E4 ; DISALLOWED # COPTIC CAPITAL LETTER FEI
1449 03E5 ; PVALID # COPTIC SMALL LETTER FEI
1450 03E6 ; DISALLOWED # COPTIC CAPITAL LETTER KHEI
1451 03E7 ; PVALID # COPTIC SMALL LETTER KHEI
1452 03E8 ; DISALLOWED # COPTIC CAPITAL LETTER HORI
1453 03E9 ; PVALID # COPTIC SMALL LETTER HORI
1454 03EA ; DISALLOWED # COPTIC CAPITAL LETTER GANGIA
1455 03EB ; PVALID # COPTIC SMALL LETTER GANGIA
1456 03EC ; DISALLOWED # COPTIC CAPITAL LETTER SHIMA
1457 03ED ; PVALID # COPTIC SMALL LETTER SHIMA
1458 03EE ; DISALLOWED # COPTIC CAPITAL LETTER DEI
1459 03EF ; PVALID # COPTIC SMALL LETTER DEI
1460 03F0..03F2 ; DISALLOWED # GREEK KAPPA SYMBOL..GREEK LUNATE SIGMA SYMBO
1461 03F3 ; PVALID # GREEK LETTER YOT
1462 03F4..03F7 ; DISALLOWED # GREEK CAPITAL THETA SYMBOL..GREEK CAPITAL LE
1463 03F8 ; PVALID # GREEK SMALL LETTER SHO
1464 03F9..03FA ; DISALLOWED # GREEK CAPITAL LUNATE SIGMA SYMBOL..GREEK CAP
1465 03FB..03FC ; PVALID # GREEK SMALL LETTER SAN..GREEK RHO WITH STROK
1466 03FD..042F ; DISALLOWED # GREEK CAPITAL REVERSED LUNATE SIGMA SYMBOL..
1467 0430..045F ; PVALID # CYRILLIC SMALL LETTER A..CYRILLIC SMALL LETT
1468 0460 ; DISALLOWED # CYRILLIC CAPITAL LETTER OMEGA
1469 0461 ; PVALID # CYRILLIC SMALL LETTER OMEGA
1470 0462 ; DISALLOWED # CYRILLIC CAPITAL LETTER YAT
1471 0463 ; PVALID # CYRILLIC SMALL LETTER YAT
1472 0464 ; DISALLOWED # CYRILLIC CAPITAL LETTER IOTIFIED E
1473 0465 ; PVALID # CYRILLIC SMALL LETTER IOTIFIED E
1474 0466 ; DISALLOWED # CYRILLIC CAPITAL LETTER LITTLE YUS
1475 0467 ; PVALID # CYRILLIC SMALL LETTER LITTLE YUS
1476 0468 ; DISALLOWED # CYRILLIC CAPITAL LETTER IOTIFIED LITTLE YUS
1477 0469 ; PVALID # CYRILLIC SMALL LETTER IOTIFIED LITTLE YUS
1478 046A ; DISALLOWED # CYRILLIC CAPITAL LETTER BIG YUS
1479
1480
1481
1482 Faltstrom Standards Track [Page 27]
1483 RFC 5892 IDNA Code Points August 2010
1484
1485
1486 046B ; PVALID # CYRILLIC SMALL LETTER BIG YUS
1487 046C ; DISALLOWED # CYRILLIC CAPITAL LETTER IOTIFIED BIG YUS
1488 046D ; PVALID # CYRILLIC SMALL LETTER IOTIFIED BIG YUS
1489 046E ; DISALLOWED # CYRILLIC CAPITAL LETTER KSI
1490 046F ; PVALID # CYRILLIC SMALL LETTER KSI
1491 0470 ; DISALLOWED # CYRILLIC CAPITAL LETTER PSI
1492 0471 ; PVALID # CYRILLIC SMALL LETTER PSI
1493 0472 ; DISALLOWED # CYRILLIC CAPITAL LETTER FITA
1494 0473 ; PVALID # CYRILLIC SMALL LETTER FITA
1495 0474 ; DISALLOWED # CYRILLIC CAPITAL LETTER IZHITSA
1496 0475 ; PVALID # CYRILLIC SMALL LETTER IZHITSA
1497 0476 ; DISALLOWED # CYRILLIC CAPITAL LETTER IZHITSA WITH DOUBLE
1498 0477 ; PVALID # CYRILLIC SMALL LETTER IZHITSA WITH DOUBLE GR
1499 0478 ; DISALLOWED # CYRILLIC CAPITAL LETTER UK
1500 0479 ; PVALID # CYRILLIC SMALL LETTER UK
1501 047A ; DISALLOWED # CYRILLIC CAPITAL LETTER ROUND OMEGA
1502 047B ; PVALID # CYRILLIC SMALL LETTER ROUND OMEGA
1503 047C ; DISALLOWED # CYRILLIC CAPITAL LETTER OMEGA WITH TITLO
1504 047D ; PVALID # CYRILLIC SMALL LETTER OMEGA WITH TITLO
1505 047E ; DISALLOWED # CYRILLIC CAPITAL LETTER OT
1506 047F ; PVALID # CYRILLIC SMALL LETTER OT
1507 0480 ; DISALLOWED # CYRILLIC CAPITAL LETTER KOPPA
1508 0481 ; PVALID # CYRILLIC SMALL LETTER KOPPA
1509 0482 ; DISALLOWED # CYRILLIC THOUSANDS SIGN
1510 0483..0487 ; PVALID # COMBINING CYRILLIC TITLO..COMBINING CYRILLIC
1511 0488..048A ; DISALLOWED # COMBINING CYRILLIC HUNDRED THOUSANDS SIGN..C
1512 048B ; PVALID # CYRILLIC SMALL LETTER SHORT I WITH TAIL
1513 048C ; DISALLOWED # CYRILLIC CAPITAL LETTER SEMISOFT SIGN
1514 048D ; PVALID # CYRILLIC SMALL LETTER SEMISOFT SIGN
1515 048E ; DISALLOWED # CYRILLIC CAPITAL LETTER ER WITH TICK
1516 048F ; PVALID # CYRILLIC SMALL LETTER ER WITH TICK
1517 0490 ; DISALLOWED # CYRILLIC CAPITAL LETTER GHE WITH UPTURN
1518 0491 ; PVALID # CYRILLIC SMALL LETTER GHE WITH UPTURN
1519 0492 ; DISALLOWED # CYRILLIC CAPITAL LETTER GHE WITH STROKE
1520 0493 ; PVALID # CYRILLIC SMALL LETTER GHE WITH STROKE
1521 0494 ; DISALLOWED # CYRILLIC CAPITAL LETTER GHE WITH MIDDLE HOOK
1522 0495 ; PVALID # CYRILLIC SMALL LETTER GHE WITH MIDDLE HOOK
1523 0496 ; DISALLOWED # CYRILLIC CAPITAL LETTER ZHE WITH DESCENDER
1524 0497 ; PVALID # CYRILLIC SMALL LETTER ZHE WITH DESCENDER
1525 0498 ; DISALLOWED # CYRILLIC CAPITAL LETTER ZE WITH DESCENDER
1526 0499 ; PVALID # CYRILLIC SMALL LETTER ZE WITH DESCENDER
1527 049A ; DISALLOWED # CYRILLIC CAPITAL LETTER KA WITH DESCENDER
1528 049B ; PVALID # CYRILLIC SMALL LETTER KA WITH DESCENDER
1529 049C ; DISALLOWED # CYRILLIC CAPITAL LETTER KA WITH VERTICAL STR
1530 049D ; PVALID # CYRILLIC SMALL LETTER KA WITH VERTICAL STROK
1531 049E ; DISALLOWED # CYRILLIC CAPITAL LETTER KA WITH STROKE
1532 049F ; PVALID # CYRILLIC SMALL LETTER KA WITH STROKE
1533 04A0 ; DISALLOWED # CYRILLIC CAPITAL LETTER BASHKIR KA
1534
1535
1536
1537 Faltstrom Standards Track [Page 28]
1538 RFC 5892 IDNA Code Points August 2010
1539
1540
1541 04A1 ; PVALID # CYRILLIC SMALL LETTER BASHKIR KA
1542 04A2 ; DISALLOWED # CYRILLIC CAPITAL LETTER EN WITH DESCENDER
1543 04A3 ; PVALID # CYRILLIC SMALL LETTER EN WITH DESCENDER
1544 04A4 ; DISALLOWED # CYRILLIC CAPITAL LIGATURE EN GHE
1545 04A5 ; PVALID # CYRILLIC SMALL LIGATURE EN GHE
1546 04A6 ; DISALLOWED # CYRILLIC CAPITAL LETTER PE WITH MIDDLE HOOK
1547 04A7 ; PVALID # CYRILLIC SMALL LETTER PE WITH MIDDLE HOOK
1548 04A8 ; DISALLOWED # CYRILLIC CAPITAL LETTER ABKHASIAN HA
1549 04A9 ; PVALID # CYRILLIC SMALL LETTER ABKHASIAN HA
1550 04AA ; DISALLOWED # CYRILLIC CAPITAL LETTER ES WITH DESCENDER
1551 04AB ; PVALID # CYRILLIC SMALL LETTER ES WITH DESCENDER
1552 04AC ; DISALLOWED # CYRILLIC CAPITAL LETTER TE WITH DESCENDER
1553 04AD ; PVALID # CYRILLIC SMALL LETTER TE WITH DESCENDER
1554 04AE ; DISALLOWED # CYRILLIC CAPITAL LETTER STRAIGHT U
1555 04AF ; PVALID # CYRILLIC SMALL LETTER STRAIGHT U
1556 04B0 ; DISALLOWED # CYRILLIC CAPITAL LETTER STRAIGHT U WITH STRO
1557 04B1 ; PVALID # CYRILLIC SMALL LETTER STRAIGHT U WITH STROKE
1558 04B2 ; DISALLOWED # CYRILLIC CAPITAL LETTER HA WITH DESCENDER
1559 04B3 ; PVALID # CYRILLIC SMALL LETTER HA WITH DESCENDER
1560 04B4 ; DISALLOWED # CYRILLIC CAPITAL LIGATURE TE TSE
1561 04B5 ; PVALID # CYRILLIC SMALL LIGATURE TE TSE
1562 04B6 ; DISALLOWED # CYRILLIC CAPITAL LETTER CHE WITH DESCENDER
1563 04B7 ; PVALID # CYRILLIC SMALL LETTER CHE WITH DESCENDER
1564 04B8 ; DISALLOWED # CYRILLIC CAPITAL LETTER CHE WITH VERTICAL ST
1565 04B9 ; PVALID # CYRILLIC SMALL LETTER CHE WITH VERTICAL STRO
1566 04BA ; DISALLOWED # CYRILLIC CAPITAL LETTER SHHA
1567 04BB ; PVALID # CYRILLIC SMALL LETTER SHHA
1568 04BC ; DISALLOWED # CYRILLIC CAPITAL LETTER ABKHASIAN CHE
1569 04BD ; PVALID # CYRILLIC SMALL LETTER ABKHASIAN CHE
1570 04BE ; DISALLOWED # CYRILLIC CAPITAL LETTER ABKHASIAN CHE WITH D
1571 04BF ; PVALID # CYRILLIC SMALL LETTER ABKHASIAN CHE WITH DES
1572 04C0..04C1 ; DISALLOWED # CYRILLIC LETTER PALOCHKA..CYRILLIC CAPITAL L
1573 04C2 ; PVALID # CYRILLIC SMALL LETTER ZHE WITH BREVE
1574 04C3 ; DISALLOWED # CYRILLIC CAPITAL LETTER KA WITH HOOK
1575 04C4 ; PVALID # CYRILLIC SMALL LETTER KA WITH HOOK
1576 04C5 ; DISALLOWED # CYRILLIC CAPITAL LETTER EL WITH TAIL
1577 04C6 ; PVALID # CYRILLIC SMALL LETTER EL WITH TAIL
1578 04C7 ; DISALLOWED # CYRILLIC CAPITAL LETTER EN WITH HOOK
1579 04C8 ; PVALID # CYRILLIC SMALL LETTER EN WITH HOOK
1580 04C9 ; DISALLOWED # CYRILLIC CAPITAL LETTER EN WITH TAIL
1581 04CA ; PVALID # CYRILLIC SMALL LETTER EN WITH TAIL
1582 04CB ; DISALLOWED # CYRILLIC CAPITAL LETTER KHAKASSIAN CHE
1583 04CC ; PVALID # CYRILLIC SMALL LETTER KHAKASSIAN CHE
1584 04CD ; DISALLOWED # CYRILLIC CAPITAL LETTER EM WITH TAIL
1585 04CE..04CF ; PVALID # CYRILLIC SMALL LETTER EM WITH TAIL..CYRILLIC
1586 04D0 ; DISALLOWED # CYRILLIC CAPITAL LETTER A WITH BREVE
1587 04D1 ; PVALID # CYRILLIC SMALL LETTER A WITH BREVE
1588 04D2 ; DISALLOWED # CYRILLIC CAPITAL LETTER A WITH DIAERESIS
1589
1590
1591
1592 Faltstrom Standards Track [Page 29]
1593 RFC 5892 IDNA Code Points August 2010
1594
1595
1596 04D3 ; PVALID # CYRILLIC SMALL LETTER A WITH DIAERESIS
1597 04D4 ; DISALLOWED # CYRILLIC CAPITAL LIGATURE A IE
1598 04D5 ; PVALID # CYRILLIC SMALL LIGATURE A IE
1599 04D6 ; DISALLOWED # CYRILLIC CAPITAL LETTER IE WITH BREVE
1600 04D7 ; PVALID # CYRILLIC SMALL LETTER IE WITH BREVE
1601 04D8 ; DISALLOWED # CYRILLIC CAPITAL LETTER SCHWA
1602 04D9 ; PVALID # CYRILLIC SMALL LETTER SCHWA
1603 04DA ; DISALLOWED # CYRILLIC CAPITAL LETTER SCHWA WITH DIAERESIS
1604 04DB ; PVALID # CYRILLIC SMALL LETTER SCHWA WITH DIAERESIS
1605 04DC ; DISALLOWED # CYRILLIC CAPITAL LETTER ZHE WITH DIAERESIS
1606 04DD ; PVALID # CYRILLIC SMALL LETTER ZHE WITH DIAERESIS
1607 04DE ; DISALLOWED # CYRILLIC CAPITAL LETTER ZE WITH DIAERESIS
1608 04DF ; PVALID # CYRILLIC SMALL LETTER ZE WITH DIAERESIS
1609 04E0 ; DISALLOWED # CYRILLIC CAPITAL LETTER ABKHASIAN DZE
1610 04E1 ; PVALID # CYRILLIC SMALL LETTER ABKHASIAN DZE
1611 04E2 ; DISALLOWED # CYRILLIC CAPITAL LETTER I WITH MACRON
1612 04E3 ; PVALID # CYRILLIC SMALL LETTER I WITH MACRON
1613 04E4 ; DISALLOWED # CYRILLIC CAPITAL LETTER I WITH DIAERESIS
1614 04E5 ; PVALID # CYRILLIC SMALL LETTER I WITH DIAERESIS
1615 04E6 ; DISALLOWED # CYRILLIC CAPITAL LETTER O WITH DIAERESIS
1616 04E7 ; PVALID # CYRILLIC SMALL LETTER O WITH DIAERESIS
1617 04E8 ; DISALLOWED # CYRILLIC CAPITAL LETTER BARRED O
1618 04E9 ; PVALID # CYRILLIC SMALL LETTER BARRED O
1619 04EA ; DISALLOWED # CYRILLIC CAPITAL LETTER BARRED O WITH DIAERE
1620 04EB ; PVALID # CYRILLIC SMALL LETTER BARRED O WITH DIAERESI
1621 04EC ; DISALLOWED # CYRILLIC CAPITAL LETTER E WITH DIAERESIS
1622 04ED ; PVALID # CYRILLIC SMALL LETTER E WITH DIAERESIS
1623 04EE ; DISALLOWED # CYRILLIC CAPITAL LETTER U WITH MACRON
1624 04EF ; PVALID # CYRILLIC SMALL LETTER U WITH MACRON
1625 04F0 ; DISALLOWED # CYRILLIC CAPITAL LETTER U WITH DIAERESIS
1626 04F1 ; PVALID # CYRILLIC SMALL LETTER U WITH DIAERESIS
1627 04F2 ; DISALLOWED # CYRILLIC CAPITAL LETTER U WITH DOUBLE ACUTE
1628 04F3 ; PVALID # CYRILLIC SMALL LETTER U WITH DOUBLE ACUTE
1629 04F4 ; DISALLOWED # CYRILLIC CAPITAL LETTER CHE WITH DIAERESIS
1630 04F5 ; PVALID # CYRILLIC SMALL LETTER CHE WITH DIAERESIS
1631 04F6 ; DISALLOWED # CYRILLIC CAPITAL LETTER GHE WITH DESCENDER
1632 04F7 ; PVALID # CYRILLIC SMALL LETTER GHE WITH DESCENDER
1633 04F8 ; DISALLOWED # CYRILLIC CAPITAL LETTER YERU WITH DIAERESIS
1634 04F9 ; PVALID # CYRILLIC SMALL LETTER YERU WITH DIAERESIS
1635 04FA ; DISALLOWED # CYRILLIC CAPITAL LETTER GHE WITH STROKE AND
1636 04FB ; PVALID # CYRILLIC SMALL LETTER GHE WITH STROKE AND HO
1637 04FC ; DISALLOWED # CYRILLIC CAPITAL LETTER HA WITH HOOK
1638 04FD ; PVALID # CYRILLIC SMALL LETTER HA WITH HOOK
1639 04FE ; DISALLOWED # CYRILLIC CAPITAL LETTER HA WITH STROKE
1640 04FF ; PVALID # CYRILLIC SMALL LETTER HA WITH STROKE
1641 0500 ; DISALLOWED # CYRILLIC CAPITAL LETTER KOMI DE
1642 0501 ; PVALID # CYRILLIC SMALL LETTER KOMI DE
1643 0502 ; DISALLOWED # CYRILLIC CAPITAL LETTER KOMI DJE
1644
1645
1646
1647 Faltstrom Standards Track [Page 30]
1648 RFC 5892 IDNA Code Points August 2010
1649
1650
1651 0503 ; PVALID # CYRILLIC SMALL LETTER KOMI DJE
1652 0504 ; DISALLOWED # CYRILLIC CAPITAL LETTER KOMI ZJE
1653 0505 ; PVALID # CYRILLIC SMALL LETTER KOMI ZJE
1654 0506 ; DISALLOWED # CYRILLIC CAPITAL LETTER KOMI DZJE
1655 0507 ; PVALID # CYRILLIC SMALL LETTER KOMI DZJE
1656 0508 ; DISALLOWED # CYRILLIC CAPITAL LETTER KOMI LJE
1657 0509 ; PVALID # CYRILLIC SMALL LETTER KOMI LJE
1658 050A ; DISALLOWED # CYRILLIC CAPITAL LETTER KOMI NJE
1659 050B ; PVALID # CYRILLIC SMALL LETTER KOMI NJE
1660 050C ; DISALLOWED # CYRILLIC CAPITAL LETTER KOMI SJE
1661 050D ; PVALID # CYRILLIC SMALL LETTER KOMI SJE
1662 050E ; DISALLOWED # CYRILLIC CAPITAL LETTER KOMI TJE
1663 050F ; PVALID # CYRILLIC SMALL LETTER KOMI TJE
1664 0510 ; DISALLOWED # CYRILLIC CAPITAL LETTER REVERSED ZE
1665 0511 ; PVALID # CYRILLIC SMALL LETTER REVERSED ZE
1666 0512 ; DISALLOWED # CYRILLIC CAPITAL LETTER EL WITH HOOK
1667 0513 ; PVALID # CYRILLIC SMALL LETTER EL WITH HOOK
1668 0514 ; DISALLOWED # CYRILLIC CAPITAL LETTER LHA
1669 0515 ; PVALID # CYRILLIC SMALL LETTER LHA
1670 0516 ; DISALLOWED # CYRILLIC CAPITAL LETTER RHA
1671 0517 ; PVALID # CYRILLIC SMALL LETTER RHA
1672 0518 ; DISALLOWED # CYRILLIC CAPITAL LETTER YAE
1673 0519 ; PVALID # CYRILLIC SMALL LETTER YAE
1674 051A ; DISALLOWED # CYRILLIC CAPITAL LETTER QA
1675 051B ; PVALID # CYRILLIC SMALL LETTER QA
1676 051C ; DISALLOWED # CYRILLIC CAPITAL LETTER WE
1677 051D ; PVALID # CYRILLIC SMALL LETTER WE
1678 051E ; DISALLOWED # CYRILLIC CAPITAL LETTER ALEUT KA
1679 051F ; PVALID # CYRILLIC SMALL LETTER ALEUT KA
1680 0520 ; DISALLOWED # CYRILLIC CAPITAL LETTER EL WITH MIDDLE HOOK
1681 0521 ; PVALID # CYRILLIC SMALL LETTER EL WITH MIDDLE HOOK
1682 0522 ; DISALLOWED # CYRILLIC CAPITAL LETTER EN WITH MIDDLE HOOK
1683 0523 ; PVALID # CYRILLIC SMALL LETTER EN WITH MIDDLE HOOK
1684 0524 ; DISALLOWED # CYRILLIC CAPITAL LETTER PE WITH DESCENDER
1685 0525 ; PVALID # CYRILLIC SMALL LETTER PE WITH DESCENDER
1686 0526..0530 ; UNASSIGNED # <reserved>..<reserved>
1687 0531..0556 ; DISALLOWED # ARMENIAN CAPITAL LETTER AYB..ARMENIAN CAPITA
1688 0557..0558 ; UNASSIGNED # <reserved>..<reserved>
1689 0559 ; PVALID # ARMENIAN MODIFIER LETTER LEFT HALF RING
1690 055A..055F ; DISALLOWED # ARMENIAN APOSTROPHE..ARMENIAN ABBREVIATION M
1691 0560 ; UNASSIGNED # <reserved>
1692 0561..0586 ; PVALID # ARMENIAN SMALL LETTER AYB..ARMENIAN SMALL LE
1693 0587 ; DISALLOWED # ARMENIAN SMALL LIGATURE ECH YIWN
1694 0588 ; UNASSIGNED # <reserved>
1695 0589..058A ; DISALLOWED # ARMENIAN FULL STOP..ARMENIAN HYPHEN
1696 058B..0590 ; UNASSIGNED # <reserved>..<reserved>
1697 0591..05BD ; PVALID # HEBREW ACCENT ETNAHTA..HEBREW POINT METEG
1698 05BE ; DISALLOWED # HEBREW PUNCTUATION MAQAF
1699
1700
1701
1702 Faltstrom Standards Track [Page 31]
1703 RFC 5892 IDNA Code Points August 2010
1704
1705
1706 05BF ; PVALID # HEBREW POINT RAFE
1707 05C0 ; DISALLOWED # HEBREW PUNCTUATION PASEQ
1708 05C1..05C2 ; PVALID # HEBREW POINT SHIN DOT..HEBREW POINT SIN DOT
1709 05C3 ; DISALLOWED # HEBREW PUNCTUATION SOF PASUQ
1710 05C4..05C5 ; PVALID # HEBREW MARK UPPER DOT..HEBREW MARK LOWER DOT
1711 05C6 ; DISALLOWED # HEBREW PUNCTUATION NUN HAFUKHA
1712 05C7 ; PVALID # HEBREW POINT QAMATS QATAN
1713 05C8..05CF ; UNASSIGNED # <reserved>..<reserved>
1714 05D0..05EA ; PVALID # HEBREW LETTER ALEF..HEBREW LETTER TAV
1715 05EB..05EF ; UNASSIGNED # <reserved>..<reserved>
1716 05F0..05F2 ; PVALID # HEBREW LIGATURE YIDDISH DOUBLE VAV..HEBREW L
1717 05F3..05F4 ; CONTEXTO # HEBREW PUNCTUATION GERESH..HEBREW PUNCTUATIO
1718 05F5..05FF ; UNASSIGNED # <reserved>..<reserved>
1719 0600..0603 ; DISALLOWED # ARABIC NUMBER SIGN..ARABIC SIGN SAFHA
1720 0604..0605 ; UNASSIGNED # <reserved>..<reserved>
1721 0606..060F ; DISALLOWED # ARABIC-INDIC CUBE ROOT..ARABIC SIGN MISRA
1722 0610..061A ; PVALID # ARABIC SIGN SALLALLAHOU ALAYHE WASSALLAM..AR
1723 061B ; DISALLOWED # ARABIC SEMICOLON
1724 061C..061D ; UNASSIGNED # <reserved>..<reserved>
1725 061E..061F ; DISALLOWED # ARABIC TRIPLE DOT PUNCTUATION MARK..ARABIC Q
1726 0620 ; UNASSIGNED # <reserved>
1727 0621..063F ; PVALID # ARABIC LETTER HAMZA..ARABIC LETTER FARSI YEH
1728 0640 ; DISALLOWED # ARABIC TATWEEL
1729 0641..065E ; PVALID # ARABIC LETTER FEH..ARABIC FATHA WITH TWO DOT
1730 065F ; UNASSIGNED # <reserved>
1731 0660..0669 ; CONTEXTO # ARABIC-INDIC DIGIT ZERO..ARABIC-INDIC DIGIT
1732 066A..066D ; DISALLOWED # ARABIC PERCENT SIGN..ARABIC FIVE POINTED STA
1733 066E..0674 ; PVALID # ARABIC LETTER DOTLESS BEH..ARABIC LETTER HIG
1734 0675..0678 ; DISALLOWED # ARABIC LETTER HIGH HAMZA ALEF..ARABIC LETTER
1735 0679..06D3 ; PVALID # ARABIC LETTER TTEH..ARABIC LETTER YEH BARREE
1736 06D4 ; DISALLOWED # ARABIC FULL STOP
1737 06D5..06DC ; PVALID # ARABIC LETTER AE..ARABIC SMALL HIGH SEEN
1738 06DD..06DE ; DISALLOWED # ARABIC END OF AYAH..ARABIC START OF RUB EL H
1739 06DF..06E8 ; PVALID # ARABIC SMALL HIGH ROUNDED ZERO..ARABIC SMALL
1740 06E9 ; DISALLOWED # ARABIC PLACE OF SAJDAH
1741 06EA..06EF ; PVALID # ARABIC EMPTY CENTRE LOW STOP..ARABIC LETTER
1742 06F0..06F9 ; CONTEXTO # EXTENDED ARABIC-INDIC DIGIT ZERO..EXTENDED A
1743 06FA..06FF ; PVALID # ARABIC LETTER SHEEN WITH DOT BELOW..ARABIC L
1744 0700..070D ; DISALLOWED # SYRIAC END OF PARAGRAPH..SYRIAC HARKLEAN AST
1745 070E ; UNASSIGNED # <reserved>
1746 070F ; DISALLOWED # SYRIAC ABBREVIATION MARK
1747 0710..074A ; PVALID # SYRIAC LETTER ALAPH..SYRIAC BARREKH
1748 074B..074C ; UNASSIGNED # <reserved>..<reserved>
1749 074D..07B1 ; PVALID # SYRIAC LETTER SOGDIAN ZHAIN..THAANA LETTER N
1750 07B2..07BF ; UNASSIGNED # <reserved>..<reserved>
1751 07C0..07F5 ; PVALID # NKO DIGIT ZERO..NKO LOW TONE APOSTROPHE
1752 07F6..07FA ; DISALLOWED # NKO SYMBOL OO DENNEN..NKO LAJANYALAN
1753 07FB..07FF ; UNASSIGNED # <reserved>..<reserved>
1754
1755
1756
1757 Faltstrom Standards Track [Page 32]
1758 RFC 5892 IDNA Code Points August 2010
1759
1760
1761 0800..082D ; PVALID # SAMARITAN LETTER ALAF..SAMARITAN MARK NEQUDA
1762 082E..082F ; UNASSIGNED # <reserved>..<reserved>
1763 0830..083E ; DISALLOWED # SAMARITAN PUNCTUATION NEQUDAA..SAMARITAN PUN
1764 083F..08FF ; UNASSIGNED # <reserved>..<reserved>
1765 0900..0939 ; PVALID # DEVANAGARI SIGN INVERTED CANDRABINDU..DEVANA
1766 093A..093B ; UNASSIGNED # <reserved>..<reserved>
1767 093C..094E ; PVALID # DEVANAGARI SIGN NUKTA..DEVANAGARI VOWEL SIGN
1768 094F ; UNASSIGNED # <reserved>
1769 0950..0955 ; PVALID # DEVANAGARI OM..DEVANAGARI VOWEL SIGN CANDRA
1770 0956..0957 ; UNASSIGNED # <reserved>..<reserved>
1771 0958..095F ; DISALLOWED # DEVANAGARI LETTER QA..DEVANAGARI LETTER YYA
1772 0960..0963 ; PVALID # DEVANAGARI LETTER VOCALIC RR..DEVANAGARI VOW
1773 0964..0965 ; DISALLOWED # DEVANAGARI DANDA..DEVANAGARI DOUBLE DANDA
1774 0966..096F ; PVALID # DEVANAGARI DIGIT ZERO..DEVANAGARI DIGIT NINE
1775 0970 ; DISALLOWED # DEVANAGARI ABBREVIATION SIGN
1776 0971..0972 ; PVALID # DEVANAGARI SIGN HIGH SPACING DOT..DEVANAGARI
1777 0973..0978 ; UNASSIGNED # <reserved>..<reserved>
1778 0979..097F ; PVALID # DEVANAGARI LETTER ZHA..DEVANAGARI LETTER BBA
1779 0980 ; UNASSIGNED # <reserved>
1780 0981..0983 ; PVALID # BENGALI SIGN CANDRABINDU..BENGALI SIGN VISAR
1781 0984 ; UNASSIGNED # <reserved>
1782 0985..098C ; PVALID # BENGALI LETTER A..BENGALI LETTER VOCALIC L
1783 098D..098E ; UNASSIGNED # <reserved>..<reserved>
1784 098F..0990 ; PVALID # BENGALI LETTER E..BENGALI LETTER AI
1785 0991..0992 ; UNASSIGNED # <reserved>..<reserved>
1786 0993..09A8 ; PVALID # BENGALI LETTER O..BENGALI LETTER NA
1787 09A9 ; UNASSIGNED # <reserved>
1788 09AA..09B0 ; PVALID # BENGALI LETTER PA..BENGALI LETTER RA
1789 09B1 ; UNASSIGNED # <reserved>
1790 09B2 ; PVALID # BENGALI LETTER LA
1791 09B3..09B5 ; UNASSIGNED # <reserved>..<reserved>
1792 09B6..09B9 ; PVALID # BENGALI LETTER SHA..BENGALI LETTER HA
1793 09BA..09BB ; UNASSIGNED # <reserved>..<reserved>
1794 09BC..09C4 ; PVALID # BENGALI SIGN NUKTA..BENGALI VOWEL SIGN VOCAL
1795 09C5..09C6 ; UNASSIGNED # <reserved>..<reserved>
1796 09C7..09C8 ; PVALID # BENGALI VOWEL SIGN E..BENGALI VOWEL SIGN AI
1797 09C9..09CA ; UNASSIGNED # <reserved>..<reserved>
1798 09CB..09CE ; PVALID # BENGALI VOWEL SIGN O..BENGALI LETTER KHANDA
1799 09CF..09D6 ; UNASSIGNED # <reserved>..<reserved>
1800 09D7 ; PVALID # BENGALI AU LENGTH MARK
1801 09D8..09DB ; UNASSIGNED # <reserved>..<reserved>
1802 09DC..09DD ; DISALLOWED # BENGALI LETTER RRA..BENGALI LETTER RHA
1803 09DE ; UNASSIGNED # <reserved>
1804 09DF ; DISALLOWED # BENGALI LETTER YYA
1805 09E0..09E3 ; PVALID # BENGALI LETTER VOCALIC RR..BENGALI VOWEL SIG
1806 09E4..09E5 ; UNASSIGNED # <reserved>..<reserved>
1807 09E6..09F1 ; PVALID # BENGALI DIGIT ZERO..BENGALI LETTER RA WITH L
1808 09F2..09FB ; DISALLOWED # BENGALI RUPEE MARK..BENGALI GANDA MARK
1809
1810
1811
1812 Faltstrom Standards Track [Page 33]
1813 RFC 5892 IDNA Code Points August 2010
1814
1815
1816 09FC..0A00 ; UNASSIGNED # <reserved>..<reserved>
1817 0A01..0A03 ; PVALID # GURMUKHI SIGN ADAK BINDI..GURMUKHI SIGN VISA
1818 0A04 ; UNASSIGNED # <reserved>
1819 0A05..0A0A ; PVALID # GURMUKHI LETTER A..GURMUKHI LETTER UU
1820 0A0B..0A0E ; UNASSIGNED # <reserved>..<reserved>
1821 0A0F..0A10 ; PVALID # GURMUKHI LETTER EE..GURMUKHI LETTER AI
1822 0A11..0A12 ; UNASSIGNED # <reserved>..<reserved>
1823 0A13..0A28 ; PVALID # GURMUKHI LETTER OO..GURMUKHI LETTER NA
1824 0A29 ; UNASSIGNED # <reserved>
1825 0A2A..0A30 ; PVALID # GURMUKHI LETTER PA..GURMUKHI LETTER RA
1826 0A31 ; UNASSIGNED # <reserved>
1827 0A32 ; PVALID # GURMUKHI LETTER LA
1828 0A33 ; DISALLOWED # GURMUKHI LETTER LLA
1829 0A34 ; UNASSIGNED # <reserved>
1830 0A35 ; PVALID # GURMUKHI LETTER VA
1831 0A36 ; DISALLOWED # GURMUKHI LETTER SHA
1832 0A37 ; UNASSIGNED # <reserved>
1833 0A38..0A39 ; PVALID # GURMUKHI LETTER SA..GURMUKHI LETTER HA
1834 0A3A..0A3B ; UNASSIGNED # <reserved>..<reserved>
1835 0A3C ; PVALID # GURMUKHI SIGN NUKTA
1836 0A3D ; UNASSIGNED # <reserved>
1837 0A3E..0A42 ; PVALID # GURMUKHI VOWEL SIGN AA..GURMUKHI VOWEL SIGN
1838 0A43..0A46 ; UNASSIGNED # <reserved>..<reserved>
1839 0A47..0A48 ; PVALID # GURMUKHI VOWEL SIGN EE..GURMUKHI VOWEL SIGN
1840 0A49..0A4A ; UNASSIGNED # <reserved>..<reserved>
1841 0A4B..0A4D ; PVALID # GURMUKHI VOWEL SIGN OO..GURMUKHI SIGN VIRAMA
1842 0A4E..0A50 ; UNASSIGNED # <reserved>..<reserved>
1843 0A51 ; PVALID # GURMUKHI SIGN UDAAT
1844 0A52..0A58 ; UNASSIGNED # <reserved>..<reserved>
1845 0A59..0A5B ; DISALLOWED # GURMUKHI LETTER KHHA..GURMUKHI LETTER ZA
1846 0A5C ; PVALID # GURMUKHI LETTER RRA
1847 0A5D ; UNASSIGNED # <reserved>
1848 0A5E ; DISALLOWED # GURMUKHI LETTER FA
1849 0A5F..0A65 ; UNASSIGNED # <reserved>..<reserved>
1850 0A66..0A75 ; PVALID # GURMUKHI DIGIT ZERO..GURMUKHI SIGN YAKASH
1851 0A76..0A80 ; UNASSIGNED # <reserved>..<reserved>
1852 0A81..0A83 ; PVALID # GUJARATI SIGN CANDRABINDU..GUJARATI SIGN VIS
1853 0A84 ; UNASSIGNED # <reserved>
1854 0A85..0A8D ; PVALID # GUJARATI LETTER A..GUJARATI VOWEL CANDRA E
1855 0A8E ; UNASSIGNED # <reserved>
1856 0A8F..0A91 ; PVALID # GUJARATI LETTER E..GUJARATI VOWEL CANDRA O
1857 0A92 ; UNASSIGNED # <reserved>
1858 0A93..0AA8 ; PVALID # GUJARATI LETTER O..GUJARATI LETTER NA
1859 0AA9 ; UNASSIGNED # <reserved>
1860 0AAA..0AB0 ; PVALID # GUJARATI LETTER PA..GUJARATI LETTER RA
1861 0AB1 ; UNASSIGNED # <reserved>
1862 0AB2..0AB3 ; PVALID # GUJARATI LETTER LA..GUJARATI LETTER LLA
1863 0AB4 ; UNASSIGNED # <reserved>
1864
1865
1866
1867 Faltstrom Standards Track [Page 34]
1868 RFC 5892 IDNA Code Points August 2010
1869
1870
1871 0AB5..0AB9 ; PVALID # GUJARATI LETTER VA..GUJARATI LETTER HA
1872 0ABA..0ABB ; UNASSIGNED # <reserved>..<reserved>
1873 0ABC..0AC5 ; PVALID # GUJARATI SIGN NUKTA..GUJARATI VOWEL SIGN CAN
1874 0AC6 ; UNASSIGNED # <reserved>
1875 0AC7..0AC9 ; PVALID # GUJARATI VOWEL SIGN E..GUJARATI VOWEL SIGN C
1876 0ACA ; UNASSIGNED # <reserved>
1877 0ACB..0ACD ; PVALID # GUJARATI VOWEL SIGN O..GUJARATI SIGN VIRAMA
1878 0ACE..0ACF ; UNASSIGNED # <reserved>..<reserved>
1879 0AD0 ; PVALID # GUJARATI OM
1880 0AD1..0ADF ; UNASSIGNED # <reserved>..<reserved>
1881 0AE0..0AE3 ; PVALID # GUJARATI LETTER VOCALIC RR..GUJARATI VOWEL S
1882 0AE4..0AE5 ; UNASSIGNED # <reserved>..<reserved>
1883 0AE6..0AEF ; PVALID # GUJARATI DIGIT ZERO..GUJARATI DIGIT NINE
1884 0AF0 ; UNASSIGNED # <reserved>
1885 0AF1 ; DISALLOWED # GUJARATI RUPEE SIGN
1886 0AF2..0B00 ; UNASSIGNED # <reserved>..<reserved>
1887 0B01..0B03 ; PVALID # ORIYA SIGN CANDRABINDU..ORIYA SIGN VISARGA
1888 0B04 ; UNASSIGNED # <reserved>
1889 0B05..0B0C ; PVALID # ORIYA LETTER A..ORIYA LETTER VOCALIC L
1890 0B0D..0B0E ; UNASSIGNED # <reserved>..<reserved>
1891 0B0F..0B10 ; PVALID # ORIYA LETTER E..ORIYA LETTER AI
1892 0B11..0B12 ; UNASSIGNED # <reserved>..<reserved>
1893 0B13..0B28 ; PVALID # ORIYA LETTER O..ORIYA LETTER NA
1894 0B29 ; UNASSIGNED # <reserved>
1895 0B2A..0B30 ; PVALID # ORIYA LETTER PA..ORIYA LETTER RA
1896 0B31 ; UNASSIGNED # <reserved>
1897 0B32..0B33 ; PVALID # ORIYA LETTER LA..ORIYA LETTER LLA
1898 0B34 ; UNASSIGNED # <reserved>
1899 0B35..0B39 ; PVALID # ORIYA LETTER VA..ORIYA LETTER HA
1900 0B3A..0B3B ; UNASSIGNED # <reserved>..<reserved>
1901 0B3C..0B44 ; PVALID # ORIYA SIGN NUKTA..ORIYA VOWEL SIGN VOCALIC R
1902 0B45..0B46 ; UNASSIGNED # <reserved>..<reserved>
1903 0B47..0B48 ; PVALID # ORIYA VOWEL SIGN E..ORIYA VOWEL SIGN AI
1904 0B49..0B4A ; UNASSIGNED # <reserved>..<reserved>
1905 0B4B..0B4D ; PVALID # ORIYA VOWEL SIGN O..ORIYA SIGN VIRAMA
1906 0B4E..0B55 ; UNASSIGNED # <reserved>..<reserved>
1907 0B56..0B57 ; PVALID # ORIYA AI LENGTH MARK..ORIYA AU LENGTH MARK
1908 0B58..0B5B ; UNASSIGNED # <reserved>..<reserved>
1909 0B5C..0B5D ; DISALLOWED # ORIYA LETTER RRA..ORIYA LETTER RHA
1910 0B5E ; UNASSIGNED # <reserved>
1911 0B5F..0B63 ; PVALID # ORIYA LETTER YYA..ORIYA VOWEL SIGN VOCALIC L
1912 0B64..0B65 ; UNASSIGNED # <reserved>..<reserved>
1913 0B66..0B6F ; PVALID # ORIYA DIGIT ZERO..ORIYA DIGIT NINE
1914 0B70 ; DISALLOWED # ORIYA ISSHAR
1915 0B71 ; PVALID # ORIYA LETTER WA
1916 0B72..0B81 ; UNASSIGNED # <reserved>..<reserved>
1917 0B82..0B83 ; PVALID # TAMIL SIGN ANUSVARA..TAMIL SIGN VISARGA
1918 0B84 ; UNASSIGNED # <reserved>
1919
1920
1921
1922 Faltstrom Standards Track [Page 35]
1923 RFC 5892 IDNA Code Points August 2010
1924
1925
1926 0B85..0B8A ; PVALID # TAMIL LETTER A..TAMIL LETTER UU
1927 0B8B..0B8D ; UNASSIGNED # <reserved>..<reserved>
1928 0B8E..0B90 ; PVALID # TAMIL LETTER E..TAMIL LETTER AI
1929 0B91 ; UNASSIGNED # <reserved>
1930 0B92..0B95 ; PVALID # TAMIL LETTER O..TAMIL LETTER KA
1931 0B96..0B98 ; UNASSIGNED # <reserved>..<reserved>
1932 0B99..0B9A ; PVALID # TAMIL LETTER NGA..TAMIL LETTER CA
1933 0B9B ; UNASSIGNED # <reserved>
1934 0B9C ; PVALID # TAMIL LETTER JA
1935 0B9D ; UNASSIGNED # <reserved>
1936 0B9E..0B9F ; PVALID # TAMIL LETTER NYA..TAMIL LETTER TTA
1937 0BA0..0BA2 ; UNASSIGNED # <reserved>..<reserved>
1938 0BA3..0BA4 ; PVALID # TAMIL LETTER NNA..TAMIL LETTER TA
1939 0BA5..0BA7 ; UNASSIGNED # <reserved>..<reserved>
1940 0BA8..0BAA ; PVALID # TAMIL LETTER NA..TAMIL LETTER PA
1941 0BAB..0BAD ; UNASSIGNED # <reserved>..<reserved>
1942 0BAE..0BB9 ; PVALID # TAMIL LETTER MA..TAMIL LETTER HA
1943 0BBA..0BBD ; UNASSIGNED # <reserved>..<reserved>
1944 0BBE..0BC2 ; PVALID # TAMIL VOWEL SIGN AA..TAMIL VOWEL SIGN UU
1945 0BC3..0BC5 ; UNASSIGNED # <reserved>..<reserved>
1946 0BC6..0BC8 ; PVALID # TAMIL VOWEL SIGN E..TAMIL VOWEL SIGN AI
1947 0BC9 ; UNASSIGNED # <reserved>
1948 0BCA..0BCD ; PVALID # TAMIL VOWEL SIGN O..TAMIL SIGN VIRAMA
1949 0BCE..0BCF ; UNASSIGNED # <reserved>..<reserved>
1950 0BD0 ; PVALID # TAMIL OM
1951 0BD1..0BD6 ; UNASSIGNED # <reserved>..<reserved>
1952 0BD7 ; PVALID # TAMIL AU LENGTH MARK
1953 0BD8..0BE5 ; UNASSIGNED # <reserved>..<reserved>
1954 0BE6..0BEF ; PVALID # TAMIL DIGIT ZERO..TAMIL DIGIT NINE
1955 0BF0..0BFA ; DISALLOWED # TAMIL NUMBER TEN..TAMIL NUMBER SIGN
1956 0BFB..0C00 ; UNASSIGNED # <reserved>..<reserved>
1957 0C01..0C03 ; PVALID # TELUGU SIGN CANDRABINDU..TELUGU SIGN VISARGA
1958 0C04 ; UNASSIGNED # <reserved>
1959 0C05..0C0C ; PVALID # TELUGU LETTER A..TELUGU LETTER VOCALIC L
1960 0C0D ; UNASSIGNED # <reserved>
1961 0C0E..0C10 ; PVALID # TELUGU LETTER E..TELUGU LETTER AI
1962 0C11 ; UNASSIGNED # <reserved>
1963 0C12..0C28 ; PVALID # TELUGU LETTER O..TELUGU LETTER NA
1964 0C29 ; UNASSIGNED # <reserved>
1965 0C2A..0C33 ; PVALID # TELUGU LETTER PA..TELUGU LETTER LLA
1966 0C34 ; UNASSIGNED # <reserved>
1967 0C35..0C39 ; PVALID # TELUGU LETTER VA..TELUGU LETTER HA
1968 0C3A..0C3C ; UNASSIGNED # <reserved>..<reserved>
1969 0C3D..0C44 ; PVALID # TELUGU SIGN AVAGRAHA..TELUGU VOWEL SIGN VOCA
1970 0C45 ; UNASSIGNED # <reserved>
1971 0C46..0C48 ; PVALID # TELUGU VOWEL SIGN E..TELUGU VOWEL SIGN AI
1972 0C49 ; UNASSIGNED # <reserved>
1973 0C4A..0C4D ; PVALID # TELUGU VOWEL SIGN O..TELUGU SIGN VIRAMA
1974
1975
1976
1977 Faltstrom Standards Track [Page 36]
1978 RFC 5892 IDNA Code Points August 2010
1979
1980
1981 0C4E..0C54 ; UNASSIGNED # <reserved>..<reserved>
1982 0C55..0C56 ; PVALID # TELUGU LENGTH MARK..TELUGU AI LENGTH MARK
1983 0C57 ; UNASSIGNED # <reserved>
1984 0C58..0C59 ; PVALID # TELUGU LETTER TSA..TELUGU LETTER DZA
1985 0C5A..0C5F ; UNASSIGNED # <reserved>..<reserved>
1986 0C60..0C63 ; PVALID # TELUGU LETTER VOCALIC RR..TELUGU VOWEL SIGN
1987 0C64..0C65 ; UNASSIGNED # <reserved>..<reserved>
1988 0C66..0C6F ; PVALID # TELUGU DIGIT ZERO..TELUGU DIGIT NINE
1989 0C70..0C77 ; UNASSIGNED # <reserved>..<reserved>
1990 0C78..0C7F ; DISALLOWED # TELUGU FRACTION DIGIT ZERO FOR ODD POWERS OF
1991 0C80..0C81 ; UNASSIGNED # <reserved>..<reserved>
1992 0C82..0C83 ; PVALID # KANNADA SIGN ANUSVARA..KANNADA SIGN VISARGA
1993 0C84 ; UNASSIGNED # <reserved>
1994 0C85..0C8C ; PVALID # KANNADA LETTER A..KANNADA LETTER VOCALIC L
1995 0C8D ; UNASSIGNED # <reserved>
1996 0C8E..0C90 ; PVALID # KANNADA LETTER E..KANNADA LETTER AI
1997 0C91 ; UNASSIGNED # <reserved>
1998 0C92..0CA8 ; PVALID # KANNADA LETTER O..KANNADA LETTER NA
1999 0CA9 ; UNASSIGNED # <reserved>
2000 0CAA..0CB3 ; PVALID # KANNADA LETTER PA..KANNADA LETTER LLA
2001 0CB4 ; UNASSIGNED # <reserved>
2002 0CB5..0CB9 ; PVALID # KANNADA LETTER VA..KANNADA LETTER HA
2003 0CBA..0CBB ; UNASSIGNED # <reserved>..<reserved>
2004 0CBC..0CC4 ; PVALID # KANNADA SIGN NUKTA..KANNADA VOWEL SIGN VOCAL
2005 0CC5 ; UNASSIGNED # <reserved>
2006 0CC6..0CC8 ; PVALID # KANNADA VOWEL SIGN E..KANNADA VOWEL SIGN AI
2007 0CC9 ; UNASSIGNED # <reserved>
2008 0CCA..0CCD ; PVALID # KANNADA VOWEL SIGN O..KANNADA SIGN VIRAMA
2009 0CCE..0CD4 ; UNASSIGNED # <reserved>..<reserved>
2010 0CD5..0CD6 ; PVALID # KANNADA LENGTH MARK..KANNADA AI LENGTH MARK
2011 0CD7..0CDD ; UNASSIGNED # <reserved>..<reserved>
2012 0CDE ; PVALID # KANNADA LETTER FA
2013 0CDF ; UNASSIGNED # <reserved>
2014 0CE0..0CE3 ; PVALID # KANNADA LETTER VOCALIC RR..KANNADA VOWEL SIG
2015 0CE4..0CE5 ; UNASSIGNED # <reserved>..<reserved>
2016 0CE6..0CEF ; PVALID # KANNADA DIGIT ZERO..KANNADA DIGIT NINE
2017 0CF0 ; UNASSIGNED # <reserved>
2018 0CF1..0CF2 ; DISALLOWED # KANNADA SIGN JIHVAMULIYA..KANNADA SIGN UPADH
2019 0CF3..0D01 ; UNASSIGNED # <reserved>..<reserved>
2020 0D02..0D03 ; PVALID # MALAYALAM SIGN ANUSVARA..MALAYALAM SIGN VISA
2021 0D04 ; UNASSIGNED # <reserved>
2022 0D05..0D0C ; PVALID # MALAYALAM LETTER A..MALAYALAM LETTER VOCALIC
2023 0D0D ; UNASSIGNED # <reserved>
2024 0D0E..0D10 ; PVALID # MALAYALAM LETTER E..MALAYALAM LETTER AI
2025 0D11 ; UNASSIGNED # <reserved>
2026 0D12..0D28 ; PVALID # MALAYALAM LETTER O..MALAYALAM LETTER NA
2027 0D29 ; UNASSIGNED # <reserved>
2028 0D2A..0D39 ; PVALID # MALAYALAM LETTER PA..MALAYALAM LETTER HA
2029
2030
2031
2032 Faltstrom Standards Track [Page 37]
2033 RFC 5892 IDNA Code Points August 2010
2034
2035
2036 0D3A..0D3C ; UNASSIGNED # <reserved>..<reserved>
2037 0D3D..0D44 ; PVALID # MALAYALAM SIGN AVAGRAHA..MALAYALAM VOWEL SIG
2038 0D45 ; UNASSIGNED # <reserved>
2039 0D46..0D48 ; PVALID # MALAYALAM VOWEL SIGN E..MALAYALAM VOWEL SIGN
2040 0D49 ; UNASSIGNED # <reserved>
2041 0D4A..0D4D ; PVALID # MALAYALAM VOWEL SIGN O..MALAYALAM SIGN VIRAM
2042 0D4E..0D56 ; UNASSIGNED # <reserved>..<reserved>
2043 0D57 ; PVALID # MALAYALAM AU LENGTH MARK
2044 0D58..0D5F ; UNASSIGNED # <reserved>..<reserved>
2045 0D60..0D63 ; PVALID # MALAYALAM LETTER VOCALIC RR..MALAYALAM VOWEL
2046 0D64..0D65 ; UNASSIGNED # <reserved>..<reserved>
2047 0D66..0D6F ; PVALID # MALAYALAM DIGIT ZERO..MALAYALAM DIGIT NINE
2048 0D70..0D75 ; DISALLOWED # MALAYALAM NUMBER TEN..MALAYALAM FRACTION THR
2049 0D76..0D78 ; UNASSIGNED # <reserved>..<reserved>
2050 0D79 ; DISALLOWED # MALAYALAM DATE MARK
2051 0D7A..0D7F ; PVALID # MALAYALAM LETTER CHILLU NN..MALAYALAM LETTER
2052 0D80..0D81 ; UNASSIGNED # <reserved>..<reserved>
2053 0D82..0D83 ; PVALID # SINHALA SIGN ANUSVARAYA..SINHALA SIGN VISARG
2054 0D84 ; UNASSIGNED # <reserved>
2055 0D85..0D96 ; PVALID # SINHALA LETTER AYANNA..SINHALA LETTER AUYANN
2056 0D97..0D99 ; UNASSIGNED # <reserved>..<reserved>
2057 0D9A..0DB1 ; PVALID # SINHALA LETTER ALPAPRAANA KAYANNA..SINHALA L
2058 0DB2 ; UNASSIGNED # <reserved>
2059 0DB3..0DBB ; PVALID # SINHALA LETTER SANYAKA DAYANNA..SINHALA LETT
2060 0DBC ; UNASSIGNED # <reserved>
2061 0DBD ; PVALID # SINHALA LETTER DANTAJA LAYANNA
2062 0DBE..0DBF ; UNASSIGNED # <reserved>..<reserved>
2063 0DC0..0DC6 ; PVALID # SINHALA LETTER VAYANNA..SINHALA LETTER FAYAN
2064 0DC7..0DC9 ; UNASSIGNED # <reserved>..<reserved>
2065 0DCA ; PVALID # SINHALA SIGN AL-LAKUNA
2066 0DCB..0DCE ; UNASSIGNED # <reserved>..<reserved>
2067 0DCF..0DD4 ; PVALID # SINHALA VOWEL SIGN AELA-PILLA..SINHALA VOWEL
2068 0DD5 ; UNASSIGNED # <reserved>
2069 0DD6 ; PVALID # SINHALA VOWEL SIGN DIGA PAA-PILLA
2070 0DD7 ; UNASSIGNED # <reserved>
2071 0DD8..0DDF ; PVALID # SINHALA VOWEL SIGN GAETTA-PILLA..SINHALA VOW
2072 0DE0..0DF1 ; UNASSIGNED # <reserved>..<reserved>
2073 0DF2..0DF3 ; PVALID # SINHALA VOWEL SIGN DIGA GAETTA-PILLA..SINHAL
2074 0DF4 ; DISALLOWED # SINHALA PUNCTUATION KUNDDALIYA
2075 0DF5..0E00 ; UNASSIGNED # <reserved>..<reserved>
2076 0E01..0E32 ; PVALID # THAI CHARACTER KO KAI..THAI CHARACTER SARA A
2077 0E33 ; DISALLOWED # THAI CHARACTER SARA AM
2078 0E34..0E3A ; PVALID # THAI CHARACTER SARA I..THAI CHARACTER PHINTH
2079 0E3B..0E3E ; UNASSIGNED # <reserved>..<reserved>
2080 0E3F ; DISALLOWED # THAI CURRENCY SYMBOL BAHT
2081 0E40..0E4E ; PVALID # THAI CHARACTER SARA E..THAI CHARACTER YAMAKK
2082 0E4F ; DISALLOWED # THAI CHARACTER FONGMAN
2083 0E50..0E59 ; PVALID # THAI DIGIT ZERO..THAI DIGIT NINE
2084
2085
2086
2087 Faltstrom Standards Track [Page 38]
2088 RFC 5892 IDNA Code Points August 2010
2089
2090
2091 0E5A..0E5B ; DISALLOWED # THAI CHARACTER ANGKHANKHU..THAI CHARACTER KH
2092 0E5C..0E80 ; UNASSIGNED # <reserved>..<reserved>
2093 0E81..0E82 ; PVALID # LAO LETTER KO..LAO LETTER KHO SUNG
2094 0E83 ; UNASSIGNED # <reserved>
2095 0E84 ; PVALID # LAO LETTER KHO TAM
2096 0E85..0E86 ; UNASSIGNED # <reserved>..<reserved>
2097 0E87..0E88 ; PVALID # LAO LETTER NGO..LAO LETTER CO
2098 0E89 ; UNASSIGNED # <reserved>
2099 0E8A ; PVALID # LAO LETTER SO TAM
2100 0E8B..0E8C ; UNASSIGNED # <reserved>..<reserved>
2101 0E8D ; PVALID # LAO LETTER NYO
2102 0E8E..0E93 ; UNASSIGNED # <reserved>..<reserved>
2103 0E94..0E97 ; PVALID # LAO LETTER DO..LAO LETTER THO TAM
2104 0E98 ; UNASSIGNED # <reserved>
2105 0E99..0E9F ; PVALID # LAO LETTER NO..LAO LETTER FO SUNG
2106 0EA0 ; UNASSIGNED # <reserved>
2107 0EA1..0EA3 ; PVALID # LAO LETTER MO..LAO LETTER LO LING
2108 0EA4 ; UNASSIGNED # <reserved>
2109 0EA5 ; PVALID # LAO LETTER LO LOOT
2110 0EA6 ; UNASSIGNED # <reserved>
2111 0EA7 ; PVALID # LAO LETTER WO
2112 0EA8..0EA9 ; UNASSIGNED # <reserved>..<reserved>
2113 0EAA..0EAB ; PVALID # LAO LETTER SO SUNG..LAO LETTER HO SUNG
2114 0EAC ; UNASSIGNED # <reserved>
2115 0EAD..0EB2 ; PVALID # LAO LETTER O..LAO VOWEL SIGN AA
2116 0EB3 ; DISALLOWED # LAO VOWEL SIGN AM
2117 0EB4..0EB9 ; PVALID # LAO VOWEL SIGN I..LAO VOWEL SIGN UU
2118 0EBA ; UNASSIGNED # <reserved>
2119 0EBB..0EBD ; PVALID # LAO VOWEL SIGN MAI KON..LAO SEMIVOWEL SIGN N
2120 0EBE..0EBF ; UNASSIGNED # <reserved>..<reserved>
2121 0EC0..0EC4 ; PVALID # LAO VOWEL SIGN E..LAO VOWEL SIGN AI
2122 0EC5 ; UNASSIGNED # <reserved>
2123 0EC6 ; PVALID # LAO KO LA
2124 0EC7 ; UNASSIGNED # <reserved>
2125 0EC8..0ECD ; PVALID # LAO TONE MAI EK..LAO NIGGAHITA
2126 0ECE..0ECF ; UNASSIGNED # <reserved>..<reserved>
2127 0ED0..0ED9 ; PVALID # LAO DIGIT ZERO..LAO DIGIT NINE
2128 0EDA..0EDB ; UNASSIGNED # <reserved>..<reserved>
2129 0EDC..0EDD ; DISALLOWED # LAO HO NO..LAO HO MO
2130 0EDE..0EFF ; UNASSIGNED # <reserved>..<reserved>
2131 0F00 ; PVALID # TIBETAN SYLLABLE OM
2132 0F01..0F0A ; DISALLOWED # TIBETAN MARK GTER YIG MGO TRUNCATED A..TIBET
2133 0F0B ; PVALID # TIBETAN MARK INTERSYLLABIC TSHEG
2134 0F0C..0F17 ; DISALLOWED # TIBETAN MARK DELIMITER TSHEG BSTAR..TIBETAN
2135 0F18..0F19 ; PVALID # TIBETAN ASTROLOGICAL SIGN -KHYUD PA..TIBETAN
2136 0F1A..0F1F ; DISALLOWED # TIBETAN SIGN RDEL DKAR GCIG..TIBETAN SIGN RD
2137 0F20..0F29 ; PVALID # TIBETAN DIGIT ZERO..TIBETAN DIGIT NINE
2138 0F2A..0F34 ; DISALLOWED # TIBETAN DIGIT HALF ONE..TIBETAN MARK BSDUS R
2139
2140
2141
2142 Faltstrom Standards Track [Page 39]
2143 RFC 5892 IDNA Code Points August 2010
2144
2145
2146 0F35 ; PVALID # TIBETAN MARK NGAS BZUNG NYI ZLA
2147 0F36 ; DISALLOWED # TIBETAN MARK CARET -DZUD RTAGS BZHI MIG CAN
2148 0F37 ; PVALID # TIBETAN MARK NGAS BZUNG SGOR RTAGS
2149 0F38 ; DISALLOWED # TIBETAN MARK CHE MGO
2150 0F39 ; PVALID # TIBETAN MARK TSA -PHRU
2151 0F3A..0F3D ; DISALLOWED # TIBETAN MARK GUG RTAGS GYON..TIBETAN MARK AN
2152 0F3E..0F42 ; PVALID # TIBETAN SIGN YAR TSHES..TIBETAN LETTER GA
2153 0F43 ; DISALLOWED # TIBETAN LETTER GHA
2154 0F44..0F47 ; PVALID # TIBETAN LETTER NGA..TIBETAN LETTER JA
2155 0F48 ; UNASSIGNED # <reserved>
2156 0F49..0F4C ; PVALID # TIBETAN LETTER NYA..TIBETAN LETTER DDA
2157 0F4D ; DISALLOWED # TIBETAN LETTER DDHA
2158 0F4E..0F51 ; PVALID # TIBETAN LETTER NNA..TIBETAN LETTER DA
2159 0F52 ; DISALLOWED # TIBETAN LETTER DHA
2160 0F53..0F56 ; PVALID # TIBETAN LETTER NA..TIBETAN LETTER BA
2161 0F57 ; DISALLOWED # TIBETAN LETTER BHA
2162 0F58..0F5B ; PVALID # TIBETAN LETTER MA..TIBETAN LETTER DZA
2163 0F5C ; DISALLOWED # TIBETAN LETTER DZHA
2164 0F5D..0F68 ; PVALID # TIBETAN LETTER WA..TIBETAN LETTER A
2165 0F69 ; DISALLOWED # TIBETAN LETTER KSSA
2166 0F6A..0F6C ; PVALID # TIBETAN LETTER FIXED-FORM RA..TIBETAN LETTER
2167 0F6D..0F70 ; UNASSIGNED # <reserved>..<reserved>
2168 0F71..0F72 ; PVALID # TIBETAN VOWEL SIGN AA..TIBETAN VOWEL SIGN I
2169 0F73 ; DISALLOWED # TIBETAN VOWEL SIGN II
2170 0F74 ; PVALID # TIBETAN VOWEL SIGN U
2171 0F75..0F79 ; DISALLOWED # TIBETAN VOWEL SIGN UU..TIBETAN VOWEL SIGN VO
2172 0F7A..0F80 ; PVALID # TIBETAN VOWEL SIGN E..TIBETAN VOWEL SIGN REV
2173 0F81 ; DISALLOWED # TIBETAN VOWEL SIGN REVERSED II
2174 0F82..0F84 ; PVALID # TIBETAN SIGN NYI ZLA NAA DA..TIBETAN MARK HA
2175 0F85 ; DISALLOWED # TIBETAN MARK PALUTA
2176 0F86..0F8B ; PVALID # TIBETAN SIGN LCI RTAGS..TIBETAN SIGN GRU MED
2177 0F8C..0F8F ; UNASSIGNED # <reserved>..<reserved>
2178 0F90..0F92 ; PVALID # TIBETAN SUBJOINED LETTER KA..TIBETAN SUBJOIN
2179 0F93 ; DISALLOWED # TIBETAN SUBJOINED LETTER GHA
2180 0F94..0F97 ; PVALID # TIBETAN SUBJOINED LETTER NGA..TIBETAN SUBJOI
2181 0F98 ; UNASSIGNED # <reserved>
2182 0F99..0F9C ; PVALID # TIBETAN SUBJOINED LETTER NYA..TIBETAN SUBJOI
2183 0F9D ; DISALLOWED # TIBETAN SUBJOINED LETTER DDHA
2184 0F9E..0FA1 ; PVALID # TIBETAN SUBJOINED LETTER NNA..TIBETAN SUBJOI
2185 0FA2 ; DISALLOWED # TIBETAN SUBJOINED LETTER DHA
2186 0FA3..0FA6 ; PVALID # TIBETAN SUBJOINED LETTER NA..TIBETAN SUBJOIN
2187 0FA7 ; DISALLOWED # TIBETAN SUBJOINED LETTER BHA
2188 0FA8..0FAB ; PVALID # TIBETAN SUBJOINED LETTER MA..TIBETAN SUBJOIN
2189 0FAC ; DISALLOWED # TIBETAN SUBJOINED LETTER DZHA
2190 0FAD..0FB8 ; PVALID # TIBETAN SUBJOINED LETTER WA..TIBETAN SUBJOIN
2191 0FB9 ; DISALLOWED # TIBETAN SUBJOINED LETTER KSSA
2192 0FBA..0FBC ; PVALID # TIBETAN SUBJOINED LETTER FIXED-FORM WA..TIBE
2193 0FBD ; UNASSIGNED # <reserved>
2194
2195
2196
2197 Faltstrom Standards Track [Page 40]
2198 RFC 5892 IDNA Code Points August 2010
2199
2200
2201 0FBE..0FC5 ; DISALLOWED # TIBETAN KU RU KHA..TIBETAN SYMBOL RDO RJE
2202 0FC6 ; PVALID # TIBETAN SYMBOL PADMA GDAN
2203 0FC7..0FCC ; DISALLOWED # TIBETAN SYMBOL RDO RJE RGYA GRAM..TIBETAN SY
2204 0FCD ; UNASSIGNED # <reserved>
2205 0FCE..0FD8 ; DISALLOWED # TIBETAN SIGN RDEL NAG RDEL DKAR..LEFT-FACING
2206 0FD9..0FFF ; UNASSIGNED # <reserved>..<reserved>
2207 1000..1049 ; PVALID # MYANMAR LETTER KA..MYANMAR DIGIT NINE
2208 104A..104F ; DISALLOWED # MYANMAR SIGN LITTLE SECTION..MYANMAR SYMBOL
2209 1050..109D ; PVALID # MYANMAR LETTER SHA..MYANMAR VOWEL SIGN AITON
2210 109E..10C5 ; DISALLOWED # MYANMAR SYMBOL SHAN ONE..GEORGIAN CAPITAL LE
2211 10C6..10CF ; UNASSIGNED # <reserved>..<reserved>
2212 10D0..10FA ; PVALID # GEORGIAN LETTER AN..GEORGIAN LETTER AIN
2213 10FB..10FC ; DISALLOWED # GEORGIAN PARAGRAPH SEPARATOR..MODIFIER LETTE
2214 10FD..10FF ; UNASSIGNED # <reserved>..<reserved>
2215 1100..11FF ; DISALLOWED # HANGUL CHOSEONG KIYEOK..HANGUL JONGSEONG SSA
2216 1200..1248 ; PVALID # ETHIOPIC SYLLABLE HA..ETHIOPIC SYLLABLE QWA
2217 1249 ; UNASSIGNED # <reserved>
2218 124A..124D ; PVALID # ETHIOPIC SYLLABLE QWI..ETHIOPIC SYLLABLE QWE
2219 124E..124F ; UNASSIGNED # <reserved>..<reserved>
2220 1250..1256 ; PVALID # ETHIOPIC SYLLABLE QHA..ETHIOPIC SYLLABLE QHO
2221 1257 ; UNASSIGNED # <reserved>
2222 1258 ; PVALID # ETHIOPIC SYLLABLE QHWA
2223 1259 ; UNASSIGNED # <reserved>
2224 125A..125D ; PVALID # ETHIOPIC SYLLABLE QHWI..ETHIOPIC SYLLABLE QH
2225 125E..125F ; UNASSIGNED # <reserved>..<reserved>
2226 1260..1288 ; PVALID # ETHIOPIC SYLLABLE BA..ETHIOPIC SYLLABLE XWA
2227 1289 ; UNASSIGNED # <reserved>
2228 128A..128D ; PVALID # ETHIOPIC SYLLABLE XWI..ETHIOPIC SYLLABLE XWE
2229 128E..128F ; UNASSIGNED # <reserved>..<reserved>
2230 1290..12B0 ; PVALID # ETHIOPIC SYLLABLE NA..ETHIOPIC SYLLABLE KWA
2231 12B1 ; UNASSIGNED # <reserved>
2232 12B2..12B5 ; PVALID # ETHIOPIC SYLLABLE KWI..ETHIOPIC SYLLABLE KWE
2233 12B6..12B7 ; UNASSIGNED # <reserved>..<reserved>
2234 12B8..12BE ; PVALID # ETHIOPIC SYLLABLE KXA..ETHIOPIC SYLLABLE KXO
2235 12BF ; UNASSIGNED # <reserved>
2236 12C0 ; PVALID # ETHIOPIC SYLLABLE KXWA
2237 12C1 ; UNASSIGNED # <reserved>
2238 12C2..12C5 ; PVALID # ETHIOPIC SYLLABLE KXWI..ETHIOPIC SYLLABLE KX
2239 12C6..12C7 ; UNASSIGNED # <reserved>..<reserved>
2240 12C8..12D6 ; PVALID # ETHIOPIC SYLLABLE WA..ETHIOPIC SYLLABLE PHAR
2241 12D7 ; UNASSIGNED # <reserved>
2242 12D8..1310 ; PVALID # ETHIOPIC SYLLABLE ZA..ETHIOPIC SYLLABLE GWA
2243 1311 ; UNASSIGNED # <reserved>
2244 1312..1315 ; PVALID # ETHIOPIC SYLLABLE GWI..ETHIOPIC SYLLABLE GWE
2245 1316..1317 ; UNASSIGNED # <reserved>..<reserved>
2246 1318..135A ; PVALID # ETHIOPIC SYLLABLE GGA..ETHIOPIC SYLLABLE FYA
2247 135B..135E ; UNASSIGNED # <reserved>..<reserved>
2248 135F ; PVALID # ETHIOPIC COMBINING GEMINATION MARK
2249
2250
2251
2252 Faltstrom Standards Track [Page 41]
2253 RFC 5892 IDNA Code Points August 2010
2254
2255
2256 1360..137C ; DISALLOWED # ETHIOPIC SECTION MARK..ETHIOPIC NUMBER TEN T
2257 137D..137F ; UNASSIGNED # <reserved>..<reserved>
2258 1380..138F ; PVALID # ETHIOPIC SYLLABLE SEBATBEIT MWA..ETHIOPIC SY
2259 1390..1399 ; DISALLOWED # ETHIOPIC TONAL MARK YIZET..ETHIOPIC TONAL MA
2260 139A..139F ; UNASSIGNED # <reserved>..<reserved>
2261 13A0..13F4 ; PVALID # CHEROKEE LETTER A..CHEROKEE LETTER YV
2262 13F5..13FF ; UNASSIGNED # <reserved>..<reserved>
2263 1400 ; DISALLOWED # CANADIAN SYLLABICS HYPHEN
2264 1401..166C ; PVALID # CANADIAN SYLLABICS E..CANADIAN SYLLABICS CAR
2265 166D..166E ; DISALLOWED # CANADIAN SYLLABICS CHI SIGN..CANADIAN SYLLAB
2266 166F..167F ; PVALID # CANADIAN SYLLABICS QAI..CANADIAN SYLLABICS B
2267 1680 ; DISALLOWED # OGHAM SPACE MARK
2268 1681..169A ; PVALID # OGHAM LETTER BEITH..OGHAM LETTER PEITH
2269 169B..169C ; DISALLOWED # OGHAM FEATHER MARK..OGHAM REVERSED FEATHER M
2270 169D..169F ; UNASSIGNED # <reserved>..<reserved>
2271 16A0..16EA ; PVALID # RUNIC LETTER FEHU FEOH FE F..RUNIC LETTER X
2272 16EB..16F0 ; DISALLOWED # RUNIC SINGLE PUNCTUATION..RUNIC BELGTHOR SYM
2273 16F1..16FF ; UNASSIGNED # <reserved>..<reserved>
2274 1700..170C ; PVALID # TAGALOG LETTER A..TAGALOG LETTER YA
2275 170D ; UNASSIGNED # <reserved>
2276 170E..1714 ; PVALID # TAGALOG LETTER LA..TAGALOG SIGN VIRAMA
2277 1715..171F ; UNASSIGNED # <reserved>..<reserved>
2278 1720..1734 ; PVALID # HANUNOO LETTER A..HANUNOO SIGN PAMUDPOD
2279 1735..1736 ; DISALLOWED # PHILIPPINE SINGLE PUNCTUATION..PHILIPPINE DO
2280 1737..173F ; UNASSIGNED # <reserved>..<reserved>
2281 1740..1753 ; PVALID # BUHID LETTER A..BUHID VOWEL SIGN U
2282 1754..175F ; UNASSIGNED # <reserved>..<reserved>
2283 1760..176C ; PVALID # TAGBANWA LETTER A..TAGBANWA LETTER YA
2284 176D ; UNASSIGNED # <reserved>
2285 176E..1770 ; PVALID # TAGBANWA LETTER LA..TAGBANWA LETTER SA
2286 1771 ; UNASSIGNED # <reserved>
2287 1772..1773 ; PVALID # TAGBANWA VOWEL SIGN I..TAGBANWA VOWEL SIGN U
2288 1774..177F ; UNASSIGNED # <reserved>..<reserved>
2289 1780..17B3 ; PVALID # KHMER LETTER KA..KHMER INDEPENDENT VOWEL QAU
2290 17B4..17B5 ; DISALLOWED # KHMER VOWEL INHERENT AQ..KHMER VOWEL INHEREN
2291 17B6..17D3 ; PVALID # KHMER VOWEL SIGN AA..KHMER SIGN BATHAMASAT
2292 17D4..17D6 ; DISALLOWED # KHMER SIGN KHAN..KHMER SIGN CAMNUC PII KUUH
2293 17D7 ; PVALID # KHMER SIGN LEK TOO
2294 17D8..17DB ; DISALLOWED # KHMER SIGN BEYYAL..KHMER CURRENCY SYMBOL RIE
2295 17DC..17DD ; PVALID # KHMER SIGN AVAKRAHASANYA..KHMER SIGN ATTHACA
2296 17DE..17DF ; UNASSIGNED # <reserved>..<reserved>
2297 17E0..17E9 ; PVALID # KHMER DIGIT ZERO..KHMER DIGIT NINE
2298 17EA..17EF ; UNASSIGNED # <reserved>..<reserved>
2299 17F0..17F9 ; DISALLOWED # KHMER SYMBOL LEK ATTAK SON..KHMER SYMBOL LEK
2300 17FA..17FF ; UNASSIGNED # <reserved>..<reserved>
2301 1800..180E ; DISALLOWED # MONGOLIAN BIRGA..MONGOLIAN VOWEL SEPARATOR
2302 180F ; UNASSIGNED # <reserved>
2303 1810..1819 ; PVALID # MONGOLIAN DIGIT ZERO..MONGOLIAN DIGIT NINE
2304
2305
2306
2307 Faltstrom Standards Track [Page 42]
2308 RFC 5892 IDNA Code Points August 2010
2309
2310
2311 181A..181F ; UNASSIGNED # <reserved>..<reserved>
2312 1820..1877 ; PVALID # MONGOLIAN LETTER A..MONGOLIAN LETTER MANCHU
2313 1878..187F ; UNASSIGNED # <reserved>..<reserved>
2314 1880..18AA ; PVALID # MONGOLIAN LETTER ALI GALI ANUSVARA ONE..MONG
2315 18AB..18AF ; UNASSIGNED # <reserved>..<reserved>
2316 18B0..18F5 ; PVALID # CANADIAN SYLLABICS OY..CANADIAN SYLLABICS CA
2317 18F6..18FF ; UNASSIGNED # <reserved>..<reserved>
2318 1900..191C ; PVALID # LIMBU VOWEL-CARRIER LETTER..LIMBU LETTER HA
2319 191D..191F ; UNASSIGNED # <reserved>..<reserved>
2320 1920..192B ; PVALID # LIMBU VOWEL SIGN A..LIMBU SUBJOINED LETTER W
2321 192C..192F ; UNASSIGNED # <reserved>..<reserved>
2322 1930..193B ; PVALID # LIMBU SMALL LETTER KA..LIMBU SIGN SA-I
2323 193C..193F ; UNASSIGNED # <reserved>..<reserved>
2324 1940 ; DISALLOWED # LIMBU SIGN LOO
2325 1941..1943 ; UNASSIGNED # <reserved>..<reserved>
2326 1944..1945 ; DISALLOWED # LIMBU EXCLAMATION MARK..LIMBU QUESTION MARK
2327 1946..196D ; PVALID # LIMBU DIGIT ZERO..TAI LE LETTER AI
2328 196E..196F ; UNASSIGNED # <reserved>..<reserved>
2329 1970..1974 ; PVALID # TAI LE LETTER TONE-2..TAI LE LETTER TONE-6
2330 1975..197F ; UNASSIGNED # <reserved>..<reserved>
2331 1980..19AB ; PVALID # NEW TAI LUE LETTER HIGH QA..NEW TAI LUE LETT
2332 19AC..19AF ; UNASSIGNED # <reserved>..<reserved>
2333 19B0..19C9 ; PVALID # NEW TAI LUE VOWEL SIGN VOWEL SHORTENER..NEW
2334 19CA..19CF ; UNASSIGNED # <reserved>..<reserved>
2335 19D0..19DA ; PVALID # NEW TAI LUE DIGIT ZERO..NEW TAI LUE THAM DIG
2336 19DB..19DD ; UNASSIGNED # <reserved>..<reserved>
2337 19DE..19FF ; DISALLOWED # NEW TAI LUE SIGN LAE..KHMER SYMBOL DAP-PRAM
2338 1A00..1A1B ; PVALID # BUGINESE LETTER KA..BUGINESE VOWEL SIGN AE
2339 1A1C..1A1D ; UNASSIGNED # <reserved>..<reserved>
2340 1A1E..1A1F ; DISALLOWED # BUGINESE PALLAWA..BUGINESE END OF SECTION
2341 1A20..1A5E ; PVALID # TAI THAM LETTER HIGH KA..TAI THAM CONSONANT
2342 1A5F ; UNASSIGNED # <reserved>
2343 1A60..1A7C ; PVALID # TAI THAM SIGN SAKOT..TAI THAM SIGN KHUEN-LUE
2344 1A7D..1A7E ; UNASSIGNED # <reserved>..<reserved>
2345 1A7F..1A89 ; PVALID # TAI THAM COMBINING CRYPTOGRAMMIC DOT..TAI TH
2346 1A8A..1A8F ; UNASSIGNED # <reserved>..<reserved>
2347 1A90..1A99 ; PVALID # TAI THAM THAM DIGIT ZERO..TAI THAM THAM DIGI
2348 1A9A..1A9F ; UNASSIGNED # <reserved>..<reserved>
2349 1AA0..1AA6 ; DISALLOWED # TAI THAM SIGN WIANG..TAI THAM SIGN REVERSED
2350 1AA7 ; PVALID # TAI THAM SIGN MAI YAMOK
2351 1AA8..1AAD ; DISALLOWED # TAI THAM SIGN KAAN..TAI THAM SIGN CAANG
2352 1AAE..1AFF ; UNASSIGNED # <reserved>..<reserved>
2353 1B00..1B4B ; PVALID # BALINESE SIGN ULU RICEM..BALINESE LETTER ASY
2354 1B4C..1B4F ; UNASSIGNED # <reserved>..<reserved>
2355 1B50..1B59 ; PVALID # BALINESE DIGIT ZERO..BALINESE DIGIT NINE
2356 1B5A..1B6A ; DISALLOWED # BALINESE PANTI..BALINESE MUSICAL SYMBOL DANG
2357 1B6B..1B73 ; PVALID # BALINESE MUSICAL SYMBOL COMBINING TEGEH..BAL
2358 1B74..1B7C ; DISALLOWED # BALINESE MUSICAL SYMBOL RIGHT-HAND OPEN DUG.
2359
2360
2361
2362 Faltstrom Standards Track [Page 43]
2363 RFC 5892 IDNA Code Points August 2010
2364
2365
2366 1B7D..1B7F ; UNASSIGNED # <reserved>..<reserved>
2367 1B80..1BAA ; PVALID # SUNDANESE SIGN PANYECEK..SUNDANESE SIGN PAMA
2368 1BAB..1BAD ; UNASSIGNED # <reserved>..<reserved>
2369 1BAE..1BB9 ; PVALID # SUNDANESE LETTER KHA..SUNDANESE DIGIT NINE
2370 1BBA..1BFF ; UNASSIGNED # <reserved>..<reserved>
2371 1C00..1C37 ; PVALID # LEPCHA LETTER KA..LEPCHA SIGN NUKTA
2372 1C38..1C3A ; UNASSIGNED # <reserved>..<reserved>
2373 1C3B..1C3F ; DISALLOWED # LEPCHA PUNCTUATION TA-ROL..LEPCHA PUNCTUATIO
2374 1C40..1C49 ; PVALID # LEPCHA DIGIT ZERO..LEPCHA DIGIT NINE
2375 1C4A..1C4C ; UNASSIGNED # <reserved>..<reserved>
2376 1C4D..1C7D ; PVALID # LEPCHA LETTER TTA..OL CHIKI AHAD
2377 1C7E..1C7F ; DISALLOWED # OL CHIKI PUNCTUATION MUCAAD..OL CHIKI PUNCTU
2378 1C80..1CCF ; UNASSIGNED # <reserved>..<reserved>
2379 1CD0..1CD2 ; PVALID # VEDIC TONE KARSHANA..VEDIC TONE PRENKHA
2380 1CD3 ; DISALLOWED # VEDIC SIGN NIHSHVASA
2381 1CD4..1CF2 ; PVALID # VEDIC SIGN YAJURVEDIC MIDLINE SVARITA..VEDIC
2382 1CF3..1CFF ; UNASSIGNED # <reserved>..<reserved>
2383 1D00..1D2B ; PVALID # LATIN LETTER SMALL CAPITAL A..CYRILLIC LETTE
2384 1D2C..1D2E ; DISALLOWED # MODIFIER LETTER CAPITAL A..MODIFIER LETTER C
2385 1D2F ; PVALID # MODIFIER LETTER CAPITAL BARRED B
2386 1D30..1D3A ; DISALLOWED # MODIFIER LETTER CAPITAL D..MODIFIER LETTER C
2387 1D3B ; PVALID # MODIFIER LETTER CAPITAL REVERSED N
2388 1D3C..1D4D ; DISALLOWED # MODIFIER LETTER CAPITAL O..MODIFIER LETTER S
2389 1D4E ; PVALID # MODIFIER LETTER SMALL TURNED I
2390 1D4F..1D6A ; DISALLOWED # MODIFIER LETTER SMALL K..GREEK SUBSCRIPT SMA
2391 1D6B..1D77 ; PVALID # LATIN SMALL LETTER UE..LATIN SMALL LETTER TU
2392 1D78 ; DISALLOWED # MODIFIER LETTER CYRILLIC EN
2393 1D79..1D9A ; PVALID # LATIN SMALL LETTER INSULAR G..LATIN SMALL LE
2394 1D9B..1DBF ; DISALLOWED # MODIFIER LETTER SMALL TURNED ALPHA..MODIFIER
2395 1DC0..1DE6 ; PVALID # COMBINING DOTTED GRAVE ACCENT..COMBINING LAT
2396 1DE7..1DFC ; UNASSIGNED # <reserved>..<reserved>
2397 1DFD..1DFF ; PVALID # COMBINING ALMOST EQUAL TO BELOW..COMBINING R
2398 1E00 ; DISALLOWED # LATIN CAPITAL LETTER A WITH RING BELOW
2399 1E01 ; PVALID # LATIN SMALL LETTER A WITH RING BELOW
2400 1E02 ; DISALLOWED # LATIN CAPITAL LETTER B WITH DOT ABOVE
2401 1E03 ; PVALID # LATIN SMALL LETTER B WITH DOT ABOVE
2402 1E04 ; DISALLOWED # LATIN CAPITAL LETTER B WITH DOT BELOW
2403 1E05 ; PVALID # LATIN SMALL LETTER B WITH DOT BELOW
2404 1E06 ; DISALLOWED # LATIN CAPITAL LETTER B WITH LINE BELOW
2405 1E07 ; PVALID # LATIN SMALL LETTER B WITH LINE BELOW
2406 1E08 ; DISALLOWED # LATIN CAPITAL LETTER C WITH CEDILLA AND ACUT
2407 1E09 ; PVALID # LATIN SMALL LETTER C WITH CEDILLA AND ACUTE
2408 1E0A ; DISALLOWED # LATIN CAPITAL LETTER D WITH DOT ABOVE
2409 1E0B ; PVALID # LATIN SMALL LETTER D WITH DOT ABOVE
2410 1E0C ; DISALLOWED # LATIN CAPITAL LETTER D WITH DOT BELOW
2411 1E0D ; PVALID # LATIN SMALL LETTER D WITH DOT BELOW
2412 1E0E ; DISALLOWED # LATIN CAPITAL LETTER D WITH LINE BELOW
2413 1E0F ; PVALID # LATIN SMALL LETTER D WITH LINE BELOW
2414
2415
2416
2417 Faltstrom Standards Track [Page 44]
2418 RFC 5892 IDNA Code Points August 2010
2419
2420
2421 1E10 ; DISALLOWED # LATIN CAPITAL LETTER D WITH CEDILLA
2422 1E11 ; PVALID # LATIN SMALL LETTER D WITH CEDILLA
2423 1E12 ; DISALLOWED # LATIN CAPITAL LETTER D WITH CIRCUMFLEX BELOW
2424 1E13 ; PVALID # LATIN SMALL LETTER D WITH CIRCUMFLEX BELOW
2425 1E14 ; DISALLOWED # LATIN CAPITAL LETTER E WITH MACRON AND GRAVE
2426 1E15 ; PVALID # LATIN SMALL LETTER E WITH MACRON AND GRAVE
2427 1E16 ; DISALLOWED # LATIN CAPITAL LETTER E WITH MACRON AND ACUTE
2428 1E17 ; PVALID # LATIN SMALL LETTER E WITH MACRON AND ACUTE
2429 1E18 ; DISALLOWED # LATIN CAPITAL LETTER E WITH CIRCUMFLEX BELOW
2430 1E19 ; PVALID # LATIN SMALL LETTER E WITH CIRCUMFLEX BELOW
2431 1E1A ; DISALLOWED # LATIN CAPITAL LETTER E WITH TILDE BELOW
2432 1E1B ; PVALID # LATIN SMALL LETTER E WITH TILDE BELOW
2433 1E1C ; DISALLOWED # LATIN CAPITAL LETTER E WITH CEDILLA AND BREV
2434 1E1D ; PVALID # LATIN SMALL LETTER E WITH CEDILLA AND BREVE
2435 1E1E ; DISALLOWED # LATIN CAPITAL LETTER F WITH DOT ABOVE
2436 1E1F ; PVALID # LATIN SMALL LETTER F WITH DOT ABOVE
2437 1E20 ; DISALLOWED # LATIN CAPITAL LETTER G WITH MACRON
2438 1E21 ; PVALID # LATIN SMALL LETTER G WITH MACRON
2439 1E22 ; DISALLOWED # LATIN CAPITAL LETTER H WITH DOT ABOVE
2440 1E23 ; PVALID # LATIN SMALL LETTER H WITH DOT ABOVE
2441 1E24 ; DISALLOWED # LATIN CAPITAL LETTER H WITH DOT BELOW
2442 1E25 ; PVALID # LATIN SMALL LETTER H WITH DOT BELOW
2443 1E26 ; DISALLOWED # LATIN CAPITAL LETTER H WITH DIAERESIS
2444 1E27 ; PVALID # LATIN SMALL LETTER H WITH DIAERESIS
2445 1E28 ; DISALLOWED # LATIN CAPITAL LETTER H WITH CEDILLA
2446 1E29 ; PVALID # LATIN SMALL LETTER H WITH CEDILLA
2447 1E2A ; DISALLOWED # LATIN CAPITAL LETTER H WITH BREVE BELOW
2448 1E2B ; PVALID # LATIN SMALL LETTER H WITH BREVE BELOW
2449 1E2C ; DISALLOWED # LATIN CAPITAL LETTER I WITH TILDE BELOW
2450 1E2D ; PVALID # LATIN SMALL LETTER I WITH TILDE BELOW
2451 1E2E ; DISALLOWED # LATIN CAPITAL LETTER I WITH DIAERESIS AND AC
2452 1E2F ; PVALID # LATIN SMALL LETTER I WITH DIAERESIS AND ACUT
2453 1E30 ; DISALLOWED # LATIN CAPITAL LETTER K WITH ACUTE
2454 1E31 ; PVALID # LATIN SMALL LETTER K WITH ACUTE
2455 1E32 ; DISALLOWED # LATIN CAPITAL LETTER K WITH DOT BELOW
2456 1E33 ; PVALID # LATIN SMALL LETTER K WITH DOT BELOW
2457 1E34 ; DISALLOWED # LATIN CAPITAL LETTER K WITH LINE BELOW
2458 1E35 ; PVALID # LATIN SMALL LETTER K WITH LINE BELOW
2459 1E36 ; DISALLOWED # LATIN CAPITAL LETTER L WITH DOT BELOW
2460 1E37 ; PVALID # LATIN SMALL LETTER L WITH DOT BELOW
2461 1E38 ; DISALLOWED # LATIN CAPITAL LETTER L WITH DOT BELOW AND MA
2462 1E39 ; PVALID # LATIN SMALL LETTER L WITH DOT BELOW AND MACR
2463 1E3A ; DISALLOWED # LATIN CAPITAL LETTER L WITH LINE BELOW
2464 1E3B ; PVALID # LATIN SMALL LETTER L WITH LINE BELOW
2465 1E3C ; DISALLOWED # LATIN CAPITAL LETTER L WITH CIRCUMFLEX BELOW
2466 1E3D ; PVALID # LATIN SMALL LETTER L WITH CIRCUMFLEX BELOW
2467 1E3E ; DISALLOWED # LATIN CAPITAL LETTER M WITH ACUTE
2468 1E3F ; PVALID # LATIN SMALL LETTER M WITH ACUTE
2469
2470
2471
2472 Faltstrom Standards Track [Page 45]
2473 RFC 5892 IDNA Code Points August 2010
2474
2475
2476 1E40 ; DISALLOWED # LATIN CAPITAL LETTER M WITH DOT ABOVE
2477 1E41 ; PVALID # LATIN SMALL LETTER M WITH DOT ABOVE
2478 1E42 ; DISALLOWED # LATIN CAPITAL LETTER M WITH DOT BELOW
2479 1E43 ; PVALID # LATIN SMALL LETTER M WITH DOT BELOW
2480 1E44 ; DISALLOWED # LATIN CAPITAL LETTER N WITH DOT ABOVE
2481 1E45 ; PVALID # LATIN SMALL LETTER N WITH DOT ABOVE
2482 1E46 ; DISALLOWED # LATIN CAPITAL LETTER N WITH DOT BELOW
2483 1E47 ; PVALID # LATIN SMALL LETTER N WITH DOT BELOW
2484 1E48 ; DISALLOWED # LATIN CAPITAL LETTER N WITH LINE BELOW
2485 1E49 ; PVALID # LATIN SMALL LETTER N WITH LINE BELOW
2486 1E4A ; DISALLOWED # LATIN CAPITAL LETTER N WITH CIRCUMFLEX BELOW
2487 1E4B ; PVALID # LATIN SMALL LETTER N WITH CIRCUMFLEX BELOW
2488 1E4C ; DISALLOWED # LATIN CAPITAL LETTER O WITH TILDE AND ACUTE
2489 1E4D ; PVALID # LATIN SMALL LETTER O WITH TILDE AND ACUTE
2490 1E4E ; DISALLOWED # LATIN CAPITAL LETTER O WITH TILDE AND DIAERE
2491 1E4F ; PVALID # LATIN SMALL LETTER O WITH TILDE AND DIAERESI
2492 1E50 ; DISALLOWED # LATIN CAPITAL LETTER O WITH MACRON AND GRAVE
2493 1E51 ; PVALID # LATIN SMALL LETTER O WITH MACRON AND GRAVE
2494 1E52 ; DISALLOWED # LATIN CAPITAL LETTER O WITH MACRON AND ACUTE
2495 1E53 ; PVALID # LATIN SMALL LETTER O WITH MACRON AND ACUTE
2496 1E54 ; DISALLOWED # LATIN CAPITAL LETTER P WITH ACUTE
2497 1E55 ; PVALID # LATIN SMALL LETTER P WITH ACUTE
2498 1E56 ; DISALLOWED # LATIN CAPITAL LETTER P WITH DOT ABOVE
2499 1E57 ; PVALID # LATIN SMALL LETTER P WITH DOT ABOVE
2500 1E58 ; DISALLOWED # LATIN CAPITAL LETTER R WITH DOT ABOVE
2501 1E59 ; PVALID # LATIN SMALL LETTER R WITH DOT ABOVE
2502 1E5A ; DISALLOWED # LATIN CAPITAL LETTER R WITH DOT BELOW
2503 1E5B ; PVALID # LATIN SMALL LETTER R WITH DOT BELOW
2504 1E5C ; DISALLOWED # LATIN CAPITAL LETTER R WITH DOT BELOW AND MA
2505 1E5D ; PVALID # LATIN SMALL LETTER R WITH DOT BELOW AND MACR
2506 1E5E ; DISALLOWED # LATIN CAPITAL LETTER R WITH LINE BELOW
2507 1E5F ; PVALID # LATIN SMALL LETTER R WITH LINE BELOW
2508 1E60 ; DISALLOWED # LATIN CAPITAL LETTER S WITH DOT ABOVE
2509 1E61 ; PVALID # LATIN SMALL LETTER S WITH DOT ABOVE
2510 1E62 ; DISALLOWED # LATIN CAPITAL LETTER S WITH DOT BELOW
2511 1E63 ; PVALID # LATIN SMALL LETTER S WITH DOT BELOW
2512 1E64 ; DISALLOWED # LATIN CAPITAL LETTER S WITH ACUTE AND DOT AB
2513 1E65 ; PVALID # LATIN SMALL LETTER S WITH ACUTE AND DOT ABOV
2514 1E66 ; DISALLOWED # LATIN CAPITAL LETTER S WITH CARON AND DOT AB
2515 1E67 ; PVALID # LATIN SMALL LETTER S WITH CARON AND DOT ABOV
2516 1E68 ; DISALLOWED # LATIN CAPITAL LETTER S WITH DOT BELOW AND DO
2517 1E69 ; PVALID # LATIN SMALL LETTER S WITH DOT BELOW AND DOT
2518 1E6A ; DISALLOWED # LATIN CAPITAL LETTER T WITH DOT ABOVE
2519 1E6B ; PVALID # LATIN SMALL LETTER T WITH DOT ABOVE
2520 1E6C ; DISALLOWED # LATIN CAPITAL LETTER T WITH DOT BELOW
2521 1E6D ; PVALID # LATIN SMALL LETTER T WITH DOT BELOW
2522 1E6E ; DISALLOWED # LATIN CAPITAL LETTER T WITH LINE BELOW
2523 1E6F ; PVALID # LATIN SMALL LETTER T WITH LINE BELOW
2524
2525
2526
2527 Faltstrom Standards Track [Page 46]
2528 RFC 5892 IDNA Code Points August 2010
2529
2530
2531 1E70 ; DISALLOWED # LATIN CAPITAL LETTER T WITH CIRCUMFLEX BELOW
2532 1E71 ; PVALID # LATIN SMALL LETTER T WITH CIRCUMFLEX BELOW
2533 1E72 ; DISALLOWED # LATIN CAPITAL LETTER U WITH DIAERESIS BELOW
2534 1E73 ; PVALID # LATIN SMALL LETTER U WITH DIAERESIS BELOW
2535 1E74 ; DISALLOWED # LATIN CAPITAL LETTER U WITH TILDE BELOW
2536 1E75 ; PVALID # LATIN SMALL LETTER U WITH TILDE BELOW
2537 1E76 ; DISALLOWED # LATIN CAPITAL LETTER U WITH CIRCUMFLEX BELOW
2538 1E77 ; PVALID # LATIN SMALL LETTER U WITH CIRCUMFLEX BELOW
2539 1E78 ; DISALLOWED # LATIN CAPITAL LETTER U WITH TILDE AND ACUTE
2540 1E79 ; PVALID # LATIN SMALL LETTER U WITH TILDE AND ACUTE
2541 1E7A ; DISALLOWED # LATIN CAPITAL LETTER U WITH MACRON AND DIAER
2542 1E7B ; PVALID # LATIN SMALL LETTER U WITH MACRON AND DIAERES
2543 1E7C ; DISALLOWED # LATIN CAPITAL LETTER V WITH TILDE
2544 1E7D ; PVALID # LATIN SMALL LETTER V WITH TILDE
2545 1E7E ; DISALLOWED # LATIN CAPITAL LETTER V WITH DOT BELOW
2546 1E7F ; PVALID # LATIN SMALL LETTER V WITH DOT BELOW
2547 1E80 ; DISALLOWED # LATIN CAPITAL LETTER W WITH GRAVE
2548 1E81 ; PVALID # LATIN SMALL LETTER W WITH GRAVE
2549 1E82 ; DISALLOWED # LATIN CAPITAL LETTER W WITH ACUTE
2550 1E83 ; PVALID # LATIN SMALL LETTER W WITH ACUTE
2551 1E84 ; DISALLOWED # LATIN CAPITAL LETTER W WITH DIAERESIS
2552 1E85 ; PVALID # LATIN SMALL LETTER W WITH DIAERESIS
2553 1E86 ; DISALLOWED # LATIN CAPITAL LETTER W WITH DOT ABOVE
2554 1E87 ; PVALID # LATIN SMALL LETTER W WITH DOT ABOVE
2555 1E88 ; DISALLOWED # LATIN CAPITAL LETTER W WITH DOT BELOW
2556 1E89 ; PVALID # LATIN SMALL LETTER W WITH DOT BELOW
2557 1E8A ; DISALLOWED # LATIN CAPITAL LETTER X WITH DOT ABOVE
2558 1E8B ; PVALID # LATIN SMALL LETTER X WITH DOT ABOVE
2559 1E8C ; DISALLOWED # LATIN CAPITAL LETTER X WITH DIAERESIS
2560 1E8D ; PVALID # LATIN SMALL LETTER X WITH DIAERESIS
2561 1E8E ; DISALLOWED # LATIN CAPITAL LETTER Y WITH DOT ABOVE
2562 1E8F ; PVALID # LATIN SMALL LETTER Y WITH DOT ABOVE
2563 1E90 ; DISALLOWED # LATIN CAPITAL LETTER Z WITH CIRCUMFLEX
2564 1E91 ; PVALID # LATIN SMALL LETTER Z WITH CIRCUMFLEX
2565 1E92 ; DISALLOWED # LATIN CAPITAL LETTER Z WITH DOT BELOW
2566 1E93 ; PVALID # LATIN SMALL LETTER Z WITH DOT BELOW
2567 1E94 ; DISALLOWED # LATIN CAPITAL LETTER Z WITH LINE BELOW
2568 1E95..1E99 ; PVALID # LATIN SMALL LETTER Z WITH LINE BELOW..LATIN
2569 1E9A..1E9B ; DISALLOWED # LATIN SMALL LETTER A WITH RIGHT HALF RING..L
2570 1E9C..1E9D ; PVALID # LATIN SMALL LETTER LONG S WITH DIAGONAL STRO
2571 1E9E ; DISALLOWED # LATIN CAPITAL LETTER SHARP S
2572 1E9F ; PVALID # LATIN SMALL LETTER DELTA
2573 1EA0 ; DISALLOWED # LATIN CAPITAL LETTER A WITH DOT BELOW
2574 1EA1 ; PVALID # LATIN SMALL LETTER A WITH DOT BELOW
2575 1EA2 ; DISALLOWED # LATIN CAPITAL LETTER A WITH HOOK ABOVE
2576 1EA3 ; PVALID # LATIN SMALL LETTER A WITH HOOK ABOVE
2577 1EA4 ; DISALLOWED # LATIN CAPITAL LETTER A WITH CIRCUMFLEX AND A
2578 1EA5 ; PVALID # LATIN SMALL LETTER A WITH CIRCUMFLEX AND ACU
2579
2580
2581
2582 Faltstrom Standards Track [Page 47]
2583 RFC 5892 IDNA Code Points August 2010
2584
2585
2586 1EA6 ; DISALLOWED # LATIN CAPITAL LETTER A WITH CIRCUMFLEX AND G
2587 1EA7 ; PVALID # LATIN SMALL LETTER A WITH CIRCUMFLEX AND GRA
2588 1EA8 ; DISALLOWED # LATIN CAPITAL LETTER A WITH CIRCUMFLEX AND H
2589 1EA9 ; PVALID # LATIN SMALL LETTER A WITH CIRCUMFLEX AND HOO
2590 1EAA ; DISALLOWED # LATIN CAPITAL LETTER A WITH CIRCUMFLEX AND T
2591 1EAB ; PVALID # LATIN SMALL LETTER A WITH CIRCUMFLEX AND TIL
2592 1EAC ; DISALLOWED # LATIN CAPITAL LETTER A WITH CIRCUMFLEX AND D
2593 1EAD ; PVALID # LATIN SMALL LETTER A WITH CIRCUMFLEX AND DOT
2594 1EAE ; DISALLOWED # LATIN CAPITAL LETTER A WITH BREVE AND ACUTE
2595 1EAF ; PVALID # LATIN SMALL LETTER A WITH BREVE AND ACUTE
2596 1EB0 ; DISALLOWED # LATIN CAPITAL LETTER A WITH BREVE AND GRAVE
2597 1EB1 ; PVALID # LATIN SMALL LETTER A WITH BREVE AND GRAVE
2598 1EB2 ; DISALLOWED # LATIN CAPITAL LETTER A WITH BREVE AND HOOK A
2599 1EB3 ; PVALID # LATIN SMALL LETTER A WITH BREVE AND HOOK ABO
2600 1EB4 ; DISALLOWED # LATIN CAPITAL LETTER A WITH BREVE AND TILDE
2601 1EB5 ; PVALID # LATIN SMALL LETTER A WITH BREVE AND TILDE
2602 1EB6 ; DISALLOWED # LATIN CAPITAL LETTER A WITH BREVE AND DOT BE
2603 1EB7 ; PVALID # LATIN SMALL LETTER A WITH BREVE AND DOT BELO
2604 1EB8 ; DISALLOWED # LATIN CAPITAL LETTER E WITH DOT BELOW
2605 1EB9 ; PVALID # LATIN SMALL LETTER E WITH DOT BELOW
2606 1EBA ; DISALLOWED # LATIN CAPITAL LETTER E WITH HOOK ABOVE
2607 1EBB ; PVALID # LATIN SMALL LETTER E WITH HOOK ABOVE
2608 1EBC ; DISALLOWED # LATIN CAPITAL LETTER E WITH TILDE
2609 1EBD ; PVALID # LATIN SMALL LETTER E WITH TILDE
2610 1EBE ; DISALLOWED # LATIN CAPITAL LETTER E WITH CIRCUMFLEX AND A
2611 1EBF ; PVALID # LATIN SMALL LETTER E WITH CIRCUMFLEX AND ACU
2612 1EC0 ; DISALLOWED # LATIN CAPITAL LETTER E WITH CIRCUMFLEX AND G
2613 1EC1 ; PVALID # LATIN SMALL LETTER E WITH CIRCUMFLEX AND GRA
2614 1EC2 ; DISALLOWED # LATIN CAPITAL LETTER E WITH CIRCUMFLEX AND H
2615 1EC3 ; PVALID # LATIN SMALL LETTER E WITH CIRCUMFLEX AND HOO
2616 1EC4 ; DISALLOWED # LATIN CAPITAL LETTER E WITH CIRCUMFLEX AND T
2617 1EC5 ; PVALID # LATIN SMALL LETTER E WITH CIRCUMFLEX AND TIL
2618 1EC6 ; DISALLOWED # LATIN CAPITAL LETTER E WITH CIRCUMFLEX AND D
2619 1EC7 ; PVALID # LATIN SMALL LETTER E WITH CIRCUMFLEX AND DOT
2620 1EC8 ; DISALLOWED # LATIN CAPITAL LETTER I WITH HOOK ABOVE
2621 1EC9 ; PVALID # LATIN SMALL LETTER I WITH HOOK ABOVE
2622 1ECA ; DISALLOWED # LATIN CAPITAL LETTER I WITH DOT BELOW
2623 1ECB ; PVALID # LATIN SMALL LETTER I WITH DOT BELOW
2624 1ECC ; DISALLOWED # LATIN CAPITAL LETTER O WITH DOT BELOW
2625 1ECD ; PVALID # LATIN SMALL LETTER O WITH DOT BELOW
2626 1ECE ; DISALLOWED # LATIN CAPITAL LETTER O WITH HOOK ABOVE
2627 1ECF ; PVALID # LATIN SMALL LETTER O WITH HOOK ABOVE
2628 1ED0 ; DISALLOWED # LATIN CAPITAL LETTER O WITH CIRCUMFLEX AND A
2629 1ED1 ; PVALID # LATIN SMALL LETTER O WITH CIRCUMFLEX AND ACU
2630 1ED2 ; DISALLOWED # LATIN CAPITAL LETTER O WITH CIRCUMFLEX AND G
2631 1ED3 ; PVALID # LATIN SMALL LETTER O WITH CIRCUMFLEX AND GRA
2632 1ED4 ; DISALLOWED # LATIN CAPITAL LETTER O WITH CIRCUMFLEX AND H
2633 1ED5 ; PVALID # LATIN SMALL LETTER O WITH CIRCUMFLEX AND HOO
2634
2635
2636
2637 Faltstrom Standards Track [Page 48]
2638 RFC 5892 IDNA Code Points August 2010
2639
2640
2641 1ED6 ; DISALLOWED # LATIN CAPITAL LETTER O WITH CIRCUMFLEX AND T
2642 1ED7 ; PVALID # LATIN SMALL LETTER O WITH CIRCUMFLEX AND TIL
2643 1ED8 ; DISALLOWED # LATIN CAPITAL LETTER O WITH CIRCUMFLEX AND D
2644 1ED9 ; PVALID # LATIN SMALL LETTER O WITH CIRCUMFLEX AND DOT
2645 1EDA ; DISALLOWED # LATIN CAPITAL LETTER O WITH HORN AND ACUTE
2646 1EDB ; PVALID # LATIN SMALL LETTER O WITH HORN AND ACUTE
2647 1EDC ; DISALLOWED # LATIN CAPITAL LETTER O WITH HORN AND GRAVE
2648 1EDD ; PVALID # LATIN SMALL LETTER O WITH HORN AND GRAVE
2649 1EDE ; DISALLOWED # LATIN CAPITAL LETTER O WITH HORN AND HOOK AB
2650 1EDF ; PVALID # LATIN SMALL LETTER O WITH HORN AND HOOK ABOV
2651 1EE0 ; DISALLOWED # LATIN CAPITAL LETTER O WITH HORN AND TILDE
2652 1EE1 ; PVALID # LATIN SMALL LETTER O WITH HORN AND TILDE
2653 1EE2 ; DISALLOWED # LATIN CAPITAL LETTER O WITH HORN AND DOT BEL
2654 1EE3 ; PVALID # LATIN SMALL LETTER O WITH HORN AND DOT BELOW
2655 1EE4 ; DISALLOWED # LATIN CAPITAL LETTER U WITH DOT BELOW
2656 1EE5 ; PVALID # LATIN SMALL LETTER U WITH DOT BELOW
2657 1EE6 ; DISALLOWED # LATIN CAPITAL LETTER U WITH HOOK ABOVE
2658 1EE7 ; PVALID # LATIN SMALL LETTER U WITH HOOK ABOVE
2659 1EE8 ; DISALLOWED # LATIN CAPITAL LETTER U WITH HORN AND ACUTE
2660 1EE9 ; PVALID # LATIN SMALL LETTER U WITH HORN AND ACUTE
2661 1EEA ; DISALLOWED # LATIN CAPITAL LETTER U WITH HORN AND GRAVE
2662 1EEB ; PVALID # LATIN SMALL LETTER U WITH HORN AND GRAVE
2663 1EEC ; DISALLOWED # LATIN CAPITAL LETTER U WITH HORN AND HOOK AB
2664 1EED ; PVALID # LATIN SMALL LETTER U WITH HORN AND HOOK ABOV
2665 1EEE ; DISALLOWED # LATIN CAPITAL LETTER U WITH HORN AND TILDE
2666 1EEF ; PVALID # LATIN SMALL LETTER U WITH HORN AND TILDE
2667 1EF0 ; DISALLOWED # LATIN CAPITAL LETTER U WITH HORN AND DOT BEL
2668 1EF1 ; PVALID # LATIN SMALL LETTER U WITH HORN AND DOT BELOW
2669 1EF2 ; DISALLOWED # LATIN CAPITAL LETTER Y WITH GRAVE
2670 1EF3 ; PVALID # LATIN SMALL LETTER Y WITH GRAVE
2671 1EF4 ; DISALLOWED # LATIN CAPITAL LETTER Y WITH DOT BELOW
2672 1EF5 ; PVALID # LATIN SMALL LETTER Y WITH DOT BELOW
2673 1EF6 ; DISALLOWED # LATIN CAPITAL LETTER Y WITH HOOK ABOVE
2674 1EF7 ; PVALID # LATIN SMALL LETTER Y WITH HOOK ABOVE
2675 1EF8 ; DISALLOWED # LATIN CAPITAL LETTER Y WITH TILDE
2676 1EF9 ; PVALID # LATIN SMALL LETTER Y WITH TILDE
2677 1EFA ; DISALLOWED # LATIN CAPITAL LETTER MIDDLE-WELSH LL
2678 1EFB ; PVALID # LATIN SMALL LETTER MIDDLE-WELSH LL
2679 1EFC ; DISALLOWED # LATIN CAPITAL LETTER MIDDLE-WELSH V
2680 1EFD ; PVALID # LATIN SMALL LETTER MIDDLE-WELSH V
2681 1EFE ; DISALLOWED # LATIN CAPITAL LETTER Y WITH LOOP
2682 1EFF..1F07 ; PVALID # LATIN SMALL LETTER Y WITH LOOP..GREEK SMALL
2683 1F08..1F0F ; DISALLOWED # GREEK CAPITAL LETTER ALPHA WITH PSILI..GREEK
2684 1F10..1F15 ; PVALID # GREEK SMALL LETTER EPSILON WITH PSILI..GREEK
2685 1F16..1F17 ; UNASSIGNED # <reserved>..<reserved>
2686 1F18..1F1D ; DISALLOWED # GREEK CAPITAL LETTER EPSILON WITH PSILI..GRE
2687 1F1E..1F1F ; UNASSIGNED # <reserved>..<reserved>
2688 1F20..1F27 ; PVALID # GREEK SMALL LETTER ETA WITH PSILI..GREEK SMA
2689
2690
2691
2692 Faltstrom Standards Track [Page 49]
2693 RFC 5892 IDNA Code Points August 2010
2694
2695
2696 1F28..1F2F ; DISALLOWED # GREEK CAPITAL LETTER ETA WITH PSILI..GREEK C
2697 1F30..1F37 ; PVALID # GREEK SMALL LETTER IOTA WITH PSILI..GREEK SM
2698 1F38..1F3F ; DISALLOWED # GREEK CAPITAL LETTER IOTA WITH PSILI..GREEK
2699 1F40..1F45 ; PVALID # GREEK SMALL LETTER OMICRON WITH PSILI..GREEK
2700 1F46..1F47 ; UNASSIGNED # <reserved>..<reserved>
2701 1F48..1F4D ; DISALLOWED # GREEK CAPITAL LETTER OMICRON WITH PSILI..GRE
2702 1F4E..1F4F ; UNASSIGNED # <reserved>..<reserved>
2703 1F50..1F57 ; PVALID # GREEK SMALL LETTER UPSILON WITH PSILI..GREEK
2704 1F58 ; UNASSIGNED # <reserved>
2705 1F59 ; DISALLOWED # GREEK CAPITAL LETTER UPSILON WITH DASIA
2706 1F5A ; UNASSIGNED # <reserved>
2707 1F5B ; DISALLOWED # GREEK CAPITAL LETTER UPSILON WITH DASIA AND
2708 1F5C ; UNASSIGNED # <reserved>
2709 1F5D ; DISALLOWED # GREEK CAPITAL LETTER UPSILON WITH DASIA AND
2710 1F5E ; UNASSIGNED # <reserved>
2711 1F5F ; DISALLOWED # GREEK CAPITAL LETTER UPSILON WITH DASIA AND
2712 1F60..1F67 ; PVALID # GREEK SMALL LETTER OMEGA WITH PSILI..GREEK S
2713 1F68..1F6F ; DISALLOWED # GREEK CAPITAL LETTER OMEGA WITH PSILI..GREEK
2714 1F70 ; PVALID # GREEK SMALL LETTER ALPHA WITH VARIA
2715 1F71 ; DISALLOWED # GREEK SMALL LETTER ALPHA WITH OXIA
2716 1F72 ; PVALID # GREEK SMALL LETTER EPSILON WITH VARIA
2717 1F73 ; DISALLOWED # GREEK SMALL LETTER EPSILON WITH OXIA
2718 1F74 ; PVALID # GREEK SMALL LETTER ETA WITH VARIA
2719 1F75 ; DISALLOWED # GREEK SMALL LETTER ETA WITH OXIA
2720 1F76 ; PVALID # GREEK SMALL LETTER IOTA WITH VARIA
2721 1F77 ; DISALLOWED # GREEK SMALL LETTER IOTA WITH OXIA
2722 1F78 ; PVALID # GREEK SMALL LETTER OMICRON WITH VARIA
2723 1F79 ; DISALLOWED # GREEK SMALL LETTER OMICRON WITH OXIA
2724 1F7A ; PVALID # GREEK SMALL LETTER UPSILON WITH VARIA
2725 1F7B ; DISALLOWED # GREEK SMALL LETTER UPSILON WITH OXIA
2726 1F7C ; PVALID # GREEK SMALL LETTER OMEGA WITH VARIA
2727 1F7D ; DISALLOWED # GREEK SMALL LETTER OMEGA WITH OXIA
2728 1F7E..1F7F ; UNASSIGNED # <reserved>..<reserved>
2729 1F80..1FAF ; DISALLOWED # GREEK SMALL LETTER ALPHA WITH PSILI AND YPOG
2730 1FB0..1FB1 ; PVALID # GREEK SMALL LETTER ALPHA WITH VRACHY..GREEK
2731 1FB2..1FB4 ; DISALLOWED # GREEK SMALL LETTER ALPHA WITH VARIA AND YPOG
2732 1FB5 ; UNASSIGNED # <reserved>
2733 1FB6 ; PVALID # GREEK SMALL LETTER ALPHA WITH PERISPOMENI
2734 1FB7..1FC4 ; DISALLOWED # GREEK SMALL LETTER ALPHA WITH PERISPOMENI AN
2735 1FC5 ; UNASSIGNED # <reserved>
2736 1FC6 ; PVALID # GREEK SMALL LETTER ETA WITH PERISPOMENI
2737 1FC7..1FCF ; DISALLOWED # GREEK SMALL LETTER ETA WITH PERISPOMENI AND
2738 1FD0..1FD2 ; PVALID # GREEK SMALL LETTER IOTA WITH VRACHY..GREEK S
2739 1FD3 ; DISALLOWED # GREEK SMALL LETTER IOTA WITH DIALYTIKA AND O
2740 1FD4..1FD5 ; UNASSIGNED # <reserved>..<reserved>
2741 1FD6..1FD7 ; PVALID # GREEK SMALL LETTER IOTA WITH PERISPOMENI..GR
2742 1FD8..1FDB ; DISALLOWED # GREEK CAPITAL LETTER IOTA WITH VRACHY..GREEK
2743 1FDC ; UNASSIGNED # <reserved>
2744
2745
2746
2747 Faltstrom Standards Track [Page 50]
2748 RFC 5892 IDNA Code Points August 2010
2749
2750
2751 1FDD..1FDF ; DISALLOWED # GREEK DASIA AND VARIA..GREEK DASIA AND PERIS
2752 1FE0..1FE2 ; PVALID # GREEK SMALL LETTER UPSILON WITH VRACHY..GREE
2753 1FE3 ; DISALLOWED # GREEK SMALL LETTER UPSILON WITH DIALYTIKA AN
2754 1FE4..1FE7 ; PVALID # GREEK SMALL LETTER RHO WITH PSILI..GREEK SMA
2755 1FE8..1FEF ; DISALLOWED # GREEK CAPITAL LETTER UPSILON WITH VRACHY..GR
2756 1FF0..1FF1 ; UNASSIGNED # <reserved>..<reserved>
2757 1FF2..1FF4 ; DISALLOWED # GREEK SMALL LETTER OMEGA WITH VARIA AND YPOG
2758 1FF5 ; UNASSIGNED # <reserved>
2759 1FF6 ; PVALID # GREEK SMALL LETTER OMEGA WITH PERISPOMENI
2760 1FF7..1FFE ; DISALLOWED # GREEK SMALL LETTER OMEGA WITH PERISPOMENI AN
2761 1FFF ; UNASSIGNED # <reserved>
2762 2000..200B ; DISALLOWED # EN QUAD..ZERO WIDTH SPACE
2763 200C..200D ; CONTEXTJ # ZERO WIDTH NON-JOINER..ZERO WIDTH JOINER
2764 200E..2064 ; DISALLOWED # LEFT-TO-RIGHT MARK..INVISIBLE PLUS
2765 2065..2069 ; UNASSIGNED # <reserved>..<reserved>
2766 206A..2071 ; DISALLOWED # INHIBIT SYMMETRIC SWAPPING..SUPERSCRIPT LATI
2767 2072..2073 ; UNASSIGNED # <reserved>..<reserved>
2768 2074..208E ; DISALLOWED # SUPERSCRIPT FOUR..SUBSCRIPT RIGHT PARENTHESI
2769 208F ; UNASSIGNED # <reserved>
2770 2090..2094 ; DISALLOWED # LATIN SUBSCRIPT SMALL LETTER A..LATIN SUBSCR
2771 2095..209F ; UNASSIGNED # <reserved>..<reserved>
2772 20A0..20B8 ; DISALLOWED # EURO-CURRENCY SIGN..TENGE SIGN
2773 20B9..20CF ; UNASSIGNED # <reserved>..<reserved>
2774 20D0..20F0 ; DISALLOWED # COMBINING LEFT HARPOON ABOVE..COMBINING ASTE
2775 20F1..20FF ; UNASSIGNED # <reserved>..<reserved>
2776 2100..214D ; DISALLOWED # ACCOUNT OF..AKTIESELSKAB
2777 214E ; PVALID # TURNED SMALL F
2778 214F..2183 ; DISALLOWED # SYMBOL FOR SAMARITAN SOURCE..ROMAN NUMERAL R
2779 2184 ; PVALID # LATIN SMALL LETTER REVERSED C
2780 2185..2189 ; DISALLOWED # ROMAN NUMERAL SIX LATE FORM..VULGAR FRACTION
2781 218A..218F ; UNASSIGNED # <reserved>..<reserved>
2782 2190..23E8 ; DISALLOWED # LEFTWARDS ARROW..DECIMAL EXPONENT SYMBOL
2783 23E9..23FF ; UNASSIGNED # <reserved>..<reserved>
2784 2400..2426 ; DISALLOWED # SYMBOL FOR NULL..SYMBOL FOR SUBSTITUTE FORM
2785 2427..243F ; UNASSIGNED # <reserved>..<reserved>
2786 2440..244A ; DISALLOWED # OCR HOOK..OCR DOUBLE BACKSLASH
2787 244B..245F ; UNASSIGNED # <reserved>..<reserved>
2788 2460..26CD ; DISALLOWED # CIRCLED DIGIT ONE..DISABLED CAR
2789 26CE ; UNASSIGNED # <reserved>
2790 26CF..26E1 ; DISALLOWED # PICK..RESTRICTED LEFT ENTRY-2
2791 26E2 ; UNASSIGNED # <reserved>
2792 26E3 ; DISALLOWED # HEAVY CIRCLE WITH STROKE AND TWO DOTS ABOVE
2793 26E4..26E7 ; UNASSIGNED # <reserved>..<reserved>
2794 26E8..26FF ; DISALLOWED # BLACK CROSS ON SHIELD..WHITE FLAG WITH HORIZ
2795 2700 ; UNASSIGNED # <reserved>
2796 2701..2704 ; DISALLOWED # UPPER BLADE SCISSORS..WHITE SCISSORS
2797 2705 ; UNASSIGNED # <reserved>
2798 2706..2709 ; DISALLOWED # TELEPHONE LOCATION SIGN..ENVELOPE
2799
2800
2801
2802 Faltstrom Standards Track [Page 51]
2803 RFC 5892 IDNA Code Points August 2010
2804
2805
2806 270A..270B ; UNASSIGNED # <reserved>..<reserved>
2807 270C..2727 ; DISALLOWED # VICTORY HAND..WHITE FOUR POINTED STAR
2808 2728 ; UNASSIGNED # <reserved>
2809 2729..274B ; DISALLOWED # STRESS OUTLINED WHITE STAR..HEAVY EIGHT TEAR
2810 274C ; UNASSIGNED # <reserved>
2811 274D ; DISALLOWED # SHADOWED WHITE CIRCLE
2812 274E ; UNASSIGNED # <reserved>
2813 274F..2752 ; DISALLOWED # LOWER RIGHT DROP-SHADOWED WHITE SQUARE..UPPE
2814 2753..2755 ; UNASSIGNED # <reserved>..<reserved>
2815 2756..275E ; DISALLOWED # BLACK DIAMOND MINUS WHITE X..HEAVY DOUBLE CO
2816 275F..2760 ; UNASSIGNED # <reserved>..<reserved>
2817 2761..2794 ; DISALLOWED # CURVED STEM PARAGRAPH SIGN ORNAMENT..HEAVY W
2818 2795..2797 ; UNASSIGNED # <reserved>..<reserved>
2819 2798..27AF ; DISALLOWED # HEAVY SOUTH EAST ARROW..NOTCHED LOWER RIGHT-
2820 27B0 ; UNASSIGNED # <reserved>
2821 27B1..27BE ; DISALLOWED # NOTCHED UPPER RIGHT-SHADOWED WHITE RIGHTWARD
2822 27BF ; UNASSIGNED # <reserved>
2823 27C0..27CA ; DISALLOWED # THREE DIMENSIONAL ANGLE..VERTICAL BAR WITH H
2824 27CB ; UNASSIGNED # <reserved>
2825 27CC ; DISALLOWED # LONG DIVISION
2826 27CD..27CF ; UNASSIGNED # <reserved>..<reserved>
2827 27D0..2B4C ; DISALLOWED # WHITE DIAMOND WITH CENTRED DOT..RIGHTWARDS A
2828 2B4D..2B4F ; UNASSIGNED # <reserved>..<reserved>
2829 2B50..2B59 ; DISALLOWED # WHITE MEDIUM STAR..HEAVY CIRCLED SALTIRE
2830 2B5A..2BFF ; UNASSIGNED # <reserved>..<reserved>
2831 2C00..2C2E ; DISALLOWED # GLAGOLITIC CAPITAL LETTER AZU..GLAGOLITIC CA
2832 2C2F ; UNASSIGNED # <reserved>
2833 2C30..2C5E ; PVALID # GLAGOLITIC SMALL LETTER AZU..GLAGOLITIC SMAL
2834 2C5F ; UNASSIGNED # <reserved>
2835 2C60 ; DISALLOWED # LATIN CAPITAL LETTER L WITH DOUBLE BAR
2836 2C61 ; PVALID # LATIN SMALL LETTER L WITH DOUBLE BAR
2837 2C62..2C64 ; DISALLOWED # LATIN CAPITAL LETTER L WITH MIDDLE TILDE..LA
2838 2C65..2C66 ; PVALID # LATIN SMALL LETTER A WITH STROKE..LATIN SMAL
2839 2C67 ; DISALLOWED # LATIN CAPITAL LETTER H WITH DESCENDER
2840 2C68 ; PVALID # LATIN SMALL LETTER H WITH DESCENDER
2841 2C69 ; DISALLOWED # LATIN CAPITAL LETTER K WITH DESCENDER
2842 2C6A ; PVALID # LATIN SMALL LETTER K WITH DESCENDER
2843 2C6B ; DISALLOWED # LATIN CAPITAL LETTER Z WITH DESCENDER
2844 2C6C ; PVALID # LATIN SMALL LETTER Z WITH DESCENDER
2845 2C6D..2C70 ; DISALLOWED # LATIN CAPITAL LETTER ALPHA..LATIN CAPITAL LE
2846 2C71 ; PVALID # LATIN SMALL LETTER V WITH RIGHT HOOK
2847 2C72 ; DISALLOWED # LATIN CAPITAL LETTER W WITH HOOK
2848 2C73..2C74 ; PVALID # LATIN SMALL LETTER W WITH HOOK..LATIN SMALL
2849 2C75 ; DISALLOWED # LATIN CAPITAL LETTER HALF H
2850 2C76..2C7B ; PVALID # LATIN SMALL LETTER HALF H..LATIN LETTER SMAL
2851 2C7C..2C80 ; DISALLOWED # LATIN SUBSCRIPT SMALL LETTER J..COPTIC CAPIT
2852 2C81 ; PVALID # COPTIC SMALL LETTER ALFA
2853 2C82 ; DISALLOWED # COPTIC CAPITAL LETTER VIDA
2854
2855
2856
2857 Faltstrom Standards Track [Page 52]
2858 RFC 5892 IDNA Code Points August 2010
2859
2860
2861 2C83 ; PVALID # COPTIC SMALL LETTER VIDA
2862 2C84 ; DISALLOWED # COPTIC CAPITAL LETTER GAMMA
2863 2C85 ; PVALID # COPTIC SMALL LETTER GAMMA
2864 2C86 ; DISALLOWED # COPTIC CAPITAL LETTER DALDA
2865 2C87 ; PVALID # COPTIC SMALL LETTER DALDA
2866 2C88 ; DISALLOWED # COPTIC CAPITAL LETTER EIE
2867 2C89 ; PVALID # COPTIC SMALL LETTER EIE
2868 2C8A ; DISALLOWED # COPTIC CAPITAL LETTER SOU
2869 2C8B ; PVALID # COPTIC SMALL LETTER SOU
2870 2C8C ; DISALLOWED # COPTIC CAPITAL LETTER ZATA
2871 2C8D ; PVALID # COPTIC SMALL LETTER ZATA
2872 2C8E ; DISALLOWED # COPTIC CAPITAL LETTER HATE
2873 2C8F ; PVALID # COPTIC SMALL LETTER HATE
2874 2C90 ; DISALLOWED # COPTIC CAPITAL LETTER THETHE
2875 2C91 ; PVALID # COPTIC SMALL LETTER THETHE
2876 2C92 ; DISALLOWED # COPTIC CAPITAL LETTER IAUDA
2877 2C93 ; PVALID # COPTIC SMALL LETTER IAUDA
2878 2C94 ; DISALLOWED # COPTIC CAPITAL LETTER KAPA
2879 2C95 ; PVALID # COPTIC SMALL LETTER KAPA
2880 2C96 ; DISALLOWED # COPTIC CAPITAL LETTER LAULA
2881 2C97 ; PVALID # COPTIC SMALL LETTER LAULA
2882 2C98 ; DISALLOWED # COPTIC CAPITAL LETTER MI
2883 2C99 ; PVALID # COPTIC SMALL LETTER MI
2884 2C9A ; DISALLOWED # COPTIC CAPITAL LETTER NI
2885 2C9B ; PVALID # COPTIC SMALL LETTER NI
2886 2C9C ; DISALLOWED # COPTIC CAPITAL LETTER KSI
2887 2C9D ; PVALID # COPTIC SMALL LETTER KSI
2888 2C9E ; DISALLOWED # COPTIC CAPITAL LETTER O
2889 2C9F ; PVALID # COPTIC SMALL LETTER O
2890 2CA0 ; DISALLOWED # COPTIC CAPITAL LETTER PI
2891 2CA1 ; PVALID # COPTIC SMALL LETTER PI
2892 2CA2 ; DISALLOWED # COPTIC CAPITAL LETTER RO
2893 2CA3 ; PVALID # COPTIC SMALL LETTER RO
2894 2CA4 ; DISALLOWED # COPTIC CAPITAL LETTER SIMA
2895 2CA5 ; PVALID # COPTIC SMALL LETTER SIMA
2896 2CA6 ; DISALLOWED # COPTIC CAPITAL LETTER TAU
2897 2CA7 ; PVALID # COPTIC SMALL LETTER TAU
2898 2CA8 ; DISALLOWED # COPTIC CAPITAL LETTER UA
2899 2CA9 ; PVALID # COPTIC SMALL LETTER UA
2900 2CAA ; DISALLOWED # COPTIC CAPITAL LETTER FI
2901 2CAB ; PVALID # COPTIC SMALL LETTER FI
2902 2CAC ; DISALLOWED # COPTIC CAPITAL LETTER KHI
2903 2CAD ; PVALID # COPTIC SMALL LETTER KHI
2904 2CAE ; DISALLOWED # COPTIC CAPITAL LETTER PSI
2905 2CAF ; PVALID # COPTIC SMALL LETTER PSI
2906 2CB0 ; DISALLOWED # COPTIC CAPITAL LETTER OOU
2907 2CB1 ; PVALID # COPTIC SMALL LETTER OOU
2908 2CB2 ; DISALLOWED # COPTIC CAPITAL LETTER DIALECT-P ALEF
2909
2910
2911
2912 Faltstrom Standards Track [Page 53]
2913 RFC 5892 IDNA Code Points August 2010
2914
2915
2916 2CB3 ; PVALID # COPTIC SMALL LETTER DIALECT-P ALEF
2917 2CB4 ; DISALLOWED # COPTIC CAPITAL LETTER OLD COPTIC AIN
2918 2CB5 ; PVALID # COPTIC SMALL LETTER OLD COPTIC AIN
2919 2CB6 ; DISALLOWED # COPTIC CAPITAL LETTER CRYPTOGRAMMIC EIE
2920 2CB7 ; PVALID # COPTIC SMALL LETTER CRYPTOGRAMMIC EIE
2921 2CB8 ; DISALLOWED # COPTIC CAPITAL LETTER DIALECT-P KAPA
2922 2CB9 ; PVALID # COPTIC SMALL LETTER DIALECT-P KAPA
2923 2CBA ; DISALLOWED # COPTIC CAPITAL LETTER DIALECT-P NI
2924 2CBB ; PVALID # COPTIC SMALL LETTER DIALECT-P NI
2925 2CBC ; DISALLOWED # COPTIC CAPITAL LETTER CRYPTOGRAMMIC NI
2926 2CBD ; PVALID # COPTIC SMALL LETTER CRYPTOGRAMMIC NI
2927 2CBE ; DISALLOWED # COPTIC CAPITAL LETTER OLD COPTIC OOU
2928 2CBF ; PVALID # COPTIC SMALL LETTER OLD COPTIC OOU
2929 2CC0 ; DISALLOWED # COPTIC CAPITAL LETTER SAMPI
2930 2CC1 ; PVALID # COPTIC SMALL LETTER SAMPI
2931 2CC2 ; DISALLOWED # COPTIC CAPITAL LETTER CROSSED SHEI
2932 2CC3 ; PVALID # COPTIC SMALL LETTER CROSSED SHEI
2933 2CC4 ; DISALLOWED # COPTIC CAPITAL LETTER OLD COPTIC SHEI
2934 2CC5 ; PVALID # COPTIC SMALL LETTER OLD COPTIC SHEI
2935 2CC6 ; DISALLOWED # COPTIC CAPITAL LETTER OLD COPTIC ESH
2936 2CC7 ; PVALID # COPTIC SMALL LETTER OLD COPTIC ESH
2937 2CC8 ; DISALLOWED # COPTIC CAPITAL LETTER AKHMIMIC KHEI
2938 2CC9 ; PVALID # COPTIC SMALL LETTER AKHMIMIC KHEI
2939 2CCA ; DISALLOWED # COPTIC CAPITAL LETTER DIALECT-P HORI
2940 2CCB ; PVALID # COPTIC SMALL LETTER DIALECT-P HORI
2941 2CCC ; DISALLOWED # COPTIC CAPITAL LETTER OLD COPTIC HORI
2942 2CCD ; PVALID # COPTIC SMALL LETTER OLD COPTIC HORI
2943 2CCE ; DISALLOWED # COPTIC CAPITAL LETTER OLD COPTIC HA
2944 2CCF ; PVALID # COPTIC SMALL LETTER OLD COPTIC HA
2945 2CD0 ; DISALLOWED # COPTIC CAPITAL LETTER L-SHAPED HA
2946 2CD1 ; PVALID # COPTIC SMALL LETTER L-SHAPED HA
2947 2CD2 ; DISALLOWED # COPTIC CAPITAL LETTER OLD COPTIC HEI
2948 2CD3 ; PVALID # COPTIC SMALL LETTER OLD COPTIC HEI
2949 2CD4 ; DISALLOWED # COPTIC CAPITAL LETTER OLD COPTIC HAT
2950 2CD5 ; PVALID # COPTIC SMALL LETTER OLD COPTIC HAT
2951 2CD6 ; DISALLOWED # COPTIC CAPITAL LETTER OLD COPTIC GANGIA
2952 2CD7 ; PVALID # COPTIC SMALL LETTER OLD COPTIC GANGIA
2953 2CD8 ; DISALLOWED # COPTIC CAPITAL LETTER OLD COPTIC DJA
2954 2CD9 ; PVALID # COPTIC SMALL LETTER OLD COPTIC DJA
2955 2CDA ; DISALLOWED # COPTIC CAPITAL LETTER OLD COPTIC SHIMA
2956 2CDB ; PVALID # COPTIC SMALL LETTER OLD COPTIC SHIMA
2957 2CDC ; DISALLOWED # COPTIC CAPITAL LETTER OLD NUBIAN SHIMA
2958 2CDD ; PVALID # COPTIC SMALL LETTER OLD NUBIAN SHIMA
2959 2CDE ; DISALLOWED # COPTIC CAPITAL LETTER OLD NUBIAN NGI
2960 2CDF ; PVALID # COPTIC SMALL LETTER OLD NUBIAN NGI
2961 2CE0 ; DISALLOWED # COPTIC CAPITAL LETTER OLD NUBIAN NYI
2962 2CE1 ; PVALID # COPTIC SMALL LETTER OLD NUBIAN NYI
2963 2CE2 ; DISALLOWED # COPTIC CAPITAL LETTER OLD NUBIAN WAU
2964
2965
2966
2967 Faltstrom Standards Track [Page 54]
2968 RFC 5892 IDNA Code Points August 2010
2969
2970
2971 2CE3..2CE4 ; PVALID # COPTIC SMALL LETTER OLD NUBIAN WAU..COPTIC S
2972 2CE5..2CEB ; DISALLOWED # COPTIC SYMBOL MI RO..COPTIC CAPITAL LETTER C
2973 2CEC ; PVALID # COPTIC SMALL LETTER CRYPTOGRAMMIC SHEI
2974 2CED ; DISALLOWED # COPTIC CAPITAL LETTER CRYPTOGRAMMIC GANGIA
2975 2CEE..2CF1 ; PVALID # COPTIC SMALL LETTER CRYPTOGRAMMIC GANGIA..CO
2976 2CF2..2CF8 ; UNASSIGNED # <reserved>..<reserved>
2977 2CF9..2CFF ; DISALLOWED # COPTIC OLD NUBIAN FULL STOP..COPTIC MORPHOLO
2978 2D00..2D25 ; PVALID # GEORGIAN SMALL LETTER AN..GEORGIAN SMALL LET
2979 2D26..2D2F ; UNASSIGNED # <reserved>..<reserved>
2980 2D30..2D65 ; PVALID # TIFINAGH LETTER YA..TIFINAGH LETTER YAZZ
2981 2D66..2D6E ; UNASSIGNED # <reserved>..<reserved>
2982 2D6F ; DISALLOWED # TIFINAGH MODIFIER LETTER LABIALIZATION MARK
2983 2D70..2D7F ; UNASSIGNED # <reserved>..<reserved>
2984 2D80..2D96 ; PVALID # ETHIOPIC SYLLABLE LOA..ETHIOPIC SYLLABLE GGW
2985 2D97..2D9F ; UNASSIGNED # <reserved>..<reserved>
2986 2DA0..2DA6 ; PVALID # ETHIOPIC SYLLABLE SSA..ETHIOPIC SYLLABLE SSO
2987 2DA7 ; UNASSIGNED # <reserved>
2988 2DA8..2DAE ; PVALID # ETHIOPIC SYLLABLE CCA..ETHIOPIC SYLLABLE CCO
2989 2DAF ; UNASSIGNED # <reserved>
2990 2DB0..2DB6 ; PVALID # ETHIOPIC SYLLABLE ZZA..ETHIOPIC SYLLABLE ZZO
2991 2DB7 ; UNASSIGNED # <reserved>
2992 2DB8..2DBE ; PVALID # ETHIOPIC SYLLABLE CCHA..ETHIOPIC SYLLABLE CC
2993 2DBF ; UNASSIGNED # <reserved>
2994 2DC0..2DC6 ; PVALID # ETHIOPIC SYLLABLE QYA..ETHIOPIC SYLLABLE QYO
2995 2DC7 ; UNASSIGNED # <reserved>
2996 2DC8..2DCE ; PVALID # ETHIOPIC SYLLABLE KYA..ETHIOPIC SYLLABLE KYO
2997 2DCF ; UNASSIGNED # <reserved>
2998 2DD0..2DD6 ; PVALID # ETHIOPIC SYLLABLE XYA..ETHIOPIC SYLLABLE XYO
2999 2DD7 ; UNASSIGNED # <reserved>
3000 2DD8..2DDE ; PVALID # ETHIOPIC SYLLABLE GYA..ETHIOPIC SYLLABLE GYO
3001 2DDF ; UNASSIGNED # <reserved>
3002 2DE0..2DFF ; PVALID # COMBINING CYRILLIC LETTER BE..COMBINING CYRI
3003 2E00..2E2E ; DISALLOWED # RIGHT ANGLE SUBSTITUTION MARKER..REVERSED QU
3004 2E2F ; PVALID # VERTICAL TILDE
3005 2E30..2E31 ; DISALLOWED # RING POINT..WORD SEPARATOR MIDDLE DOT
3006 2E32..2E7F ; UNASSIGNED # <reserved>..<reserved>
3007 2E80..2E99 ; DISALLOWED # CJK RADICAL REPEAT..CJK RADICAL RAP
3008 2E9A ; UNASSIGNED # <reserved>
3009 2E9B..2EF3 ; DISALLOWED # CJK RADICAL CHOKE..CJK RADICAL C-SIMPLIFIED
3010 2EF4..2EFF ; UNASSIGNED # <reserved>..<reserved>
3011 2F00..2FD5 ; DISALLOWED # KANGXI RADICAL ONE..KANGXI RADICAL FLUTE
3012 2FD6..2FEF ; UNASSIGNED # <reserved>..<reserved>
3013 2FF0..2FFB ; DISALLOWED # IDEOGRAPHIC DESCRIPTION CHARACTER LEFT TO RI
3014 2FFC..2FFF ; UNASSIGNED # <reserved>..<reserved>
3015 3000..3004 ; DISALLOWED # IDEOGRAPHIC SPACE..JAPANESE INDUSTRIAL STAND
3016 3005..3007 ; PVALID # IDEOGRAPHIC ITERATION MARK..IDEOGRAPHIC NUMB
3017 3008..3029 ; DISALLOWED # LEFT ANGLE BRACKET..HANGZHOU NUMERAL NINE
3018 302A..302D ; PVALID # IDEOGRAPHIC LEVEL TONE MARK..IDEOGRAPHIC ENT
3019
3020
3021
3022 Faltstrom Standards Track [Page 55]
3023 RFC 5892 IDNA Code Points August 2010
3024
3025
3026 302E..303B ; DISALLOWED # HANGUL SINGLE DOT TONE MARK..VERTICAL IDEOGR
3027 303C ; PVALID # MASU MARK
3028 303D..303F ; DISALLOWED # PART ALTERNATION MARK..IDEOGRAPHIC HALF FILL
3029 3040 ; UNASSIGNED # <reserved>
3030 3041..3096 ; PVALID # HIRAGANA LETTER SMALL A..HIRAGANA LETTER SMA
3031 3097..3098 ; UNASSIGNED # <reserved>..<reserved>
3032 3099..309A ; PVALID # COMBINING KATAKANA-HIRAGANA VOICED SOUND MAR
3033 309B..309C ; DISALLOWED # KATAKANA-HIRAGANA VOICED SOUND MARK..KATAKAN
3034 309D..309E ; PVALID # HIRAGANA ITERATION MARK..HIRAGANA VOICED ITE
3035 309F..30A0 ; DISALLOWED # HIRAGANA DIGRAPH YORI..KATAKANA-HIRAGANA DOU
3036 30A1..30FA ; PVALID # KATAKANA LETTER SMALL A..KATAKANA LETTER VO
3037 30FB ; CONTEXTO # KATAKANA MIDDLE DOT
3038 30FC..30FE ; PVALID # KATAKANA-HIRAGANA PROLONGED SOUND MARK..KATA
3039 30FF ; DISALLOWED # KATAKANA DIGRAPH KOTO
3040 3100..3104 ; UNASSIGNED # <reserved>..<reserved>
3041 3105..312D ; PVALID # BOPOMOFO LETTER B..BOPOMOFO LETTER IH
3042 312E..3130 ; UNASSIGNED # <reserved>..<reserved>
3043 3131..318E ; DISALLOWED # HANGUL LETTER KIYEOK..HANGUL LETTER ARAEAE
3044 318F ; UNASSIGNED # <reserved>
3045 3190..319F ; DISALLOWED # IDEOGRAPHIC ANNOTATION LINKING MARK..IDEOGRA
3046 31A0..31B7 ; PVALID # BOPOMOFO LETTER BU..BOPOMOFO FINAL LETTER H
3047 31B8..31BF ; UNASSIGNED # <reserved>..<reserved>
3048 31C0..31E3 ; DISALLOWED # CJK STROKE T..CJK STROKE Q
3049 31E4..31EF ; UNASSIGNED # <reserved>..<reserved>
3050 31F0..31FF ; PVALID # KATAKANA LETTER SMALL KU..KATAKANA LETTER SM
3051 3200..321E ; DISALLOWED # PARENTHESIZED HANGUL KIYEOK..PARENTHESIZED K
3052 321F ; UNASSIGNED # <reserved>
3053 3220..32FE ; DISALLOWED # PARENTHESIZED IDEOGRAPH ONE..CIRCLED KATAKAN
3054 32FF ; UNASSIGNED # <reserved>
3055 3300..33FF ; DISALLOWED # SQUARE APAATO..SQUARE GAL
3056 3400..4DB5 ; PVALID # <CJK Ideograph Extension A>..<CJK Ideograph
3057 4DB6..4DBF ; UNASSIGNED # <reserved>..<reserved>
3058 4DC0..4DFF ; DISALLOWED # HEXAGRAM FOR THE CREATIVE HEAVEN..HEXAGRAM F
3059 4E00..9FCB ; PVALID # <CJK Ideograph>..<CJK Ideograph>
3060 9FCC..9FFF ; UNASSIGNED # <reserved>..<reserved>
3061 A000..A48C ; PVALID # YI SYLLABLE IT..YI SYLLABLE YYR
3062 A48D..A48F ; UNASSIGNED # <reserved>..<reserved>
3063 A490..A4C6 ; DISALLOWED # YI RADICAL QOT..YI RADICAL KE
3064 A4C7..A4CF ; UNASSIGNED # <reserved>..<reserved>
3065 A4D0..A4FD ; PVALID # LISU LETTER BA..LISU LETTER TONE MYA JEU
3066 A4FE..A4FF ; DISALLOWED # LISU PUNCTUATION COMMA..LISU PUNCTUATION FUL
3067 A500..A60C ; PVALID # VAI SYLLABLE EE..VAI SYLLABLE LENGTHENER
3068 A60D..A60F ; DISALLOWED # VAI COMMA..VAI QUESTION MARK
3069 A610..A62B ; PVALID # VAI SYLLABLE NDOLE FA..VAI SYLLABLE NDOLE DO
3070 A62C..A63F ; UNASSIGNED # <reserved>..<reserved>
3071 A640 ; DISALLOWED # CYRILLIC CAPITAL LETTER ZEMLYA
3072 A641 ; PVALID # CYRILLIC SMALL LETTER ZEMLYA
3073 A642 ; DISALLOWED # CYRILLIC CAPITAL LETTER DZELO
3074
3075
3076
3077 Faltstrom Standards Track [Page 56]
3078 RFC 5892 IDNA Code Points August 2010
3079
3080
3081 A643 ; PVALID # CYRILLIC SMALL LETTER DZELO
3082 A644 ; DISALLOWED # CYRILLIC CAPITAL LETTER REVERSED DZE
3083 A645 ; PVALID # CYRILLIC SMALL LETTER REVERSED DZE
3084 A646 ; DISALLOWED # CYRILLIC CAPITAL LETTER IOTA
3085 A647 ; PVALID # CYRILLIC SMALL LETTER IOTA
3086 A648 ; DISALLOWED # CYRILLIC CAPITAL LETTER DJERV
3087 A649 ; PVALID # CYRILLIC SMALL LETTER DJERV
3088 A64A ; DISALLOWED # CYRILLIC CAPITAL LETTER MONOGRAPH UK
3089 A64B ; PVALID # CYRILLIC SMALL LETTER MONOGRAPH UK
3090 A64C ; DISALLOWED # CYRILLIC CAPITAL LETTER BROAD OMEGA
3091 A64D ; PVALID # CYRILLIC SMALL LETTER BROAD OMEGA
3092 A64E ; DISALLOWED # CYRILLIC CAPITAL LETTER NEUTRAL YER
3093 A64F ; PVALID # CYRILLIC SMALL LETTER NEUTRAL YER
3094 A650 ; DISALLOWED # CYRILLIC CAPITAL LETTER YERU WITH BACK YER
3095 A651 ; PVALID # CYRILLIC SMALL LETTER YERU WITH BACK YER
3096 A652 ; DISALLOWED # CYRILLIC CAPITAL LETTER IOTIFIED YAT
3097 A653 ; PVALID # CYRILLIC SMALL LETTER IOTIFIED YAT
3098 A654 ; DISALLOWED # CYRILLIC CAPITAL LETTER REVERSED YU
3099 A655 ; PVALID # CYRILLIC SMALL LETTER REVERSED YU
3100 A656 ; DISALLOWED # CYRILLIC CAPITAL LETTER IOTIFIED A
3101 A657 ; PVALID # CYRILLIC SMALL LETTER IOTIFIED A
3102 A658 ; DISALLOWED # CYRILLIC CAPITAL LETTER CLOSED LITTLE YUS
3103 A659 ; PVALID # CYRILLIC SMALL LETTER CLOSED LITTLE YUS
3104 A65A ; DISALLOWED # CYRILLIC CAPITAL LETTER BLENDED YUS
3105 A65B ; PVALID # CYRILLIC SMALL LETTER BLENDED YUS
3106 A65C ; DISALLOWED # CYRILLIC CAPITAL LETTER IOTIFIED CLOSED LITT
3107 A65D ; PVALID # CYRILLIC SMALL LETTER IOTIFIED CLOSED LITTLE
3108 A65E ; DISALLOWED # CYRILLIC CAPITAL LETTER YN
3109 A65F ; PVALID # CYRILLIC SMALL LETTER YN
3110 A660..A661 ; UNASSIGNED # <reserved>..<reserved>
3111 A662 ; DISALLOWED # CYRILLIC CAPITAL LETTER SOFT DE
3112 A663 ; PVALID # CYRILLIC SMALL LETTER SOFT DE
3113 A664 ; DISALLOWED # CYRILLIC CAPITAL LETTER SOFT EL
3114 A665 ; PVALID # CYRILLIC SMALL LETTER SOFT EL
3115 A666 ; DISALLOWED # CYRILLIC CAPITAL LETTER SOFT EM
3116 A667 ; PVALID # CYRILLIC SMALL LETTER SOFT EM
3117 A668 ; DISALLOWED # CYRILLIC CAPITAL LETTER MONOCULAR O
3118 A669 ; PVALID # CYRILLIC SMALL LETTER MONOCULAR O
3119 A66A ; DISALLOWED # CYRILLIC CAPITAL LETTER BINOCULAR O
3120 A66B ; PVALID # CYRILLIC SMALL LETTER BINOCULAR O
3121 A66C ; DISALLOWED # CYRILLIC CAPITAL LETTER DOUBLE MONOCULAR O
3122 A66D..A66F ; PVALID # CYRILLIC SMALL LETTER DOUBLE MONOCULAR O..CO
3123 A670..A673 ; DISALLOWED # COMBINING CYRILLIC TEN MILLIONS SIGN..SLAVON
3124 A674..A67B ; UNASSIGNED # <reserved>..<reserved>
3125 A67C..A67D ; PVALID # COMBINING CYRILLIC KAVYKA..COMBINING CYRILLI
3126 A67E ; DISALLOWED # CYRILLIC KAVYKA
3127 A67F ; PVALID # CYRILLIC PAYEROK
3128 A680 ; DISALLOWED # CYRILLIC CAPITAL LETTER DWE
3129
3130
3131
3132 Faltstrom Standards Track [Page 57]
3133 RFC 5892 IDNA Code Points August 2010
3134
3135
3136 A681 ; PVALID # CYRILLIC SMALL LETTER DWE
3137 A682 ; DISALLOWED # CYRILLIC CAPITAL LETTER DZWE
3138 A683 ; PVALID # CYRILLIC SMALL LETTER DZWE
3139 A684 ; DISALLOWED # CYRILLIC CAPITAL LETTER ZHWE
3140 A685 ; PVALID # CYRILLIC SMALL LETTER ZHWE
3141 A686 ; DISALLOWED # CYRILLIC CAPITAL LETTER CCHE
3142 A687 ; PVALID # CYRILLIC SMALL LETTER CCHE
3143 A688 ; DISALLOWED # CYRILLIC CAPITAL LETTER DZZE
3144 A689 ; PVALID # CYRILLIC SMALL LETTER DZZE
3145 A68A ; DISALLOWED # CYRILLIC CAPITAL LETTER TE WITH MIDDLE HOOK
3146 A68B ; PVALID # CYRILLIC SMALL LETTER TE WITH MIDDLE HOOK
3147 A68C ; DISALLOWED # CYRILLIC CAPITAL LETTER TWE
3148 A68D ; PVALID # CYRILLIC SMALL LETTER TWE
3149 A68E ; DISALLOWED # CYRILLIC CAPITAL LETTER TSWE
3150 A68F ; PVALID # CYRILLIC SMALL LETTER TSWE
3151 A690 ; DISALLOWED # CYRILLIC CAPITAL LETTER TSSE
3152 A691 ; PVALID # CYRILLIC SMALL LETTER TSSE
3153 A692 ; DISALLOWED # CYRILLIC CAPITAL LETTER TCHE
3154 A693 ; PVALID # CYRILLIC SMALL LETTER TCHE
3155 A694 ; DISALLOWED # CYRILLIC CAPITAL LETTER HWE
3156 A695 ; PVALID # CYRILLIC SMALL LETTER HWE
3157 A696 ; DISALLOWED # CYRILLIC CAPITAL LETTER SHWE
3158 A697 ; PVALID # CYRILLIC SMALL LETTER SHWE
3159 A698..A69F ; UNASSIGNED # <reserved>..<reserved>
3160 A6A0..A6E5 ; PVALID # BAMUM LETTER A..BAMUM LETTER KI
3161 A6E6..A6EF ; DISALLOWED # BAMUM LETTER MO..BAMUM LETTER KOGHOM
3162 A6F0..A6F1 ; PVALID # BAMUM COMBINING MARK KOQNDON..BAMUM COMBININ
3163 A6F2..A6F7 ; DISALLOWED # BAMUM NJAEMLI..BAMUM QUESTION MARK
3164 A6F8..A6FF ; UNASSIGNED # <reserved>..<reserved>
3165 A700..A716 ; DISALLOWED # MODIFIER LETTER CHINESE TONE YIN PING..MODIF
3166 A717..A71F ; PVALID # MODIFIER LETTER DOT VERTICAL BAR..MODIFIER L
3167 A720..A722 ; DISALLOWED # MODIFIER LETTER STRESS AND HIGH TONE..LATIN
3168 A723 ; PVALID # LATIN SMALL LETTER EGYPTOLOGICAL ALEF
3169 A724 ; DISALLOWED # LATIN CAPITAL LETTER EGYPTOLOGICAL AIN
3170 A725 ; PVALID # LATIN SMALL LETTER EGYPTOLOGICAL AIN
3171 A726 ; DISALLOWED # LATIN CAPITAL LETTER HENG
3172 A727 ; PVALID # LATIN SMALL LETTER HENG
3173 A728 ; DISALLOWED # LATIN CAPITAL LETTER TZ
3174 A729 ; PVALID # LATIN SMALL LETTER TZ
3175 A72A ; DISALLOWED # LATIN CAPITAL LETTER TRESILLO
3176 A72B ; PVALID # LATIN SMALL LETTER TRESILLO
3177 A72C ; DISALLOWED # LATIN CAPITAL LETTER CUATRILLO
3178 A72D ; PVALID # LATIN SMALL LETTER CUATRILLO
3179 A72E ; DISALLOWED # LATIN CAPITAL LETTER CUATRILLO WITH COMMA
3180 A72F..A731 ; PVALID # LATIN SMALL LETTER CUATRILLO WITH COMMA..LAT
3181 A732 ; DISALLOWED # LATIN CAPITAL LETTER AA
3182 A733 ; PVALID # LATIN SMALL LETTER AA
3183 A734 ; DISALLOWED # LATIN CAPITAL LETTER AO
3184
3185
3186
3187 Faltstrom Standards Track [Page 58]
3188 RFC 5892 IDNA Code Points August 2010
3189
3190
3191 A735 ; PVALID # LATIN SMALL LETTER AO
3192 A736 ; DISALLOWED # LATIN CAPITAL LETTER AU
3193 A737 ; PVALID # LATIN SMALL LETTER AU
3194 A738 ; DISALLOWED # LATIN CAPITAL LETTER AV
3195 A739 ; PVALID # LATIN SMALL LETTER AV
3196 A73A ; DISALLOWED # LATIN CAPITAL LETTER AV WITH HORIZONTAL BAR
3197 A73B ; PVALID # LATIN SMALL LETTER AV WITH HORIZONTAL BAR
3198 A73C ; DISALLOWED # LATIN CAPITAL LETTER AY
3199 A73D ; PVALID # LATIN SMALL LETTER AY
3200 A73E ; DISALLOWED # LATIN CAPITAL LETTER REVERSED C WITH DOT
3201 A73F ; PVALID # LATIN SMALL LETTER REVERSED C WITH DOT
3202 A740 ; DISALLOWED # LATIN CAPITAL LETTER K WITH STROKE
3203 A741 ; PVALID # LATIN SMALL LETTER K WITH STROKE
3204 A742 ; DISALLOWED # LATIN CAPITAL LETTER K WITH DIAGONAL STROKE
3205 A743 ; PVALID # LATIN SMALL LETTER K WITH DIAGONAL STROKE
3206 A744 ; DISALLOWED # LATIN CAPITAL LETTER K WITH STROKE AND DIAGO
3207 A745 ; PVALID # LATIN SMALL LETTER K WITH STROKE AND DIAGONA
3208 A746 ; DISALLOWED # LATIN CAPITAL LETTER BROKEN L
3209 A747 ; PVALID # LATIN SMALL LETTER BROKEN L
3210 A748 ; DISALLOWED # LATIN CAPITAL LETTER L WITH HIGH STROKE
3211 A749 ; PVALID # LATIN SMALL LETTER L WITH HIGH STROKE
3212 A74A ; DISALLOWED # LATIN CAPITAL LETTER O WITH LONG STROKE OVER
3213 A74B ; PVALID # LATIN SMALL LETTER O WITH LONG STROKE OVERLA
3214 A74C ; DISALLOWED # LATIN CAPITAL LETTER O WITH LOOP
3215 A74D ; PVALID # LATIN SMALL LETTER O WITH LOOP
3216 A74E ; DISALLOWED # LATIN CAPITAL LETTER OO
3217 A74F ; PVALID # LATIN SMALL LETTER OO
3218 A750 ; DISALLOWED # LATIN CAPITAL LETTER P WITH STROKE THROUGH D
3219 A751 ; PVALID # LATIN SMALL LETTER P WITH STROKE THROUGH DES
3220 A752 ; DISALLOWED # LATIN CAPITAL LETTER P WITH FLOURISH
3221 A753 ; PVALID # LATIN SMALL LETTER P WITH FLOURISH
3222 A754 ; DISALLOWED # LATIN CAPITAL LETTER P WITH SQUIRREL TAIL
3223 A755 ; PVALID # LATIN SMALL LETTER P WITH SQUIRREL TAIL
3224 A756 ; DISALLOWED # LATIN CAPITAL LETTER Q WITH STROKE THROUGH D
3225 A757 ; PVALID # LATIN SMALL LETTER Q WITH STROKE THROUGH DES
3226 A758 ; DISALLOWED # LATIN CAPITAL LETTER Q WITH DIAGONAL STROKE
3227 A759 ; PVALID # LATIN SMALL LETTER Q WITH DIAGONAL STROKE
3228 A75A ; DISALLOWED # LATIN CAPITAL LETTER R ROTUNDA
3229 A75B ; PVALID # LATIN SMALL LETTER R ROTUNDA
3230 A75C ; DISALLOWED # LATIN CAPITAL LETTER RUM ROTUNDA
3231 A75D ; PVALID # LATIN SMALL LETTER RUM ROTUNDA
3232 A75E ; DISALLOWED # LATIN CAPITAL LETTER V WITH DIAGONAL STROKE
3233 A75F ; PVALID # LATIN SMALL LETTER V WITH DIAGONAL STROKE
3234 A760 ; DISALLOWED # LATIN CAPITAL LETTER VY
3235 A761 ; PVALID # LATIN SMALL LETTER VY
3236 A762 ; DISALLOWED # LATIN CAPITAL LETTER VISIGOTHIC Z
3237 A763 ; PVALID # LATIN SMALL LETTER VISIGOTHIC Z
3238 A764 ; DISALLOWED # LATIN CAPITAL LETTER THORN WITH STROKE
3239
3240
3241
3242 Faltstrom Standards Track [Page 59]
3243 RFC 5892 IDNA Code Points August 2010
3244
3245
3246 A765 ; PVALID # LATIN SMALL LETTER THORN WITH STROKE
3247 A766 ; DISALLOWED # LATIN CAPITAL LETTER THORN WITH STROKE THROU
3248 A767 ; PVALID # LATIN SMALL LETTER THORN WITH STROKE THROUGH
3249 A768 ; DISALLOWED # LATIN CAPITAL LETTER VEND
3250 A769 ; PVALID # LATIN SMALL LETTER VEND
3251 A76A ; DISALLOWED # LATIN CAPITAL LETTER ET
3252 A76B ; PVALID # LATIN SMALL LETTER ET
3253 A76C ; DISALLOWED # LATIN CAPITAL LETTER IS
3254 A76D ; PVALID # LATIN SMALL LETTER IS
3255 A76E ; DISALLOWED # LATIN CAPITAL LETTER CON
3256 A76F ; PVALID # LATIN SMALL LETTER CON
3257 A770 ; DISALLOWED # MODIFIER LETTER US
3258 A771..A778 ; PVALID # LATIN SMALL LETTER DUM..LATIN SMALL LETTER U
3259 A779 ; DISALLOWED # LATIN CAPITAL LETTER INSULAR D
3260 A77A ; PVALID # LATIN SMALL LETTER INSULAR D
3261 A77B ; DISALLOWED # LATIN CAPITAL LETTER INSULAR F
3262 A77C ; PVALID # LATIN SMALL LETTER INSULAR F
3263 A77D..A77E ; DISALLOWED # LATIN CAPITAL LETTER INSULAR G..LATIN CAPITA
3264 A77F ; PVALID # LATIN SMALL LETTER TURNED INSULAR G
3265 A780 ; DISALLOWED # LATIN CAPITAL LETTER TURNED L
3266 A781 ; PVALID # LATIN SMALL LETTER TURNED L
3267 A782 ; DISALLOWED # LATIN CAPITAL LETTER INSULAR R
3268 A783 ; PVALID # LATIN SMALL LETTER INSULAR R
3269 A784 ; DISALLOWED # LATIN CAPITAL LETTER INSULAR S
3270 A785 ; PVALID # LATIN SMALL LETTER INSULAR S
3271 A786 ; DISALLOWED # LATIN CAPITAL LETTER INSULAR T
3272 A787..A788 ; PVALID # LATIN SMALL LETTER INSULAR T..MODIFIER LETTE
3273 A789..A78B ; DISALLOWED # MODIFIER LETTER COLON..LATIN CAPITAL LETTER
3274 A78C ; PVALID # LATIN SMALL LETTER SALTILLO
3275 A78D..A7FA ; UNASSIGNED # <reserved>..<reserved>
3276 A7FB..A827 ; PVALID # LATIN EPIGRAPHIC LETTER REVERSED F..SYLOTI N
3277 A828..A82B ; DISALLOWED # SYLOTI NAGRI POETRY MARK-1..SYLOTI NAGRI POE
3278 A82C..A82F ; UNASSIGNED # <reserved>..<reserved>
3279 A830..A839 ; DISALLOWED # NORTH INDIC FRACTION ONE QUARTER..NORTH INDI
3280 A83A..A83F ; UNASSIGNED # <reserved>..<reserved>
3281 A840..A873 ; PVALID # PHAGS-PA LETTER KA..PHAGS-PA LETTER CANDRABI
3282 A874..A877 ; DISALLOWED # PHAGS-PA SINGLE HEAD MARK..PHAGS-PA MARK DOU
3283 A878..A87F ; UNASSIGNED # <reserved>..<reserved>
3284 A880..A8C4 ; PVALID # SAURASHTRA SIGN ANUSVARA..SAURASHTRA SIGN VI
3285 A8C5..A8CD ; UNASSIGNED # <reserved>..<reserved>
3286 A8CE..A8CF ; DISALLOWED # SAURASHTRA DANDA..SAURASHTRA DOUBLE DANDA
3287 A8D0..A8D9 ; PVALID # SAURASHTRA DIGIT ZERO..SAURASHTRA DIGIT NINE
3288 A8DA..A8DF ; UNASSIGNED # <reserved>..<reserved>
3289 A8E0..A8F7 ; PVALID # COMBINING DEVANAGARI DIGIT ZERO..DEVANAGARI
3290 A8F8..A8FA ; DISALLOWED # DEVANAGARI SIGN PUSHPIKA..DEVANAGARI CARET
3291 A8FB ; PVALID # DEVANAGARI HEADSTROKE
3292 A8FC..A8FF ; UNASSIGNED # <reserved>..<reserved>
3293 A900..A92D ; PVALID # KAYAH LI DIGIT ZERO..KAYAH LI TONE CALYA PLO
3294
3295
3296
3297 Faltstrom Standards Track [Page 60]
3298 RFC 5892 IDNA Code Points August 2010
3299
3300
3301 A92E..A92F ; DISALLOWED # KAYAH LI SIGN CWI..KAYAH LI SIGN SHYA
3302 A930..A953 ; PVALID # REJANG LETTER KA..REJANG VIRAMA
3303 A954..A95E ; UNASSIGNED # <reserved>..<reserved>
3304 A95F..A97C ; DISALLOWED # REJANG SECTION MARK..HANGUL CHOSEONG SSANGYE
3305 A97D..A97F ; UNASSIGNED # <reserved>..<reserved>
3306 A980..A9C0 ; PVALID # JAVANESE SIGN PANYANGGA..JAVANESE PANGKON
3307 A9C1..A9CD ; DISALLOWED # JAVANESE LEFT RERENGGAN..JAVANESE TURNED PAD
3308 A9CE ; UNASSIGNED # <reserved>
3309 A9CF..A9D9 ; PVALID # JAVANESE PANGRANGKEP..JAVANESE DIGIT NINE
3310 A9DA..A9DD ; UNASSIGNED # <reserved>..<reserved>
3311 A9DE..A9DF ; DISALLOWED # JAVANESE PADA TIRTA TUMETES..JAVANESE PADA I
3312 A9E0..A9FF ; UNASSIGNED # <reserved>..<reserved>
3313 AA00..AA36 ; PVALID # CHAM LETTER A..CHAM CONSONANT SIGN WA
3314 AA37..AA3F ; UNASSIGNED # <reserved>..<reserved>
3315 AA40..AA4D ; PVALID # CHAM LETTER FINAL K..CHAM CONSONANT SIGN FIN
3316 AA4E..AA4F ; UNASSIGNED # <reserved>..<reserved>
3317 AA50..AA59 ; PVALID # CHAM DIGIT ZERO..CHAM DIGIT NINE
3318 AA5A..AA5B ; UNASSIGNED # <reserved>..<reserved>
3319 AA5C..AA5F ; DISALLOWED # CHAM PUNCTUATION SPIRAL..CHAM PUNCTUATION TR
3320 AA60..AA76 ; PVALID # MYANMAR LETTER KHAMTI GA..MYANMAR LOGOGRAM K
3321 AA77..AA79 ; DISALLOWED # MYANMAR SYMBOL AITON EXCLAMATION..MYANMAR SY
3322 AA7A..AA7B ; PVALID # MYANMAR LETTER AITON RA..MYANMAR SIGN PAO KA
3323 AA7C..AA7F ; UNASSIGNED # <reserved>..<reserved>
3324 AA80..AAC2 ; PVALID # TAI VIET LETTER LOW KO..TAI VIET TONE MAI SO
3325 AAC3..AADA ; UNASSIGNED # <reserved>..<reserved>
3326 AADB..AADD ; PVALID # TAI VIET SYMBOL KON..TAI VIET SYMBOL SAM
3327 AADE..AADF ; DISALLOWED # TAI VIET SYMBOL HO HOI..TAI VIET SYMBOL KOI
3328 AAE0..ABBF ; UNASSIGNED # <reserved>..<reserved>
3329 ABC0..ABEA ; PVALID # MEETEI MAYEK LETTER KOK..MEETEI MAYEK VOWEL
3330 ABEB ; DISALLOWED # MEETEI MAYEK CHEIKHEI
3331 ABEC..ABED ; PVALID # MEETEI MAYEK LUM IYEK..MEETEI MAYEK APUN IYE
3332 ABEE..ABEF ; UNASSIGNED # <reserved>..<reserved>
3333 ABF0..ABF9 ; PVALID # MEETEI MAYEK DIGIT ZERO..MEETEI MAYEK DIGIT
3334 ABFA..ABFF ; UNASSIGNED # <reserved>..<reserved>
3335 AC00..D7A3 ; PVALID # <Hangul Syllable>..<Hangul Syllable>
3336 D7A4..D7AF ; UNASSIGNED # <reserved>..<reserved>
3337 D7B0..D7C6 ; DISALLOWED # HANGUL JUNGSEONG O-YEO..HANGUL JUNGSEONG ARA
3338 D7C7..D7CA ; UNASSIGNED # <reserved>..<reserved>
3339 D7CB..D7FB ; DISALLOWED # HANGUL JONGSEONG NIEUN-RIEUL..HANGUL JONGSEO
3340 D7FC..D7FF ; UNASSIGNED # <reserved>..<reserved>
3341 D800..FA0D ; DISALLOWED # <Non Private Use High Surrogate>..CJK COMPAT
3342 FA0E..FA0F ; PVALID # CJK COMPATIBILITY IDEOGRAPH-FA0E..CJK COMPAT
3343 FA10 ; DISALLOWED # CJK COMPATIBILITY IDEOGRAPH-FA10
3344 FA11 ; PVALID # CJK COMPATIBILITY IDEOGRAPH-FA11
3345 FA12 ; DISALLOWED # CJK COMPATIBILITY IDEOGRAPH-FA12
3346 FA13..FA14 ; PVALID # CJK COMPATIBILITY IDEOGRAPH-FA13..CJK COMPAT
3347 FA15..FA1E ; DISALLOWED # CJK COMPATIBILITY IDEOGRAPH-FA15..CJK COMPAT
3348 FA1F ; PVALID # CJK COMPATIBILITY IDEOGRAPH-FA1F
3349
3350
3351
3352 Faltstrom Standards Track [Page 61]
3353 RFC 5892 IDNA Code Points August 2010
3354
3355
3356 FA20 ; DISALLOWED # CJK COMPATIBILITY IDEOGRAPH-FA20
3357 FA21 ; PVALID # CJK COMPATIBILITY IDEOGRAPH-FA21
3358 FA22 ; DISALLOWED # CJK COMPATIBILITY IDEOGRAPH-FA22
3359 FA23..FA24 ; PVALID # CJK COMPATIBILITY IDEOGRAPH-FA23..CJK COMPAT
3360 FA25..FA26 ; DISALLOWED # CJK COMPATIBILITY IDEOGRAPH-FA25..CJK COMPAT
3361 FA27..FA29 ; PVALID # CJK COMPATIBILITY IDEOGRAPH-FA27..CJK COMPAT
3362 FA2A..FA2D ; DISALLOWED # CJK COMPATIBILITY IDEOGRAPH-FA2A..CJK COMPAT
3363 FA2E..FA2F ; UNASSIGNED # <reserved>..<reserved>
3364 FA30..FA6D ; DISALLOWED # CJK COMPATIBILITY IDEOGRAPH-FA30..CJK COMPAT
3365 FA6E..FA6F ; UNASSIGNED # <reserved>..<reserved>
3366 FA70..FAD9 ; DISALLOWED # CJK COMPATIBILITY IDEOGRAPH-FA70..CJK COMPAT
3367 FADA..FAFF ; UNASSIGNED # <reserved>..<reserved>
3368 FB00..FB06 ; DISALLOWED # LATIN SMALL LIGATURE FF..LATIN SMALL LIGATUR
3369 FB07..FB12 ; UNASSIGNED # <reserved>..<reserved>
3370 FB13..FB17 ; DISALLOWED # ARMENIAN SMALL LIGATURE MEN NOW..ARMENIAN SM
3371 FB18..FB1C ; UNASSIGNED # <reserved>..<reserved>
3372 FB1D ; DISALLOWED # HEBREW LETTER YOD WITH HIRIQ
3373 FB1E ; PVALID # HEBREW POINT JUDEO-SPANISH VARIKA
3374 FB1F..FB36 ; DISALLOWED # HEBREW LIGATURE YIDDISH YOD YOD PATAH..HEBRE
3375 FB37 ; UNASSIGNED # <reserved>
3376 FB38..FB3C ; DISALLOWED # HEBREW LETTER TET WITH DAGESH..HEBREW LETTER
3377 FB3D ; UNASSIGNED # <reserved>
3378 FB3E ; DISALLOWED # HEBREW LETTER MEM WITH DAGESH
3379 FB3F ; UNASSIGNED # <reserved>
3380 FB40..FB41 ; DISALLOWED # HEBREW LETTER NUN WITH DAGESH..HEBREW LETTER
3381 FB42 ; UNASSIGNED # <reserved>
3382 FB43..FB44 ; DISALLOWED # HEBREW LETTER FINAL PE WITH DAGESH..HEBREW L
3383 FB45 ; UNASSIGNED # <reserved>
3384 FB46..FBB1 ; DISALLOWED # HEBREW LETTER TSADI WITH DAGESH..ARABIC LETT
3385 FBB2..FBD2 ; UNASSIGNED # <reserved>..<reserved>
3386 FBD3..FD3F ; DISALLOWED # ARABIC LETTER NG ISOLATED FORM..ORNATE RIGHT
3387 FD40..FD4F ; UNASSIGNED # <reserved>..<reserved>
3388 FD50..FD8F ; DISALLOWED # ARABIC LIGATURE TEH WITH JEEM WITH MEEM INIT
3389 FD90..FD91 ; UNASSIGNED # <reserved>..<reserved>
3390 FD92..FDC7 ; DISALLOWED # ARABIC LIGATURE MEEM WITH JEEM WITH KHAH INI
3391 FDC8..FDCF ; UNASSIGNED # <reserved>..<reserved>
3392 FDD0..FDFD ; DISALLOWED # <noncharacter>..ARABIC LIGATURE BISMILLAH AR
3393 FDFE..FDFF ; UNASSIGNED # <reserved>..<reserved>
3394 FE00..FE19 ; DISALLOWED # VARIATION SELECTOR-1..PRESENTATION FORM FOR
3395 FE1A..FE1F ; UNASSIGNED # <reserved>..<reserved>
3396 FE20..FE26 ; PVALID # COMBINING LIGATURE LEFT HALF..COMBINING CONJ
3397 FE27..FE2F ; UNASSIGNED # <reserved>..<reserved>
3398 FE30..FE52 ; DISALLOWED # PRESENTATION FORM FOR VERTICAL TWO DOT LEADE
3399 FE53 ; UNASSIGNED # <reserved>
3400 FE54..FE66 ; DISALLOWED # SMALL SEMICOLON..SMALL EQUALS SIGN
3401 FE67 ; UNASSIGNED # <reserved>
3402 FE68..FE6B ; DISALLOWED # SMALL REVERSE SOLIDUS..SMALL COMMERCIAL AT
3403 FE6C..FE6F ; UNASSIGNED # <reserved>..<reserved>
3404
3405
3406
3407 Faltstrom Standards Track [Page 62]
3408 RFC 5892 IDNA Code Points August 2010
3409
3410
3411 FE70..FE72 ; DISALLOWED # ARABIC FATHATAN ISOLATED FORM..ARABIC DAMMAT
3412 FE73 ; PVALID # ARABIC TAIL FRAGMENT
3413 FE74 ; DISALLOWED # ARABIC KASRATAN ISOLATED FORM
3414 FE75 ; UNASSIGNED # <reserved>
3415 FE76..FEFC ; DISALLOWED # ARABIC FATHA ISOLATED FORM..ARABIC LIGATURE
3416 FEFD..FEFE ; UNASSIGNED # <reserved>..<reserved>
3417 FEFF ; DISALLOWED # ZERO WIDTH NO-BREAK SPACE
3418 FF00 ; UNASSIGNED # <reserved>
3419 FF01..FFBE ; DISALLOWED # FULLWIDTH EXCLAMATION MARK..HALFWIDTH HANGUL
3420 FFBF..FFC1 ; UNASSIGNED # <reserved>..<reserved>
3421 FFC2..FFC7 ; DISALLOWED # HALFWIDTH HANGUL LETTER A..HALFWIDTH HANGUL
3422 FFC8..FFC9 ; UNASSIGNED # <reserved>..<reserved>
3423 FFCA..FFCF ; DISALLOWED # HALFWIDTH HANGUL LETTER YEO..HALFWIDTH HANGU
3424 FFD0..FFD1 ; UNASSIGNED # <reserved>..<reserved>
3425 FFD2..FFD7 ; DISALLOWED # HALFWIDTH HANGUL LETTER YO..HALFWIDTH HANGUL
3426 FFD8..FFD9 ; UNASSIGNED # <reserved>..<reserved>
3427 FFDA..FFDC ; DISALLOWED # HALFWIDTH HANGUL LETTER EU..HALFWIDTH HANGUL
3428 FFDD..FFDF ; UNASSIGNED # <reserved>..<reserved>
3429 FFE0..FFE6 ; DISALLOWED # FULLWIDTH CENT SIGN..FULLWIDTH WON SIGN
3430 FFE7 ; UNASSIGNED # <reserved>
3431 FFE8..FFEE ; DISALLOWED # HALFWIDTH FORMS LIGHT VERTICAL..HALFWIDTH WH
3432 FFEF..FFF8 ; UNASSIGNED # <reserved>..<reserved>
3433 FFF9..FFFF ; DISALLOWED # INTERLINEAR ANNOTATION ANCHOR..<noncharacter
3434 10000..1000B; PVALID # LINEAR B SYLLABLE B008 A..LINEAR B SYLLABLE
3435 1000C ; UNASSIGNED # <reserved>
3436 1000D..10026; PVALID # LINEAR B SYLLABLE B036 JO..LINEAR B SYLLABLE
3437 10027 ; UNASSIGNED # <reserved>
3438 10028..1003A; PVALID # LINEAR B SYLLABLE B060 RA..LINEAR B SYLLABLE
3439 1003B ; UNASSIGNED # <reserved>
3440 1003C..1003D; PVALID # LINEAR B SYLLABLE B017 ZA..LINEAR B SYLLABLE
3441 1003E ; UNASSIGNED # <reserved>
3442 1003F..1004D; PVALID # LINEAR B SYLLABLE B020 ZO..LINEAR B SYLLABLE
3443 1004E..1004F; UNASSIGNED # <reserved>..<reserved>
3444 10050..1005D; PVALID # LINEAR B SYMBOL B018..LINEAR B SYMBOL B089
3445 1005E..1007F; UNASSIGNED # <reserved>..<reserved>
3446 10080..100FA; PVALID # LINEAR B IDEOGRAM B100 MAN..LINEAR B IDEOGRA
3447 100FB..100FF; UNASSIGNED # <reserved>..<reserved>
3448 10100..10102; DISALLOWED # AEGEAN WORD SEPARATOR LINE..AEGEAN CHECK MAR
3449 10103..10106; UNASSIGNED # <reserved>..<reserved>
3450 10107..10133; DISALLOWED # AEGEAN NUMBER ONE..AEGEAN NUMBER NINETY THOU
3451 10134..10136; UNASSIGNED # <reserved>..<reserved>
3452 10137..1018A; DISALLOWED # AEGEAN WEIGHT BASE UNIT..GREEK ZERO SIGN
3453 1018B..1018F; UNASSIGNED # <reserved>..<reserved>
3454 10190..1019B; DISALLOWED # ROMAN SEXTANS SIGN..ROMAN CENTURIAL SIGN
3455 1019C..101CF; UNASSIGNED # <reserved>..<reserved>
3456 101D0..101FC; DISALLOWED # PHAISTOS DISC SIGN PEDESTRIAN..PHAISTOS DISC
3457 101FD ; PVALID # PHAISTOS DISC SIGN COMBINING OBLIQUE STROKE
3458 101FE..1027F; UNASSIGNED # <reserved>..<reserved>
3459
3460
3461
3462 Faltstrom Standards Track [Page 63]
3463 RFC 5892 IDNA Code Points August 2010
3464
3465
3466 10280..1029C; PVALID # LYCIAN LETTER A..LYCIAN LETTER X
3467 1029D..1029F; UNASSIGNED # <reserved>..<reserved>
3468 102A0..102D0; PVALID # CARIAN LETTER A..CARIAN LETTER UUU3
3469 102D1..102FF; UNASSIGNED # <reserved>..<reserved>
3470 10300..1031E; PVALID # OLD ITALIC LETTER A..OLD ITALIC LETTER UU
3471 1031F ; UNASSIGNED # <reserved>
3472 10320..10323; DISALLOWED # OLD ITALIC NUMERAL ONE..OLD ITALIC NUMERAL F
3473 10324..1032F; UNASSIGNED # <reserved>..<reserved>
3474 10330..10340; PVALID # GOTHIC LETTER AHSA..GOTHIC LETTER PAIRTHRA
3475 10341 ; DISALLOWED # GOTHIC LETTER NINETY
3476 10342..10349; PVALID # GOTHIC LETTER RAIDA..GOTHIC LETTER OTHAL
3477 1034A ; DISALLOWED # GOTHIC LETTER NINE HUNDRED
3478 1034B..1037F; UNASSIGNED # <reserved>..<reserved>
3479 10380..1039D; PVALID # UGARITIC LETTER ALPA..UGARITIC LETTER SSU
3480 1039E ; UNASSIGNED # <reserved>
3481 1039F ; DISALLOWED # UGARITIC WORD DIVIDER
3482 103A0..103C3; PVALID # OLD PERSIAN SIGN A..OLD PERSIAN SIGN HA
3483 103C4..103C7; UNASSIGNED # <reserved>..<reserved>
3484 103C8..103CF; PVALID # OLD PERSIAN SIGN AURAMAZDAA..OLD PERSIAN SIG
3485 103D0..103D5; DISALLOWED # OLD PERSIAN WORD DIVIDER..OLD PERSIAN NUMBER
3486 103D6..103FF; UNASSIGNED # <reserved>..<reserved>
3487 10400..10427; DISALLOWED # DESERET CAPITAL LETTER LONG I..DESERET CAPIT
3488 10428..1049D; PVALID # DESERET SMALL LETTER LONG I..OSMANYA LETTER
3489 1049E..1049F; UNASSIGNED # <reserved>..<reserved>
3490 104A0..104A9; PVALID # OSMANYA DIGIT ZERO..OSMANYA DIGIT NINE
3491 104AA..107FF; UNASSIGNED # <reserved>..<reserved>
3492 10800..10805; PVALID # CYPRIOT SYLLABLE A..CYPRIOT SYLLABLE JA
3493 10806..10807; UNASSIGNED # <reserved>..<reserved>
3494 10808 ; PVALID # CYPRIOT SYLLABLE JO
3495 10809 ; UNASSIGNED # <reserved>
3496 1080A..10835; PVALID # CYPRIOT SYLLABLE KA..CYPRIOT SYLLABLE WO
3497 10836 ; UNASSIGNED # <reserved>
3498 10837..10838; PVALID # CYPRIOT SYLLABLE XA..CYPRIOT SYLLABLE XE
3499 10839..1083B; UNASSIGNED # <reserved>..<reserved>
3500 1083C ; PVALID # CYPRIOT SYLLABLE ZA
3501 1083D..1083E; UNASSIGNED # <reserved>..<reserved>
3502 1083F..10855; PVALID # CYPRIOT SYLLABLE ZO..IMPERIAL ARAMAIC LETTER
3503 10856 ; UNASSIGNED # <reserved>
3504 10857..1085F; DISALLOWED # IMPERIAL ARAMAIC SECTION SIGN..IMPERIAL ARAM
3505 10860..108FF; UNASSIGNED # <reserved>..<reserved>
3506 10900..10915; PVALID # PHOENICIAN LETTER ALF..PHOENICIAN LETTER TAU
3507 10916..1091B; DISALLOWED # PHOENICIAN NUMBER ONE..PHOENICIAN NUMBER THR
3508 1091C..1091E; UNASSIGNED # <reserved>..<reserved>
3509 1091F ; DISALLOWED # PHOENICIAN WORD SEPARATOR
3510 10920..10939; PVALID # LYDIAN LETTER A..LYDIAN LETTER C
3511 1093A..1093E; UNASSIGNED # <reserved>..<reserved>
3512 1093F ; DISALLOWED # LYDIAN TRIANGULAR MARK
3513 10940..109FF; UNASSIGNED # <reserved>..<reserved>
3514
3515
3516
3517 Faltstrom Standards Track [Page 64]
3518 RFC 5892 IDNA Code Points August 2010
3519
3520
3521 10A00..10A03; PVALID # KHAROSHTHI LETTER A..KHAROSHTHI VOWEL SIGN V
3522 10A04 ; UNASSIGNED # <reserved>
3523 10A05..10A06; PVALID # KHAROSHTHI VOWEL SIGN E..KHAROSHTHI VOWEL SI
3524 10A07..10A0B; UNASSIGNED # <reserved>..<reserved>
3525 10A0C..10A13; PVALID # KHAROSHTHI VOWEL LENGTH MARK..KHAROSHTHI LET
3526 10A14 ; UNASSIGNED # <reserved>
3527 10A15..10A17; PVALID # KHAROSHTHI LETTER CA..KHAROSHTHI LETTER JA
3528 10A18 ; UNASSIGNED # <reserved>
3529 10A19..10A33; PVALID # KHAROSHTHI LETTER NYA..KHAROSHTHI LETTER TTT
3530 10A34..10A37; UNASSIGNED # <reserved>..<reserved>
3531 10A38..10A3A; PVALID # KHAROSHTHI SIGN BAR ABOVE..KHAROSHTHI SIGN D
3532 10A3B..10A3E; UNASSIGNED # <reserved>..<reserved>
3533 10A3F ; PVALID # KHAROSHTHI VIRAMA
3534 10A40..10A47; DISALLOWED # KHAROSHTHI DIGIT ONE..KHAROSHTHI NUMBER ONE
3535 10A48..10A4F; UNASSIGNED # <reserved>..<reserved>
3536 10A50..10A58; DISALLOWED # KHAROSHTHI PUNCTUATION DOT..KHAROSHTHI PUNCT
3537 10A59..10A5F; UNASSIGNED # <reserved>..<reserved>
3538 10A60..10A7C; PVALID # OLD SOUTH ARABIAN LETTER HE..OLD SOUTH ARABI
3539 10A7D..10A7F; DISALLOWED # OLD SOUTH ARABIAN NUMBER ONE..OLD SOUTH ARAB
3540 10A80..10AFF; UNASSIGNED # <reserved>..<reserved>
3541 10B00..10B35; PVALID # AVESTAN LETTER A..AVESTAN LETTER HE
3542 10B36..10B38; UNASSIGNED # <reserved>..<reserved>
3543 10B39..10B3F; DISALLOWED # AVESTAN ABBREVIATION MARK..LARGE ONE RING OV
3544 10B40..10B55; PVALID # INSCRIPTIONAL PARTHIAN LETTER ALEPH..INSCRIP
3545 10B56..10B57; UNASSIGNED # <reserved>..<reserved>
3546 10B58..10B5F; DISALLOWED # INSCRIPTIONAL PARTHIAN NUMBER ONE..INSCRIPTI
3547 10B60..10B72; PVALID # INSCRIPTIONAL PAHLAVI LETTER ALEPH..INSCRIPT
3548 10B73..10B77; UNASSIGNED # <reserved>..<reserved>
3549 10B78..10B7F; DISALLOWED # INSCRIPTIONAL PAHLAVI NUMBER ONE..INSCRIPTIO
3550 10B80..10BFF; UNASSIGNED # <reserved>..<reserved>
3551 10C00..10C48; PVALID # OLD TURKIC LETTER ORKHON A..OLD TURKIC LETTE
3552 10C49..10E5F; UNASSIGNED # <reserved>..<reserved>
3553 10E60..10E7E; DISALLOWED # RUMI DIGIT ONE..RUMI FRACTION TWO THIRDS
3554 10E7F..1107F; UNASSIGNED # <reserved>..<reserved>
3555 11080..110BA; PVALID # KAITHI SIGN CANDRABINDU..KAITHI SIGN NUKTA
3556 110BB..110C1; DISALLOWED # KAITHI ABBREVIATION SIGN..KAITHI DOUBLE DAND
3557 110C2..11FFF; UNASSIGNED # <reserved>..<reserved>
3558 12000..1236E; PVALID # CUNEIFORM SIGN A..CUNEIFORM SIGN ZUM
3559 1236F..123FF; UNASSIGNED # <reserved>..<reserved>
3560 12400..12462; DISALLOWED # CUNEIFORM NUMERIC SIGN TWO ASH..CUNEIFORM NU
3561 12463..1246F; UNASSIGNED # <reserved>..<reserved>
3562 12470..12473; DISALLOWED # CUNEIFORM PUNCTUATION SIGN OLD ASSYRIAN WORD
3563 12474..12FFF; UNASSIGNED # <reserved>..<reserved>
3564 13000..1342E; PVALID # EGYPTIAN HIEROGLYPH A001..EGYPTIAN HIEROGLYP
3565 1342F..1CFFF; UNASSIGNED # <reserved>..<reserved>
3566 1D000..1D0F5; DISALLOWED # BYZANTINE MUSICAL SYMBOL PSILI..BYZANTINE MU
3567 1D0F6..1D0FF; UNASSIGNED # <reserved>..<reserved>
3568 1D100..1D126; DISALLOWED # MUSICAL SYMBOL SINGLE BARLINE..MUSICAL SYMBO
3569
3570
3571
3572 Faltstrom Standards Track [Page 65]
3573 RFC 5892 IDNA Code Points August 2010
3574
3575
3576 1D127..1D128; UNASSIGNED # <reserved>..<reserved>
3577 1D129..1D1DD; DISALLOWED # MUSICAL SYMBOL MULTIPLE MEASURE REST..MUSICA
3578 1D1DE..1D1FF; UNASSIGNED # <reserved>..<reserved>
3579 1D200..1D245; DISALLOWED # GREEK VOCAL NOTATION SYMBOL-1..GREEK MUSICAL
3580 1D246..1D2FF; UNASSIGNED # <reserved>..<reserved>
3581 1D300..1D356; DISALLOWED # MONOGRAM FOR EARTH..TETRAGRAM FOR FOSTERING
3582 1D357..1D35F; UNASSIGNED # <reserved>..<reserved>
3583 1D360..1D371; DISALLOWED # COUNTING ROD UNIT DIGIT ONE..COUNTING ROD TE
3584 1D372..1D3FF; UNASSIGNED # <reserved>..<reserved>
3585 1D400..1D454; DISALLOWED # MATHEMATICAL BOLD CAPITAL A..MATHEMATICAL IT
3586 1D455 ; UNASSIGNED # <reserved>
3587 1D456..1D49C; DISALLOWED # MATHEMATICAL ITALIC SMALL I..MATHEMATICAL SC
3588 1D49D ; UNASSIGNED # <reserved>
3589 1D49E..1D49F; DISALLOWED # MATHEMATICAL SCRIPT CAPITAL C..MATHEMATICAL
3590 1D4A0..1D4A1; UNASSIGNED # <reserved>..<reserved>
3591 1D4A2 ; DISALLOWED # MATHEMATICAL SCRIPT CAPITAL G
3592 1D4A3..1D4A4; UNASSIGNED # <reserved>..<reserved>
3593 1D4A5..1D4A6; DISALLOWED # MATHEMATICAL SCRIPT CAPITAL J..MATHEMATICAL
3594 1D4A7..1D4A8; UNASSIGNED # <reserved>..<reserved>
3595 1D4A9..1D4AC; DISALLOWED # MATHEMATICAL SCRIPT CAPITAL N..MATHEMATICAL
3596 1D4AD ; UNASSIGNED # <reserved>
3597 1D4AE..1D4B9; DISALLOWED # MATHEMATICAL SCRIPT CAPITAL S..MATHEMATICAL
3598 1D4BA ; UNASSIGNED # <reserved>
3599 1D4BB ; DISALLOWED # MATHEMATICAL SCRIPT SMALL F
3600 1D4BC ; UNASSIGNED # <reserved>
3601 1D4BD..1D4C3; DISALLOWED # MATHEMATICAL SCRIPT SMALL H..MATHEMATICAL SC
3602 1D4C4 ; UNASSIGNED # <reserved>
3603 1D4C5..1D505; DISALLOWED # MATHEMATICAL SCRIPT SMALL P..MATHEMATICAL FR
3604 1D506 ; UNASSIGNED # <reserved>
3605 1D507..1D50A; DISALLOWED # MATHEMATICAL FRAKTUR CAPITAL D..MATHEMATICAL
3606 1D50B..1D50C; UNASSIGNED # <reserved>..<reserved>
3607 1D50D..1D514; DISALLOWED # MATHEMATICAL FRAKTUR CAPITAL J..MATHEMATICAL
3608 1D515 ; UNASSIGNED # <reserved>
3609 1D516..1D51C; DISALLOWED # MATHEMATICAL FRAKTUR CAPITAL S..MATHEMATICAL
3610 1D51D ; UNASSIGNED # <reserved>
3611 1D51E..1D539; DISALLOWED # MATHEMATICAL FRAKTUR SMALL A..MATHEMATICAL D
3612 1D53A ; UNASSIGNED # <reserved>
3613 1D53B..1D53E; DISALLOWED # MATHEMATICAL DOUBLE-STRUCK CAPITAL D..MATHEM
3614 1D53F ; UNASSIGNED # <reserved>
3615 1D540..1D544; DISALLOWED # MATHEMATICAL DOUBLE-STRUCK CAPITAL I..MATHEM
3616 1D545 ; UNASSIGNED # <reserved>
3617 1D546 ; DISALLOWED # MATHEMATICAL DOUBLE-STRUCK CAPITAL O
3618 1D547..1D549; UNASSIGNED # <reserved>..<reserved>
3619 1D54A..1D550; DISALLOWED # MATHEMATICAL DOUBLE-STRUCK CAPITAL S..MATHEM
3620 1D551 ; UNASSIGNED # <reserved>
3621 1D552..1D6A5; DISALLOWED # MATHEMATICAL DOUBLE-STRUCK SMALL A..MATHEMAT
3622 1D6A6..1D6A7; UNASSIGNED # <reserved>..<reserved>
3623 1D6A8..1D7CB; DISALLOWED # MATHEMATICAL BOLD CAPITAL ALPHA..MATHEMATICA
3624
3625
3626
3627 Faltstrom Standards Track [Page 66]
3628 RFC 5892 IDNA Code Points August 2010
3629
3630
3631 1D7CC..1D7CD; UNASSIGNED # <reserved>..<reserved>
3632 1D7CE..1D7FF; DISALLOWED # MATHEMATICAL BOLD DIGIT ZERO..MATHEMATICAL M
3633 1D800..1EFFF; UNASSIGNED # <reserved>..<reserved>
3634 1F000..1F02B; DISALLOWED # MAHJONG TILE EAST WIND..MAHJONG TILE BACK
3635 1F02C..1F02F; UNASSIGNED # <reserved>..<reserved>
3636 1F030..1F093; DISALLOWED # DOMINO TILE HORIZONTAL BACK..DOMINO TILE VER
3637 1F094..1F0FF; UNASSIGNED # <reserved>..<reserved>
3638 1F100..1F10A; DISALLOWED # DIGIT ZERO FULL STOP..DIGIT NINE COMMA
3639 1F10B..1F10F; UNASSIGNED # <reserved>..<reserved>
3640 1F110..1F12E; DISALLOWED # PARENTHESIZED LATIN CAPITAL LETTER A..CIRCLE
3641 1F12F..1F130; UNASSIGNED # <reserved>..<reserved>
3642 1F131 ; DISALLOWED # SQUARED LATIN CAPITAL LETTER B
3643 1F132..1F13C; UNASSIGNED # <reserved>..<reserved>
3644 1F13D ; DISALLOWED # SQUARED LATIN CAPITAL LETTER N
3645 1F13E ; UNASSIGNED # <reserved>
3646 1F13F ; DISALLOWED # SQUARED LATIN CAPITAL LETTER P
3647 1F140..1F141; UNASSIGNED # <reserved>..<reserved>
3648 1F142 ; DISALLOWED # SQUARED LATIN CAPITAL LETTER S
3649 1F143..1F145; UNASSIGNED # <reserved>..<reserved>
3650 1F146 ; DISALLOWED # SQUARED LATIN CAPITAL LETTER W
3651 1F147..1F149; UNASSIGNED # <reserved>..<reserved>
3652 1F14A..1F14E; DISALLOWED # SQUARED HV..SQUARED PPV
3653 1F14F..1F156; UNASSIGNED # <reserved>..<reserved>
3654 1F157 ; DISALLOWED # NEGATIVE CIRCLED LATIN CAPITAL LETTER H
3655 1F158..1F15E; UNASSIGNED # <reserved>..<reserved>
3656 1F15F ; DISALLOWED # NEGATIVE CIRCLED LATIN CAPITAL LETTER P
3657 1F160..1F178; UNASSIGNED # <reserved>..<reserved>
3658 1F179 ; DISALLOWED # NEGATIVE SQUARED LATIN CAPITAL LETTER J
3659 1F17A ; UNASSIGNED # <reserved>
3660 1F17B..1F17C; DISALLOWED # NEGATIVE SQUARED LATIN CAPITAL LETTER L..NEG
3661 1F17D..1F17E; UNASSIGNED # <reserved>..<reserved>
3662 1F17F ; DISALLOWED # NEGATIVE SQUARED LATIN CAPITAL LETTER P
3663 1F180..1F189; UNASSIGNED # <reserved>..<reserved>
3664 1F18A..1F18D; DISALLOWED # CROSSED NEGATIVE SQUARED LATIN CAPITAL LETTE
3665 1F18E..1F18F; UNASSIGNED # <reserved>..<reserved>
3666 1F190 ; DISALLOWED # SQUARE DJ
3667 1F191..1F1FF; UNASSIGNED # <reserved>..<reserved>
3668 1F200 ; DISALLOWED # SQUARE HIRAGANA HOKA
3669 1F201..1F20F; UNASSIGNED # <reserved>..<reserved>
3670 1F210..1F231; DISALLOWED # SQUARED CJK UNIFIED IDEOGRAPH-624B..SQUARED
3671 1F232..1F23F; UNASSIGNED # <reserved>..<reserved>
3672 1F240..1F248; DISALLOWED # TORTOISE SHELL BRACKETED CJK UNIFIED IDEOGRA
3673 1F249..1FFFD; UNASSIGNED # <reserved>..<reserved>
3674 1FFFE..1FFFF; DISALLOWED # <noncharacter>..<noncharacter>
3675 20000..2A6D6; PVALID # <CJK Ideograph Extension B>..<CJK Ideograph
3676 2A6D7..2A6FF; UNASSIGNED # <reserved>..<reserved>
3677 2A700..2B734; PVALID # <CJK Ideograph Extension C>..<CJK Ideograph
3678 2B735..2F7FF; UNASSIGNED # <reserved>..<reserved>
3679
3680
3681
3682 Faltstrom Standards Track [Page 67]
3683 RFC 5892 IDNA Code Points August 2010
3684
3685
3686 2F800..2FA1D; DISALLOWED # CJK COMPATIBILITY IDEOGRAPH-2F800..CJK COMPA
3687 2FA1E..2FFFD; UNASSIGNED # <reserved>..<reserved>
3688 2FFFE..2FFFF; DISALLOWED # <noncharacter>..<noncharacter>
3689 30000..3FFFD; UNASSIGNED # <reserved>..<reserved>
3690 3FFFE..3FFFF; DISALLOWED # <noncharacter>..<noncharacter>
3691 40000..4FFFD; UNASSIGNED # <reserved>..<reserved>
3692 4FFFE..4FFFF; DISALLOWED # <noncharacter>..<noncharacter>
3693 50000..5FFFD; UNASSIGNED # <reserved>..<reserved>
3694 5FFFE..5FFFF; DISALLOWED # <noncharacter>..<noncharacter>
3695 60000..6FFFD; UNASSIGNED # <reserved>..<reserved>
3696 6FFFE..6FFFF; DISALLOWED # <noncharacter>..<noncharacter>
3697 70000..7FFFD; UNASSIGNED # <reserved>..<reserved>
3698 7FFFE..7FFFF; DISALLOWED # <noncharacter>..<noncharacter>
3699 80000..8FFFD; UNASSIGNED # <reserved>..<reserved>
3700 8FFFE..8FFFF; DISALLOWED # <noncharacter>..<noncharacter>
3701 90000..9FFFD; UNASSIGNED # <reserved>..<reserved>
3702 9FFFE..9FFFF; DISALLOWED # <noncharacter>..<noncharacter>
3703 A0000..AFFFD; UNASSIGNED # <reserved>..<reserved>
3704 AFFFE..AFFFF; DISALLOWED # <noncharacter>..<noncharacter>
3705 B0000..BFFFD; UNASSIGNED # <reserved>..<reserved>
3706 BFFFE..BFFFF; DISALLOWED # <noncharacter>..<noncharacter>
3707 C0000..CFFFD; UNASSIGNED # <reserved>..<reserved>
3708 CFFFE..CFFFF; DISALLOWED # <noncharacter>..<noncharacter>
3709 D0000..DFFFD; UNASSIGNED # <reserved>..<reserved>
3710 DFFFE..DFFFF; DISALLOWED # <noncharacter>..<noncharacter>
3711 E0000 ; UNASSIGNED # <reserved>
3712 E0001 ; DISALLOWED # LANGUAGE TAG
3713 E0002..E001F; UNASSIGNED # <reserved>..<reserved>
3714 E0020..E007F; DISALLOWED # TAG SPACE..CANCEL TAG
3715 E0080..E00FF; UNASSIGNED # <reserved>..<reserved>
3716 E0100..E01EF; DISALLOWED # VARIATION SELECTOR-17..VARIATION SELECTOR-25
3717 E01F0..EFFFD; UNASSIGNED # <reserved>..<reserved>
3718 EFFFE..10FFFF; DISALLOWED # <noncharacter>..<noncharacter>
3719
3720
3721
3722
3723
3724
3725
3726
3727
3728
3729
3730
3731
3732
3733
3734
3735
3736
3737 Faltstrom Standards Track [Page 68]
3738 RFC 5892 IDNA Code Points August 2010
3739
3740
3741 8. References
3742
3743 8.1. Normative References
3744
3745 [RFC2119] Bradner, S., "Key words for use in RFCs to Indicate
3746 Requirement Levels", BCP 14, RFC 2119, March 1997.
3747
3748 [TR15] Davis, M. and M. Duerst, "Unicode Standard Annex #15,
3749 Unicode Normalization Forms, an integral part of the
3750 Unicode Standard",
3751 <http://unicode.org/unicode/reports/tr15/>.
3752
3753 [Unicode] The Unicode Consortium, "The Unicode Standard, Version
3754 5.0", 2007. Boston, MA, USA: Addison-Wesley. ISBN
3755 0-321-48091-0. This printed reference has now been
3756 updated online to reflect additional code points. For
3757 code points, the reference at the time this document was
3758 published is to Unicode 5.2.
3759
3760 [Unicode52] The Unicode Consortium. The Unicode Standard, Version
3761 5.2.0, defined by: "The Unicode Standard, Version
3762 5.2.0", (Mountain View, CA: The Unicode Consortium,
3763 2009. ISBN 978-1-936213-00-9).
3764 <http://www.unicode.org/versions/Unicode5.2.0/>.
3765
3766 8.2. Informative References
3767
3768 [BlockNames] "Blocks-5.2.0.txt", Unicode Character Database,
3769 May 2009,
3770 <http://unicode.org/Public/5.2.0/ucd/Blocks.txt>.
3771
3772 [DerivedCoreProperties]
3773 "DerivedCoreProperties-5.2.0.txt", Unicode Character
3774 Database, August 2009, <http://unicode.org/Public/5.2.0/
3775 ucd/DerivedCoreProperties.txt>.
3776
3777 [RFC3454] Hoffman, P. and M. Blanchet, "Preparation of
3778 Internationalized Strings ("stringprep")", RFC 3454,
3779 December 2002.
3780
3781 [RFC3491] Hoffman, P. and M. Blanchet, "Nameprep: A Stringprep
3782 Profile for Internationalized Domain Names (IDN)",
3783 RFC 3491, March 2003.
3784
3785 [RFC4690] Klensin, J., Faltstrom, P., Karp, C., and IAB, "Review
3786 and Recommendations for Internationalized Domain Names
3787 (IDNs)", RFC 4690, September 2006.
3788
3789
3790
3791
3792 Faltstrom Standards Track [Page 69]
3793 RFC 5892 IDNA Code Points August 2010
3794
3795
3796 [RFC5226] Narten, T. and H. Alvestrand, "Guidelines for Writing an
3797 IANA Considerations Section in RFCs", BCP 26, RFC 5226,
3798 May 2008.
3799
3800 [RFC5890] Klensin, J., "Internationalized Domain Names for
3801 Applications (IDNA): Definitions and Document
3802 Framework", RFC 5890, August 2010.
3803
3804 [RFC5891] Klensin, J., "Internationalized Domain Names in
3805 Applications (IDNA): Protocol", RFC 5891, August 2010.
3806
3807 [RFC5893] Alvestrand, H., Ed. and C. Karp, "Right-to-Left Scripts
3808 for Internationalized Domain Names for Applications
3809 (IDNA)", RFC 5893, August 2010.
3810
3811 [RFC5894] Klensin, J., "Internationalized Domain Names for
3812 Applications (IDNA): Background, Explanation, and
3813 Rationale", RFC 5894, August 2010.
3814
3815 Author's Address
3816
3817 Patrik Faltstrom (editor)
3818 Cisco
3819
3820 EMail: paf@cisco.com
3821
3822
3823
3824
3825
3826
3827
3828
3829
3830
3831
3832
3833
3834
3835
3836
3837
3838
3839
3840
3841
3842
3843
3844
3845
3846
3847 Faltstrom Standards Track [Page 70]
3848
In A.1: Rule Set: False; If Canonical_Combining_Class(Before(cp)) .eq. Virama Then True; If RegExpMatch((Joining_Type:{L,D})(Joining_Type:T)*\u200C (Joining_Type:T)*(Joining_Type:{R,D})) Then True;
In A.1: Rule Set: False; If Canonical_Combining_Class(Before(cp)) .eq. Virama Then True;If RegExpMatch((Joining_Type:{L,D})(Joining_Type:T)*\u200CIf cp .eq. \u200C And RegExpMatch((Joining_Type:{L,D})(Joining_Type:T)*cp (Joining_Type:T)*(Joining_Type:{R,D})) Then True;