Skip to content

Commit 40fe9f2

Browse files
authored
ヨリ (#1027)
* UnicodeData.txt line from L2/24-279 * LineBreak.txt line from L2/24-279 * Katakana * Regenerate UCD * Failing test * ea=W * Regenerate UCD * end; * Ignore IDNA2008_Category
1 parent 2307d48 commit 40fe9f2

19 files changed

+95
-90
lines changed

unicodetools/data/ucd/dev/DerivedAge.txt

Lines changed: 3 additions & 4 deletions
Original file line numberDiff line numberDiff line change
@@ -1,5 +1,5 @@
11
# DerivedAge-18.0.0.txt
2-
# Date: 2025-11-28, 15:46:32 GMT
2+
# Date: 2025-11-28, 16:09:49 GMT
33
# © 2025 Unicode®, Inc.
44
# Unicode and the Unicode Logo are registered trademarks of Unicode, Inc. in the U.S. and other countries.
55
# For terms of use and license, see https://www.unicode.org/terms_of_use.html
@@ -2144,8 +2144,7 @@ FDC8..FDCE ; 17.0 # [7] ARABIC LIGATURE RAHIMAHU ALLAAH TAAALAA..ARABIC LIG
21442144
18D1F..18D20 ; 18.0 # [2] TANGUT IDEOGRAPH-18D1F..TANGUT IDEOGRAPH-18D20
21452145
18E00..19191 ; 18.0 # [914] JURCHEN CHARACTER-18E00..JURCHEN CHARACTER-19191
21462146
191A0..191D2 ; 18.0 # [51] JURCHEN RADICAL-01..JURCHEN RADICAL-51
2147-
1B123..1B125 ; 18.0 # [3] HIRAGANA DIGRAPH KOTO..KATAKANA DIGRAPH TOTE
2148-
1B127..1B128 ; 18.0 # [2] KATAKANA LETTER ALTERNATE NE..KATAKANA LETTER ALTERNATE WI
2147+
1B123..1B128 ; 18.0 # [6] HIRAGANA DIGRAPH KOTO..KATAKANA LETTER ALTERNATE WI
21492148
1B168 ; 18.0 # KATAKANA LETTER SMALL ARCHAIC YE
21502149
1DF1F..1DF24 ; 18.0 # [6] LATIN SMALL LETTER D-ETH DIGRAPH..LATIN SMALL LETTER T-THETA DIGRAPH
21512150
1DF2B..1DF56 ; 18.0 # [44] LATIN SMALL LETTER DEZH DIGRAPH WITH CURL..LATIN LETTER GLOTTAL STOP WITH DOUBLE STROKE
@@ -2155,6 +2154,6 @@ FDC8..FDCE ; 17.0 # [7] ARABIC LIGATURE RAHIMAHU ALLAAH TAAALAA..ARABIC LIG
21552154
2B81E ; 18.0 # CJK UNIFIED IDEOGRAPH-2B81E
21562155
3D000..3FC3F ; 18.0 # [11328] SEAL CHARACTER-3D000..SEAL CHARACTER-3FC3F
21572156

2158-
# Total code points: 12831
2157+
# Total code points: 12832
21592158

21602159
# EOF

unicodetools/data/ucd/dev/DerivedCoreProperties.txt

Lines changed: 13 additions & 19 deletions
Original file line numberDiff line numberDiff line change
@@ -1,5 +1,5 @@
11
# DerivedCoreProperties-18.0.0.txt
2-
# Date: 2025-11-28, 15:46:55 GMT
2+
# Date: 2025-11-28, 16:10:12 GMT
33
# © 2025 Unicode®, Inc.
44
# Unicode and the Unicode Logo are registered trademarks of Unicode, Inc. in the U.S. and other countries.
55
# For terms of use and license, see https://www.unicode.org/terms_of_use.html
@@ -1352,8 +1352,7 @@ FFDA..FFDC ; Alphabetic # Lo [3] HALFWIDTH HANGUL LETTER EU..HALFWIDTH HANG
13521352
1AFF0..1AFF3 ; Alphabetic # Lm [4] KATAKANA LETTER MINNAN TONE-2..KATAKANA LETTER MINNAN TONE-5
13531353
1AFF5..1AFFB ; Alphabetic # Lm [7] KATAKANA LETTER MINNAN TONE-7..KATAKANA LETTER MINNAN NASALIZED TONE-5
13541354
1AFFD..1AFFE ; Alphabetic # Lm [2] KATAKANA LETTER MINNAN NASALIZED TONE-7..KATAKANA LETTER MINNAN NASALIZED TONE-8
1355-
1B000..1B125 ; Alphabetic # Lo [294] KATAKANA LETTER ARCHAIC E..KATAKANA DIGRAPH TOTE
1356-
1B127..1B128 ; Alphabetic # Lo [2] KATAKANA LETTER ALTERNATE NE..KATAKANA LETTER ALTERNATE WI
1355+
1B000..1B128 ; Alphabetic # Lo [297] KATAKANA LETTER ARCHAIC E..KATAKANA LETTER ALTERNATE WI
13571356
1B132 ; Alphabetic # Lo HIRAGANA LETTER SMALL KO
13581357
1B150..1B152 ; Alphabetic # Lo [3] HIRAGANA LETTER SMALL WI..HIRAGANA LETTER SMALL WO
13591358
1B155 ; Alphabetic # Lo KATAKANA LETTER SMALL KO
@@ -1479,7 +1478,7 @@ FFDA..FFDC ; Alphabetic # Lo [3] HALFWIDTH HANGUL LETTER EU..HALFWIDTH HANG
14791478
31350..33479 ; Alphabetic # Lo [8490] CJK UNIFIED IDEOGRAPH-31350..CJK UNIFIED IDEOGRAPH-33479
14801479
3D000..3FC3F ; Alphabetic # Lo [11328] SEAL CHARACTER-3D000..SEAL CHARACTER-3FC3F
14811480

1482-
# Total code points: 160215
1481+
# Total code points: 160216
14831482

14841483
# ================================================
14851484

@@ -6992,8 +6991,7 @@ FFDA..FFDC ; ID_Start # Lo [3] HALFWIDTH HANGUL LETTER EU..HALFWIDTH HANGUL
69926991
1AFF0..1AFF3 ; ID_Start # Lm [4] KATAKANA LETTER MINNAN TONE-2..KATAKANA LETTER MINNAN TONE-5
69936992
1AFF5..1AFFB ; ID_Start # Lm [7] KATAKANA LETTER MINNAN TONE-7..KATAKANA LETTER MINNAN NASALIZED TONE-5
69946993
1AFFD..1AFFE ; ID_Start # Lm [2] KATAKANA LETTER MINNAN NASALIZED TONE-7..KATAKANA LETTER MINNAN NASALIZED TONE-8
6995-
1B000..1B125 ; ID_Start # Lo [294] KATAKANA LETTER ARCHAIC E..KATAKANA DIGRAPH TOTE
6996-
1B127..1B128 ; ID_Start # Lo [2] KATAKANA LETTER ALTERNATE NE..KATAKANA LETTER ALTERNATE WI
6994+
1B000..1B128 ; ID_Start # Lo [297] KATAKANA LETTER ARCHAIC E..KATAKANA LETTER ALTERNATE WI
69976995
1B132 ; ID_Start # Lo HIRAGANA LETTER SMALL KO
69986996
1B150..1B152 ; ID_Start # Lo [3] HIRAGANA LETTER SMALL WI..HIRAGANA LETTER SMALL WO
69996997
1B155 ; ID_Start # Lo KATAKANA LETTER SMALL KO
@@ -7104,7 +7102,7 @@ FFDA..FFDC ; ID_Start # Lo [3] HALFWIDTH HANGUL LETTER EU..HALFWIDTH HANGUL
71047102
31350..33479 ; ID_Start # Lo [8490] CJK UNIFIED IDEOGRAPH-31350..CJK UNIFIED IDEOGRAPH-33479
71057103
3D000..3FC3F ; ID_Start # Lo [11328] SEAL CHARACTER-3D000..SEAL CHARACTER-3FC3F
71067104

7107-
# Total code points: 158707
7105+
# Total code points: 158708
71087106

71097107
# ================================================
71107108

@@ -8397,8 +8395,7 @@ FFDA..FFDC ; ID_Continue # Lo [3] HALFWIDTH HANGUL LETTER EU..HALFWIDTH HAN
83978395
1AFF0..1AFF3 ; ID_Continue # Lm [4] KATAKANA LETTER MINNAN TONE-2..KATAKANA LETTER MINNAN TONE-5
83988396
1AFF5..1AFFB ; ID_Continue # Lm [7] KATAKANA LETTER MINNAN TONE-7..KATAKANA LETTER MINNAN NASALIZED TONE-5
83998397
1AFFD..1AFFE ; ID_Continue # Lm [2] KATAKANA LETTER MINNAN NASALIZED TONE-7..KATAKANA LETTER MINNAN NASALIZED TONE-8
8400-
1B000..1B125 ; ID_Continue # Lo [294] KATAKANA LETTER ARCHAIC E..KATAKANA DIGRAPH TOTE
8401-
1B127..1B128 ; ID_Continue # Lo [2] KATAKANA LETTER ALTERNATE NE..KATAKANA LETTER ALTERNATE WI
8398+
1B000..1B128 ; ID_Continue # Lo [297] KATAKANA LETTER ARCHAIC E..KATAKANA LETTER ALTERNATE WI
84028399
1B132 ; ID_Continue # Lo HIRAGANA LETTER SMALL KO
84038400
1B150..1B152 ; ID_Continue # Lo [3] HIRAGANA LETTER SMALL WI..HIRAGANA LETTER SMALL WO
84048401
1B155 ; ID_Continue # Lo KATAKANA LETTER SMALL KO
@@ -8551,7 +8548,7 @@ FFDA..FFDC ; ID_Continue # Lo [3] HALFWIDTH HANGUL LETTER EU..HALFWIDTH HAN
85518548
3D000..3FC3F ; ID_Continue # Lo [11328] SEAL CHARACTER-3D000..SEAL CHARACTER-3FC3F
85528549
E0100..E01EF ; ID_Continue # Mn [240] VARIATION SELECTOR-17..VARIATION SELECTOR-256
85538550

8554-
# Total code points: 162053
8551+
# Total code points: 162054
85558552

85568553
# ================================================
85578554

@@ -9241,8 +9238,7 @@ FFDA..FFDC ; XID_Start # Lo [3] HALFWIDTH HANGUL LETTER EU..HALFWIDTH HANGU
92419238
1AFF0..1AFF3 ; XID_Start # Lm [4] KATAKANA LETTER MINNAN TONE-2..KATAKANA LETTER MINNAN TONE-5
92429239
1AFF5..1AFFB ; XID_Start # Lm [7] KATAKANA LETTER MINNAN TONE-7..KATAKANA LETTER MINNAN NASALIZED TONE-5
92439240
1AFFD..1AFFE ; XID_Start # Lm [2] KATAKANA LETTER MINNAN NASALIZED TONE-7..KATAKANA LETTER MINNAN NASALIZED TONE-8
9244-
1B000..1B125 ; XID_Start # Lo [294] KATAKANA LETTER ARCHAIC E..KATAKANA DIGRAPH TOTE
9245-
1B127..1B128 ; XID_Start # Lo [2] KATAKANA LETTER ALTERNATE NE..KATAKANA LETTER ALTERNATE WI
9241+
1B000..1B128 ; XID_Start # Lo [297] KATAKANA LETTER ARCHAIC E..KATAKANA LETTER ALTERNATE WI
92469242
1B132 ; XID_Start # Lo HIRAGANA LETTER SMALL KO
92479243
1B150..1B152 ; XID_Start # Lo [3] HIRAGANA LETTER SMALL WI..HIRAGANA LETTER SMALL WO
92489244
1B155 ; XID_Start # Lo KATAKANA LETTER SMALL KO
@@ -9353,7 +9349,7 @@ FFDA..FFDC ; XID_Start # Lo [3] HALFWIDTH HANGUL LETTER EU..HALFWIDTH HANGU
93539349
31350..33479 ; XID_Start # Lo [8490] CJK UNIFIED IDEOGRAPH-31350..CJK UNIFIED IDEOGRAPH-33479
93549350
3D000..3FC3F ; XID_Start # Lo [11328] SEAL CHARACTER-3D000..SEAL CHARACTER-3FC3F
93559351

9356-
# Total code points: 158684
9352+
# Total code points: 158685
93579353

93589354
# ================================================
93599355

@@ -10647,8 +10643,7 @@ FFDA..FFDC ; XID_Continue # Lo [3] HALFWIDTH HANGUL LETTER EU..HALFWIDTH HA
1064710643
1AFF0..1AFF3 ; XID_Continue # Lm [4] KATAKANA LETTER MINNAN TONE-2..KATAKANA LETTER MINNAN TONE-5
1064810644
1AFF5..1AFFB ; XID_Continue # Lm [7] KATAKANA LETTER MINNAN TONE-7..KATAKANA LETTER MINNAN NASALIZED TONE-5
1064910645
1AFFD..1AFFE ; XID_Continue # Lm [2] KATAKANA LETTER MINNAN NASALIZED TONE-7..KATAKANA LETTER MINNAN NASALIZED TONE-8
10650-
1B000..1B125 ; XID_Continue # Lo [294] KATAKANA LETTER ARCHAIC E..KATAKANA DIGRAPH TOTE
10651-
1B127..1B128 ; XID_Continue # Lo [2] KATAKANA LETTER ALTERNATE NE..KATAKANA LETTER ALTERNATE WI
10646+
1B000..1B128 ; XID_Continue # Lo [297] KATAKANA LETTER ARCHAIC E..KATAKANA LETTER ALTERNATE WI
1065210647
1B132 ; XID_Continue # Lo HIRAGANA LETTER SMALL KO
1065310648
1B150..1B152 ; XID_Continue # Lo [3] HIRAGANA LETTER SMALL WI..HIRAGANA LETTER SMALL WO
1065410649
1B155 ; XID_Continue # Lo KATAKANA LETTER SMALL KO
@@ -10801,7 +10796,7 @@ FFDA..FFDC ; XID_Continue # Lo [3] HALFWIDTH HANGUL LETTER EU..HALFWIDTH HA
1080110796
3D000..3FC3F ; XID_Continue # Lo [11328] SEAL CHARACTER-3D000..SEAL CHARACTER-3FC3F
1080210797
E0100..E01EF ; XID_Continue # Mn [240] VARIATION SELECTOR-17..VARIATION SELECTOR-256
1080310798

10804-
# Total code points: 162034
10799+
# Total code points: 162035
1080510800

1080610801
# ================================================
1080710802

@@ -12891,8 +12886,7 @@ FFFC..FFFD ; Grapheme_Base # So [2] OBJECT REPLACEMENT CHARACTER..REPLACEME
1289112886
1AFF0..1AFF3 ; Grapheme_Base # Lm [4] KATAKANA LETTER MINNAN TONE-2..KATAKANA LETTER MINNAN TONE-5
1289212887
1AFF5..1AFFB ; Grapheme_Base # Lm [7] KATAKANA LETTER MINNAN TONE-7..KATAKANA LETTER MINNAN NASALIZED TONE-5
1289312888
1AFFD..1AFFE ; Grapheme_Base # Lm [2] KATAKANA LETTER MINNAN NASALIZED TONE-7..KATAKANA LETTER MINNAN NASALIZED TONE-8
12894-
1B000..1B125 ; Grapheme_Base # Lo [294] KATAKANA LETTER ARCHAIC E..KATAKANA DIGRAPH TOTE
12895-
1B127..1B128 ; Grapheme_Base # Lo [2] KATAKANA LETTER ALTERNATE NE..KATAKANA LETTER ALTERNATE WI
12889+
1B000..1B128 ; Grapheme_Base # Lo [297] KATAKANA LETTER ARCHAIC E..KATAKANA LETTER ALTERNATE WI
1289612890
1B132 ; Grapheme_Base # Lo HIRAGANA LETTER SMALL KO
1289712891
1B150..1B152 ; Grapheme_Base # Lo [3] HIRAGANA LETTER SMALL WI..HIRAGANA LETTER SMALL WO
1289812892
1B155 ; Grapheme_Base # Lo KATAKANA LETTER SMALL KO
@@ -13104,7 +13098,7 @@ FFFC..FFFD ; Grapheme_Base # So [2] OBJECT REPLACEMENT CHARACTER..REPLACEME
1310413098
31350..33479 ; Grapheme_Base # Lo [8490] CJK UNIFIED IDEOGRAPH-31350..CJK UNIFIED IDEOGRAPH-33479
1310513099
3D000..3FC3F ; Grapheme_Base # Lo [11328] SEAL CHARACTER-3D000..SEAL CHARACTER-3FC3F
1310613100

13107-
# Total code points: 170313
13101+
# Total code points: 170314
1310813102

1310913103
# ================================================
1311013104

unicodetools/data/ucd/dev/DerivedNormalizationProps.txt

Lines changed: 15 additions & 13 deletions
Original file line numberDiff line numberDiff line change
@@ -1,5 +1,5 @@
11
# DerivedNormalizationProps-18.0.0.txt
2-
# Date: 2025-11-28, 15:46:59 GMT
2+
# Date: 2025-11-28, 16:10:16 GMT
33
# © 2025 Unicode®, Inc.
44
# Unicode and the Unicode Logo are registered trademarks of Unicode, Inc. in the U.S. and other countries.
55
# For terms of use and license, see https://www.unicode.org/terms_of_use.html
@@ -1665,7 +1665,7 @@ FFED..FFEE ; NFKD_QC; N # So [2] HALFWIDTH BLACK SQUARE..HALFWIDTH WHITE CI
16651665
11938 ; NFKD_QC; N # Mc DIVES AKURU VOWEL SIGN O
16661666
16121..16128 ; NFKD_QC; N # Mn [8] GURUNG KHEMA VOWEL SIGN U..GURUNG KHEMA VOWEL SIGN AU
16671667
16D68..16D6A ; NFKD_QC; N # Lo [3] KIRAT RAI VOWEL SIGN AI..KIRAT RAI VOWEL SIGN AU
1668-
1B123..1B125 ; NFKD_QC; N # Lo [3] HIRAGANA DIGRAPH KOTO..KATAKANA DIGRAPH TOTE
1668+
1B123..1B126 ; NFKD_QC; N # Lo [4] HIRAGANA DIGRAPH KOTO..KATAKANA DIGRAPH YORI
16691669
1CCD6..1CCEF ; NFKD_QC; N # So [26] OUTLINED LATIN CAPITAL LETTER A..OUTLINED LATIN CAPITAL LETTER Z
16701670
1CCF0..1CCF9 ; NFKD_QC; N # Nd [10] OUTLINED DIGIT ZERO..OUTLINED DIGIT NINE
16711671
1D15E..1D164 ; NFKD_QC; N # So [7] MUSICAL SYMBOL HALF NOTE..MUSICAL SYMBOL ONE HUNDRED TWENTY-EIGHTH NOTE
@@ -1758,7 +1758,7 @@ FFED..FFEE ; NFKD_QC; N # So [2] HALFWIDTH BLACK SQUARE..HALFWIDTH WHITE CI
17581758
1FBF0..1FBF9 ; NFKD_QC; N # Nd [10] SEGMENTED DIGIT ZERO..SEGMENTED DIGIT NINE
17591759
2F800..2FA1D ; NFKD_QC; N # Lo [542] CJK COMPATIBILITY IDEOGRAPH-2F800..CJK COMPATIBILITY IDEOGRAPH-2FA1D
17601760

1761-
# Total code points: 17148
1761+
# Total code points: 17149
17621762

17631763
# ================================================
17641764

@@ -2080,7 +2080,7 @@ FFED..FFEE ; NFKC_QC; N # So [2] HALFWIDTH BLACK SQUARE..HALFWIDTH WHITE CI
20802080
10781..10785 ; NFKC_QC; N # Lm [5] MODIFIER LETTER SUPERSCRIPT TRIANGULAR COLON..MODIFIER LETTER SMALL B WITH HOOK
20812081
10787..107B0 ; NFKC_QC; N # Lm [42] MODIFIER LETTER SMALL DZ DIGRAPH..MODIFIER LETTER SMALL V WITH RIGHT HOOK
20822082
107B2..107BF ; NFKC_QC; N # Lm [14] MODIFIER LETTER SMALL CAPITAL Y..MODIFIER LETTER SMALL ESH WITH DOUBLE BAR
2083-
1B123..1B125 ; NFKC_QC; N # Lo [3] HIRAGANA DIGRAPH KOTO..KATAKANA DIGRAPH TOTE
2083+
1B123..1B126 ; NFKC_QC; N # Lo [4] HIRAGANA DIGRAPH KOTO..KATAKANA DIGRAPH YORI
20842084
1CCD6..1CCEF ; NFKC_QC; N # So [26] OUTLINED LATIN CAPITAL LETTER A..OUTLINED LATIN CAPITAL LETTER Z
20852085
1CCF0..1CCF9 ; NFKC_QC; N # Nd [10] OUTLINED DIGIT ZERO..OUTLINED DIGIT NINE
20862086
1D15E..1D164 ; NFKC_QC; N # So [7] MUSICAL SYMBOL HALF NOTE..MUSICAL SYMBOL ONE HUNDRED TWENTY-EIGHTH NOTE
@@ -2173,7 +2173,7 @@ FFED..FFEE ; NFKC_QC; N # So [2] HALFWIDTH BLACK SQUARE..HALFWIDTH WHITE CI
21732173
1FBF0..1FBF9 ; NFKC_QC; N # Nd [10] SEGMENTED DIGIT ZERO..SEGMENTED DIGIT NINE
21742174
2F800..2FA1D ; NFKC_QC; N # Lo [542] CJK COMPATIBILITY IDEOGRAPH-2F800..CJK COMPATIBILITY IDEOGRAPH-2FA1D
21752175

2176-
# Total code points: 5027
2176+
# Total code points: 5028
21772177

21782178
# ================================================
21792179

@@ -2835,7 +2835,7 @@ FFE3 ; Expands_On_NFKD # Sk FULLWIDTH MACRON
28352835
11938 ; Expands_On_NFKD # Mc DIVES AKURU VOWEL SIGN O
28362836
16121..16128 ; Expands_On_NFKD # Mn [8] GURUNG KHEMA VOWEL SIGN U..GURUNG KHEMA VOWEL SIGN AU
28372837
16D68..16D6A ; Expands_On_NFKD # Lo [3] KIRAT RAI VOWEL SIGN AI..KIRAT RAI VOWEL SIGN AU
2838-
1B123..1B125 ; Expands_On_NFKD # Lo [3] HIRAGANA DIGRAPH KOTO..KATAKANA DIGRAPH TOTE
2838+
1B123..1B126 ; Expands_On_NFKD # Lo [4] HIRAGANA DIGRAPH KOTO..KATAKANA DIGRAPH YORI
28392839
1D15E..1D164 ; Expands_On_NFKD # So [7] MUSICAL SYMBOL HALF NOTE..MUSICAL SYMBOL ONE HUNDRED TWENTY-EIGHTH NOTE
28402840
1D1BB..1D1C0 ; Expands_On_NFKD # So [6] MUSICAL SYMBOL MINIMA..MUSICAL SYMBOL FUSA BLACK
28412841
1F100..1F10A ; Expands_On_NFKD # No [11] DIGIT ZERO FULL STOP..DIGIT NINE COMMA
@@ -2848,7 +2848,7 @@ FFE3 ; Expands_On_NFKD # Sk FULLWIDTH MACRON
28482848
1F213 ; Expands_On_NFKD # So SQUARED KATAKANA DE
28492849
1F240..1F248 ; Expands_On_NFKD # So [9] TORTOISE SHELL BRACKETED CJK UNIFIED IDEOGRAPH-672C..TORTOISE SHELL BRACKETED CJK UNIFIED IDEOGRAPH-6557
28502850

2851-
# Total code points: 13413
2851+
# Total code points: 13414
28522852

28532853
# ================================================
28542854

@@ -2975,7 +2975,7 @@ FE74 ; Expands_On_NFKC # Lo ARABIC KASRATAN ISOLATED FORM
29752975
FE76..FE7F ; Expands_On_NFKC # Lo [10] ARABIC FATHA ISOLATED FORM..ARABIC SUKUN MEDIAL FORM
29762976
FEF5..FEFC ; Expands_On_NFKC # Lo [8] ARABIC LIGATURE LAM WITH ALEF WITH MADDA ABOVE ISOLATED FORM..ARABIC LIGATURE LAM WITH ALEF FINAL FORM
29772977
FFE3 ; Expands_On_NFKC # Sk FULLWIDTH MACRON
2978-
1B123..1B125 ; Expands_On_NFKC # Lo [3] HIRAGANA DIGRAPH KOTO..KATAKANA DIGRAPH TOTE
2978+
1B123..1B126 ; Expands_On_NFKC # Lo [4] HIRAGANA DIGRAPH KOTO..KATAKANA DIGRAPH YORI
29792979
1D15E..1D164 ; Expands_On_NFKC # So [7] MUSICAL SYMBOL HALF NOTE..MUSICAL SYMBOL ONE HUNDRED TWENTY-EIGHTH NOTE
29802980
1D1BB..1D1C0 ; Expands_On_NFKC # So [6] MUSICAL SYMBOL MINIMA..MUSICAL SYMBOL FUSA BLACK
29812981
1F100..1F10A ; Expands_On_NFKC # No [11] DIGIT ZERO FULL STOP..DIGIT NINE COMMA
@@ -2987,7 +2987,7 @@ FFE3 ; Expands_On_NFKC # Sk FULLWIDTH MACRON
29872987
1F200..1F201 ; Expands_On_NFKC # So [2] SQUARE HIRAGANA HOKA..SQUARED KATAKANA KOKO
29882988
1F240..1F248 ; Expands_On_NFKC # So [9] TORTOISE SHELL BRACKETED CJK UNIFIED IDEOGRAPH-672C..TORTOISE SHELL BRACKETED CJK UNIFIED IDEOGRAPH-6557
29892989

2990-
# Total code points: 1240
2990+
# Total code points: 1241
29912991

29922992
# ================================================
29932993

@@ -7238,6 +7238,7 @@ FFF0..FFF8 ; NFKC_CF; # Cn [9] <reserved-FFF0>..<reserved-FF
72387238
1B123 ; NFKC_CF; 3053 3068 # Lo HIRAGANA DIGRAPH KOTO
72397239
1B124 ; NFKC_CF; 30C8 30AD # Lo KATAKANA DIGRAPH TOKI
72407240
1B125 ; NFKC_CF; 30C8 30C6 # Lo KATAKANA DIGRAPH TOTE
7241+
1B126 ; NFKC_CF; 30E8 30EA # Lo KATAKANA DIGRAPH YORI
72417242
1BCA0..1BCA3 ; NFKC_CF; # Cf [4] SHORTHAND FORMAT LETTER OVERLAP..SHORTHAND FORMAT UP STEP
72427243
1CCD6 ; NFKC_CF; 0061 # So OUTLINED LATIN CAPITAL LETTER A
72437244
1CCD7 ; NFKC_CF; 0062 # So OUTLINED LATIN CAPITAL LETTER B
@@ -9255,7 +9256,7 @@ E0080..E00FF ; NFKC_CF; # Cn [128] <reserved-E0080>..<reserved-E
92559256
E0100..E01EF ; NFKC_CF; # Mn [240] VARIATION SELECTOR-17..VARIATION SELECTOR-256
92569257
E01F0..E0FFF ; NFKC_CF; # Cn [3600] <reserved-E01F0>..<reserved-E0FFF>
92579258

9258-
# Total code points: 10650
9259+
# Total code points: 10651
92599260

92609261
# ================================================
92619262

@@ -13468,6 +13469,7 @@ FFF0..FFF8 ; NFKC_SCF; # Cn [9] <reserved-FFF0>..<reserved-F
1346813469
1B123 ; NFKC_SCF; 3053 3068 # Lo HIRAGANA DIGRAPH KOTO
1346913470
1B124 ; NFKC_SCF; 30C8 30AD # Lo KATAKANA DIGRAPH TOKI
1347013471
1B125 ; NFKC_SCF; 30C8 30C6 # Lo KATAKANA DIGRAPH TOTE
13472+
1B126 ; NFKC_SCF; 30E8 30EA # Lo KATAKANA DIGRAPH YORI
1347113473
1BCA0..1BCA3 ; NFKC_SCF; # Cf [4] SHORTHAND FORMAT LETTER OVERLAP..SHORTHAND FORMAT UP STEP
1347213474
1CCD6 ; NFKC_SCF; 0061 # So OUTLINED LATIN CAPITAL LETTER A
1347313475
1CCD7 ; NFKC_SCF; 0062 # So OUTLINED LATIN CAPITAL LETTER B
@@ -15485,7 +15487,7 @@ E0080..E00FF ; NFKC_SCF; # Cn [128] <reserved-E0080>..<reserved-
1548515487
E0100..E01EF ; NFKC_SCF; # Mn [240] VARIATION SELECTOR-17..VARIATION SELECTOR-256
1548615488
E01F0..E0FFF ; NFKC_SCF; # Cn [3600] <reserved-E01F0>..<reserved-E0FFF>
1548715489

15488-
# Total code points: 10612
15490+
# Total code points: 10613
1548915491

1549015492
# ================================================
1549115493

@@ -16408,7 +16410,7 @@ FFF0..FFF8 ; Changes_When_NFKC_Casefolded # Cn [9] <reserved-FFF0>..<reserv
1640816410
118A0..118BF ; Changes_When_NFKC_Casefolded # L& [32] WARANG CITI CAPITAL LETTER NGAA..WARANG CITI CAPITAL LETTER VIYO
1640916411
16E40..16E5F ; Changes_When_NFKC_Casefolded # L& [32] MEDEFAIDRIN CAPITAL LETTER M..MEDEFAIDRIN CAPITAL LETTER Y
1641016412
16EA0..16EB8 ; Changes_When_NFKC_Casefolded # L& [25] BERIA ERFE CAPITAL LETTER ARKAB..BERIA ERFE CAPITAL LETTER AY
16411-
1B123..1B125 ; Changes_When_NFKC_Casefolded # Lo [3] HIRAGANA DIGRAPH KOTO..KATAKANA DIGRAPH TOTE
16413+
1B123..1B126 ; Changes_When_NFKC_Casefolded # Lo [4] HIRAGANA DIGRAPH KOTO..KATAKANA DIGRAPH YORI
1641216414
1BCA0..1BCA3 ; Changes_When_NFKC_Casefolded # Cf [4] SHORTHAND FORMAT LETTER OVERLAP..SHORTHAND FORMAT UP STEP
1641316415
1CCD6..1CCEF ; Changes_When_NFKC_Casefolded # So [26] OUTLINED LATIN CAPITAL LETTER A..OUTLINED LATIN CAPITAL LETTER Z
1641416416
1CCF0..1CCF9 ; Changes_When_NFKC_Casefolded # Nd [10] OUTLINED DIGIT ZERO..OUTLINED DIGIT NINE
@@ -16516,6 +16518,6 @@ E0080..E00FF ; Changes_When_NFKC_Casefolded # Cn [128] <reserved-E0080>..<reser
1651616518
E0100..E01EF ; Changes_When_NFKC_Casefolded # Mn [240] VARIATION SELECTOR-17..VARIATION SELECTOR-256
1651716519
E01F0..E0FFF ; Changes_When_NFKC_Casefolded # Cn [3600] <reserved-E01F0>..<reserved-E0FFF>
1651816520

16519-
# Total code points: 10650
16521+
# Total code points: 10651
1652016522

1652116523
# EOF

unicodetools/data/ucd/dev/EastAsianWidth.txt

Lines changed: 2 additions & 3 deletions
Original file line numberDiff line numberDiff line change
@@ -1,5 +1,5 @@
11
# EastAsianWidth-18.0.0.txt
2-
# Date: 2025-11-28, 15:47:02 GMT
2+
# Date: 2025-11-28, 16:10:19 GMT
33
# © 2025 Unicode®, Inc.
44
# Unicode and the Unicode Logo are registered trademarks of Unicode, Inc. in the U.S. and other countries.
55
# For terms of use and license, see https://www.unicode.org/terms_of_use.html
@@ -2400,8 +2400,7 @@ FFFD ; A # So REPLACEMENT CHARACTER
24002400
1AFF5..1AFFB ; W # Lm [7] KATAKANA LETTER MINNAN TONE-7..KATAKANA LETTER MINNAN NASALIZED TONE-5
24012401
1AFFD..1AFFE ; W # Lm [2] KATAKANA LETTER MINNAN NASALIZED TONE-7..KATAKANA LETTER MINNAN NASALIZED TONE-8
24022402
1B000..1B0FF ; W # Lo [256] KATAKANA LETTER ARCHAIC E..HENTAIGANA LETTER RE-2
2403-
1B100..1B125 ; W # Lo [38] HENTAIGANA LETTER RE-3..KATAKANA DIGRAPH TOTE
2404-
1B127..1B128 ; W # Lo [2] KATAKANA LETTER ALTERNATE NE..KATAKANA LETTER ALTERNATE WI
2403+
1B100..1B128 ; W # Lo [41] HENTAIGANA LETTER RE-3..KATAKANA LETTER ALTERNATE WI
24052404
1B132 ; W # Lo HIRAGANA LETTER SMALL KO
24062405
1B150..1B152 ; W # Lo [3] HIRAGANA LETTER SMALL WI..HIRAGANA LETTER SMALL WO
24072406
1B155 ; W # Lo KATAKANA LETTER SMALL KO

0 commit comments

Comments
 (0)