Unicode Utilities: Character Properties

Unmarked properties are from Unicode V15.1.0; the beta properties are from Unicode V16.0.0β. For more information, see Unicode Utilities Beta.

help | character | properties | confusables | unicode-set | compare-sets | regex | bnf-regex | breaks | transform | bidi | bidi-c | idna | languageid


 庁 
5E81
CJK UNIFIED IDEOGRAPH-5E81
Han Script
id: restricted
confuse: none
non-Unihan properties for U+5E81
With Non-Default Values With Default Values
Age V1_1
Alphabetic Yes
Bidi_Mirroring_Glyph null
Bidi_Paired_Bracket null
Block CJK_Unified_Ideographs
CJK_Radical null
Do_Not_Emit_Preferred 16.0β: null
East_Asian_Width Wide
Emoji_DCM null
Emoji_KDDI null
Emoji_SB null
Equivalent_Unified_Ideograph null
exemplar ja
exemplar_aux
exemplar_punct
General_Category Other_Letter
Grapheme_Base Yes
HanType Han
ID_Continue Yes
ID_Start Yes
Identifier_Status Allowed
Identifier_Type Recommended
Ideographic Yes
Idn_Status valid
idna2003 valid
idna2008 PVALID
idna2008c valid
ISO_Comment 16.0β: null
Jamo_Short_Name null
kEH_Cat 16.0β: null
kEH_Desc 16.0β: null
kEH_Func 16.0β: null
kEH_FVal 16.0β: null
kEH_HG 16.0β: null
kEH_IFAO 16.0β: null
kEH_JSesh 16.0β: null
kEH_UniK 16.0β: null
kReading null
kRSTUnicode null
kSrc_NushuDuben null
kTGT_MergedSrc null
Line_Break Ideographic
Name_Alias null
Named_Sequences null
Named_Sequences_Prov null
Script Han
Script_Extensions Han
Sentence_Break OLetter
Standardized_Variant null
subhead null
toIdna2003 null
toUts46n null
toUts46t null
Unicode_1_Name null
Unified_Ideograph Yes
uts46 valid
Vertical_Orientation Upright
XID_Continue Yes
XID_Start Yes
ANY Yes
ASCII No
ASCII_Hex_Digit No
Basic_Emoji No
Bidi_Class Left_To_Right
Bidi_Control No
Bidi_Mirrored No
Bidi_Paired_Bracket_Type None
bmp Yes
Canonical_Combining_Class Not_Reordered
Case_Folding
Case_Ignorable No
Cased No
Changes_When_Casefolded No
Changes_When_Casemapped No
Changes_When_Lowercased No
Changes_When_NFKC_Casefolded No
Changes_When_Titlecased No
Changes_When_Uppercased No
Composition_Exclusion No
Confusable_MA
Confusable_ML
Confusable_SA
Confusable_SL
Dash No
Decomposition_Mapping
Decomposition_Type None
Default_Ignorable_Code_Point No
Deprecated No
Diacritic No
Do_Not_Emit_Type 16.0β: null
Emoji No
Emoji_Component No
Emoji_Modifier No
Emoji_Modifier_Base No
Emoji_Presentation No
Expands_On_NFC No
Expands_On_NFD No
Expands_On_NFKC No
Expands_On_NFKD No
Extended_Pictographic No
Extender No
FC_NFKC_Closure
Full_Composition_Exclusion No
Grapheme_Cluster_Break Other
Grapheme_Extend No
Grapheme_Link No
Hangul_Syllable_Type Not_Applicable
Hex_Digit No
Hyphen No
ID_Compat_Math_Continue No
ID_Compat_Math_Start No
Idn_2008 na
Idn_Mapping
IDS_Binary_Operator No
IDS_Trinary_Operator No
IDS_Unary_Operator No
Indic_Conjunct_Break None
Indic_Positional_Category NA
Indic_Syllabic_Category Other
isNFC Yes
isNFD Yes
isNFKC Yes
isNFKD Yes
isNFM Yes
Join_Control No
Joining_Group No_Joining_Group
Joining_Type Non_Joining
kEH_Core 16.0β: No
kEH_NoMirror 16.0β: No
kEH_NoRotate 16.0β: No
Logical_Order_Exception No
Lowercase No
Lowercase_Mapping
Math No
Modifier_Combining_Mark 16.0β: No
NFC_Quick_Check Yes
NFD_Quick_Check Yes
NFKC_Casefold
NFKC_Quick_Check Yes
NFKC_Simple_Casefold
NFKD_Quick_Check Yes
Noncharacter_Code_Point No
Numeric_Type None
Numeric_Value NaN
Other_Alphabetic No
Other_Default_Ignorable_Code_Point No
Other_Grapheme_Extend No
Other_ID_Continue No
Other_ID_Start No
Other_Joining_Type Deduce_From_General_Category
Other_Lowercase No
Other_Math No
Other_Uppercase No
Pattern_Syntax No
Pattern_White_Space No
Prepended_Concatenation_Mark No
Quotation_Mark No
Radical No
Regional_Indicator No
RGI_Emoji No
RGI_Emoji_Flag_Sequence No
RGI_Emoji_Keycap_Sequence No
RGI_Emoji_Modifier_Sequence No
RGI_Emoji_Tag_Sequence No
RGI_Emoji_Zwj_Sequence No
Sentence_Terminal No
Simple_Case_Folding
Simple_Lowercase_Mapping
Simple_Titlecase_Mapping
Simple_Uppercase_Mapping
Soft_Dotted No
Terminal_Punctuation No
Titlecase_Mapping
toCasefold
toLowercase
toNFC
toNFD
toNFKC
toNFKD
toNFM
toTitlecase
toUppercase
uca null
uca2 null
uca2.5 null
uca3 null
Uppercase No
Uppercase_Mapping
Variation_Selector No
White_Space No
Word_Break Other
Unihan properties for U+5E81
kAccountingNumeric NaN
kAlternateTotalStrokes null
kBigFive null
kCangjie IMN
kCantonese ting1
kCCCII 242975
kCheungBauer null
kCheungBauerIndex null
kCihaiT null
kCNS1986 E-2247
kCNS1992 3-2247
kCompatibilityVariant
kCowles null
kDaeJaweon 0652.160
kDefinition hall, central room
kEACC 333D2F
kFanqie 16.0β: 他丁
kFenn null
kFennIndex null
kFourCornerCode 0022.1
kFrequency null
kGB0 null
kGB1 null
kGB3 null
kGB5 2885
kGB7 null
kGB8 null
kGradeLevel null
kGSR null
kHangul null
kHanYu 20872.070
kHanyuPinlu null
kHanyuPinyin 20872.070:tīng
kHDZRadBreak null
kHKGlyph null
kHKSCS null
kIBMJapan null
kIICore AJ
kIRG_GSource G5-3C75
kIRG_HSource null
kIRG_JSource J0-4423
kIRG_KPSource null
kIRG_KSource K2-304D
kIRG_MSource null
kIRG_SSource null
kIRG_TSource T3-2247
kIRG_UKSource null
kIRG_USource null
kIRG_VSource null
kIRGDaeJaweon 0652.160
kIRGDaiKanwaZiten 09223
kIRGHanyuDaZidian 20872.070
kIRGKangXi 0343.070
kJa null
kJapanese チョウ テイ
kJapaneseKun YAKUSHO
kJapaneseOn CHOU|TEI
kJinmeiyoKanji null
kJis0 3603
kJis1 null
kJIS0213 null
kJoyoKanji 2010
kKangXi 0343.070
kKarlgren null
kKorean CHENG
kKoreanEducationHanja null
kKoreanName null
kKPS0 null
kKPS1 null
kKSC0 null
kKSC1 null
kLau null
kMainlandTelegraph null
kMandarin tīng
kMatthews null
kMeyerWempe null
kMojiJoho MJ010961
kMorohashi 09223
kNelson 1498
kOtherNumeric NaN
kPhonetic null
kPrimaryNumeric NaN
kPseudoGB1 null
kRSAdobe_Japan1_6 C+3007+53.3.2
kRSJapanese null
kRSKangXi 53.2
kRSKanWa null
kRSKorean null
kRSUnicode 53.2
kSBGY 197.25
kSemanticVariant null
kSimplifiedVariant null
kSMSZD2003Index null
kSMSZD2003Readings null
kSpecializedSemanticVariant null
kSpoofingVariant null
kStrange null
kTaiwanTelegraph null
kTang null
kTGH null
kTGHZ2013 null
kTotalStrokes 5
kTraditionalVariant null
kUnihanCore2020 J
kVietnamese null
kVietnameseNumeric null
kXerox null
kXHC1983 null
kZhuangNumeric null
kZVariant null

The list includes both Unicode Character Properties and some additions (like idna2003 or subhead)


Fonts and Display. If you don't have a good set of Unicode fonts (and modern browser), you may not be able to read some of the characters. Some suggested fonts that you can add for coverage are: Noto Fonts site, Unicode Fonts for Ancient Scripts, Large, multi-script Unicode fonts. See also: Unicode Display Problems.

Version 3.9; ICU version: 74.1; Unicode/Emoji version: 15.1.0; Unicodeβ version: 16.0.0;