Unicode Utilities: Character Properties

help | character | properties | confusables | unicode-set | compare-sets | regex | bnf-regex | breaks | transform | bidi | bidi-c | idna | languageid


 一 
4E00
CJK UNIFIED IDEOGRAPH-4E00
Han Script
id: restricted
confuse: , , , , , , , , , ,
non-Unihan properties for U+4E00
With Non-Default ValuesWith Default Values
AgeV1_1
AlphabeticYes
Bidi_Mirroring_Glyphnull
Bidi_Paired_Bracketnull
BlockCJK_Unified_Ideographs
CJK_Radical1
Confusable_MA
East_Asian_WidthWide
Emoji_DCMnull
Emoji_KDDInull
Emoji_SBnull
Equivalent_Unified_Ideographnull
exemplarja,yue,yue-Hans,zh,zh-Hant
exemplar_aux
exemplar_punct
General_CategoryOther_Letter
Grapheme_BaseYes
HanTypeHan
ID_ContinueYes
ID_StartYes
Identifier_StatusAllowed
Identifier_TypeRecommended
IdeographicYes
Idn_Statusvalid
idna2003valid
idna2008PVALID
idna2008cvalid
ISO_Commentnull
Jamo_Short_Namenull
Line_BreakIdeographic
Name_Aliasnull
Named_Sequencesnull
Named_Sequences_Provnull
Numeric_TypeNumeric
Numeric_Value1
ScriptHan
Script_ExtensionsHan
Sentence_BreakOLetter
Standardized_Variantnull
subheadnull
toIdna2003null
toUts46nnull
toUts46tnull
Unicode_1_Namenull
Unified_IdeographYes
uts46valid
Vertical_OrientationUpright
XID_ContinueYes
XID_StartYes
ANYYes
ASCIINo
ASCII_Hex_DigitNo
Basic_EmojiNo
Bidi_ClassLeft_To_Right
Bidi_ControlNo
Bidi_MirroredNo
Bidi_Paired_Bracket_TypeNone
bmpYes
Canonical_Combining_ClassNot_Reordered
Case_Folding
Case_IgnorableNo
CasedNo
Changes_When_CasefoldedNo
Changes_When_CasemappedNo
Changes_When_LowercasedNo
Changes_When_NFKC_CasefoldedNo
Changes_When_TitlecasedNo
Changes_When_UppercasedNo
Composition_ExclusionNo
Confusable_ML
Confusable_SA
Confusable_SL
DashNo
Decomposition_Mapping
Decomposition_TypeNone
Default_Ignorable_Code_PointNo
DeprecatedNo
DiacriticNo
EmojiNo
Emoji_ComponentNo
Emoji_ModifierNo
Emoji_Modifier_BaseNo
Emoji_PresentationNo
Expands_On_NFCNo
Expands_On_NFDNo
Expands_On_NFKCNo
Expands_On_NFKDNo
Extended_PictographicNo
ExtenderNo
FC_NFKC_Closure
Full_Composition_ExclusionNo
Grapheme_Cluster_BreakOther
Grapheme_ExtendNo
Grapheme_LinkNo
Hangul_Syllable_TypeNot_Applicable
Hex_DigitNo
HyphenNo
ID_Compat_Math_ContinueNo
ID_Compat_Math_StartNo
Idn_2008na
Idn_Mapping
IDS_Binary_OperatorNo
IDS_Trinary_OperatorNo
IDS_Unary_OperatorNo
Indic_Conjunct_BreakNone
Indic_Positional_CategoryNA
Indic_Syllabic_CategoryOther
isNFCYes
isNFDYes
isNFKCYes
isNFKDYes
isNFMYes
Join_ControlNo
Joining_GroupNo_Joining_Group
Joining_TypeNon_Joining
Logical_Order_ExceptionNo
LowercaseNo
Lowercase_Mapping
MathNo
Modifier_Combining_MarkNo
NFC_Quick_CheckYes
NFD_Quick_CheckYes
NFKC_Casefold
NFKC_Quick_CheckYes
NFKC_Simple_Casefold
NFKD_Quick_CheckYes
Noncharacter_Code_PointNo
Other_AlphabeticNo
Other_Default_Ignorable_Code_PointNo
Other_Grapheme_ExtendNo
Other_ID_ContinueNo
Other_ID_StartNo
Other_Joining_TypeDeduce_From_General_Category
Other_LowercaseNo
Other_MathNo
Other_UppercaseNo
Pattern_SyntaxNo
Pattern_White_SpaceNo
Prepended_Concatenation_MarkNo
Quotation_MarkNo
RadicalNo
Regional_IndicatorNo
RGI_EmojiNo
RGI_Emoji_Flag_SequenceNo
RGI_Emoji_Keycap_SequenceNo
RGI_Emoji_Modifier_SequenceNo
RGI_Emoji_Tag_SequenceNo
RGI_Emoji_Zwj_SequenceNo
Sentence_TerminalNo
Simple_Case_Folding
Simple_Lowercase_Mapping
Simple_Titlecase_Mapping
Simple_Uppercase_Mapping
Soft_DottedNo
Terminal_PunctuationNo
Titlecase_Mapping
toCasefold
toLowercase
toNFC
toNFD
toNFKC
toNFKD
toNFM
toTitlecase
toUppercase
ucanull
uca2null
uca2.5null
uca3null
UppercaseNo
Uppercase_Mapping
Variation_SelectorNo
White_SpaceNo
Word_BreakOther
Unihan properties for U+4E00
kAccountingNumericNaN
kAlternateTotalStrokesnull
kBigFiveA440
kCangjieM
kCantonesejat1
kCCCII213021
kCheungBauernull
kCheungBauerIndexnull
kCihaiT1.101
kCNS19861-4421
kCNS19921-4421
kCompatibilityVariant
kCowles5133
kDaeJaweon0129.010
kDefinitionone; a, an; alone
kEACC213021
kFanqienull
kFenn1A
kFennIndex216.01|217.06|218.01|220.06
kFourCornerCode1000.0
kFrequency1
kGB05027
kGB15027
kGB3null
kGB5null
kGB7null
kGB8null
kGradeLevel1
kGSR0394a
kHangul일:0E
kHanYu10001.010
kHanyuPinluyī(32747)
kHanyuPinyin10001.010:yī
kHDZRadBreak⼀[U+2F00]:10001.010
kHKGlyph0001
kHKSCSnull
kIBMJapannull
kIICoreAGTJHKMP
kIRG_GSourceG0-523B
kIRG_HSourceHB1-A440
kIRG_JSourceJ0-306C
kIRG_KPSourceKP0-FCD6
kIRG_KSourceK0-6C69
kIRG_MSourcenull
kIRG_SSourcenull
kIRG_TSourceT1-4421
kIRG_UKSourcenull
kIRG_USourcenull
kIRG_VSourceV1-4A21
kIRGDaeJaweon0129.010
kIRGDaiKanwaZiten00001
kIRGHanyuDaZidian10001.010
kIRGKangXi0075.010
kJanull
kJapaneseイチ イツ ひと ひとつ
kJapaneseKunHITOTSU|HITOTABI|HAJIME
kJapaneseOnICHI|ITSU
kJinmeiyoKanjinull
kJis01676
kJis1null
kJIS0213null
kJoyoKanji2010
kKangXi0075.010
kKarlgren175
kKoreanIL
kKoreanEducationHanja2007
kKoreanNamenull
kKPS0FCD6
kKPS1null
kKSC07673
kKSC1null
kLau3341
kMainlandTelegraph0001
kMandarin
kMatthews3016
kMeyerWempe3837
kMojiJohoMJ006294
kMorohashi00001
kNelson0001
kOtherNumericNaN
kPhonetic1499
kPrimaryNumeric1
kPseudoGB1null
kRSAdobe_Japan1_6C+1200+1.1.0
kRSJapanesenull
kRSKangXi1.0
kRSKanWanull
kRSKoreannull
kRSUnicode1.0
kSBGY468.40
kSemanticVariantU+5F0C<kLau,kMatthews,kMeyerWempe|U+58F9<kLau,kMatthews,kMeyerWempe
kSimplifiedVariantnull
kSMSZD2003Index1.01
kSMSZD2003Readingsyī粵jat1
kSpecializedSemanticVariantU+58F9
kSpoofingVariantnull
kStrangenull
kTaiwanTelegraph0001
kTang*qit|qit
kTGH2013:1
kTGHZ2013430.150:yī
kTotalStrokes1
kTraditionalVariantnull
kUnihanCore2020GHJKMPT
kVietnamesenhất
kVietnameseNumericnull
kXerox241:042
kXHC19831351.020:yī|1360.040:yí|1368.160:yì
kZhuangNumericnull
kZVariantnull

The list includes both Unicode Character Properties and some additions (like idna2003 or subhead)


Fonts and Display. If you don't have a good set of Unicode fonts (and modern browser), you may not be able to read some of the characters. Some suggested fonts that you can add for coverage are: Noto Fonts site, Unicode Fonts for Ancient Scripts, Large, multi-script Unicode fonts. See also: Unicode Display Problems.

Version 3.9; ICU version: 74.1; Unicode/Emoji version: 15.1.0;