Since: 7.0

class IntlChar (View source)

IntlChar provides access to a number of utility methods that can be used to access information about Unicode characters.

The methods and constants adhere closely to the names and behavior used by the underlying ICU library.

Constants

UNICODE_VERSION

CODEPOINT_MIN

CODEPOINT_MAX

FOLD_CASE_DEFAULT

FOLD_CASE_EXCLUDE_SPECIAL_I

PROPERTY_ALPHABETIC

PROPERTY_BINARY_START

PROPERTY_ASCII_HEX_DIGIT

PROPERTY_BIDI_CONTROL

PROPERTY_BIDI_MIRRORED

PROPERTY_DASH

PROPERTY_DEFAULT_IGNORABLE_CODE_POINT

PROPERTY_DEPRECATED

PROPERTY_DIACRITIC

PROPERTY_EXTENDER

PROPERTY_FULL_COMPOSITION_EXCLUSION

PROPERTY_GRAPHEME_BASE

PROPERTY_GRAPHEME_EXTEND

PROPERTY_GRAPHEME_LINK

PROPERTY_HEX_DIGIT

PROPERTY_HYPHEN

PROPERTY_ID_CONTINUE

PROPERTY_ID_START

PROPERTY_IDEOGRAPHIC

PROPERTY_IDS_BINARY_OPERATOR

PROPERTY_IDS_TRINARY_OPERATOR

PROPERTY_JOIN_CONTROL

PROPERTY_LOGICAL_ORDER_EXCEPTION

PROPERTY_LOWERCASE

PROPERTY_MATH

PROPERTY_NONCHARACTER_CODE_POINT

PROPERTY_QUOTATION_MARK

PROPERTY_RADICAL

PROPERTY_SOFT_DOTTED

PROPERTY_TERMINAL_PUNCTUATION

PROPERTY_UNIFIED_IDEOGRAPH

PROPERTY_UPPERCASE

PROPERTY_WHITE_SPACE

PROPERTY_XID_CONTINUE

PROPERTY_XID_START

PROPERTY_CASE_SENSITIVE

PROPERTY_S_TERM

PROPERTY_VARIATION_SELECTOR

PROPERTY_NFD_INERT

PROPERTY_NFKD_INERT

PROPERTY_NFC_INERT

PROPERTY_NFKC_INERT

PROPERTY_SEGMENT_STARTER

PROPERTY_PATTERN_SYNTAX

PROPERTY_PATTERN_WHITE_SPACE

PROPERTY_POSIX_ALNUM

PROPERTY_POSIX_BLANK

PROPERTY_POSIX_GRAPH

PROPERTY_POSIX_PRINT

PROPERTY_POSIX_XDIGIT

PROPERTY_CASED

PROPERTY_CASE_IGNORABLE

PROPERTY_CHANGES_WHEN_LOWERCASED

PROPERTY_CHANGES_WHEN_UPPERCASED

PROPERTY_CHANGES_WHEN_TITLECASED

PROPERTY_CHANGES_WHEN_CASEFOLDED

PROPERTY_CHANGES_WHEN_CASEMAPPED

PROPERTY_CHANGES_WHEN_NFKC_CASEFOLDED

PROPERTY_BINARY_LIMIT

PROPERTY_BIDI_CLASS

PROPERTY_INT_START

PROPERTY_BLOCK

PROPERTY_CANONICAL_COMBINING_CLASS

PROPERTY_DECOMPOSITION_TYPE

PROPERTY_EAST_ASIAN_WIDTH

PROPERTY_GENERAL_CATEGORY

PROPERTY_JOINING_GROUP

PROPERTY_JOINING_TYPE

PROPERTY_LINE_BREAK

PROPERTY_NUMERIC_TYPE

PROPERTY_SCRIPT

PROPERTY_HANGUL_SYLLABLE_TYPE

PROPERTY_NFD_QUICK_CHECK

PROPERTY_NFKD_QUICK_CHECK

PROPERTY_NFC_QUICK_CHECK

PROPERTY_NFKC_QUICK_CHECK

PROPERTY_LEAD_CANONICAL_COMBINING_CLASS

PROPERTY_TRAIL_CANONICAL_COMBINING_CLASS

PROPERTY_GRAPHEME_CLUSTER_BREAK

PROPERTY_SENTENCE_BREAK

PROPERTY_WORD_BREAK

PROPERTY_BIDI_PAIRED_BRACKET_TYPE

PROPERTY_INT_LIMIT

PROPERTY_GENERAL_CATEGORY_MASK

PROPERTY_MASK_START

PROPERTY_MASK_LIMIT

PROPERTY_NUMERIC_VALUE

PROPERTY_DOUBLE_START

PROPERTY_DOUBLE_LIMIT

PROPERTY_AGE

PROPERTY_STRING_START

PROPERTY_BIDI_MIRRORING_GLYPH

PROPERTY_CASE_FOLDING

PROPERTY_ISO_COMMENT

PROPERTY_LOWERCASE_MAPPING

PROPERTY_NAME

PROPERTY_SIMPLE_CASE_FOLDING

PROPERTY_SIMPLE_LOWERCASE_MAPPING

PROPERTY_SIMPLE_TITLECASE_MAPPING

PROPERTY_SIMPLE_UPPERCASE_MAPPING

PROPERTY_TITLECASE_MAPPING

PROPERTY_UNICODE_1_NAME

PROPERTY_UPPERCASE_MAPPING

PROPERTY_BIDI_PAIRED_BRACKET

PROPERTY_STRING_LIMIT

PROPERTY_SCRIPT_EXTENSIONS

PROPERTY_OTHER_PROPERTY_START

PROPERTY_OTHER_PROPERTY_LIMIT

PROPERTY_INVALID_CODE

CHAR_CATEGORY_UNASSIGNED

CHAR_CATEGORY_GENERAL_OTHER_TYPES

CHAR_CATEGORY_UPPERCASE_LETTER

CHAR_CATEGORY_LOWERCASE_LETTER

CHAR_CATEGORY_TITLECASE_LETTER

CHAR_CATEGORY_MODIFIER_LETTER

CHAR_CATEGORY_OTHER_LETTER

CHAR_CATEGORY_NON_SPACING_MARK

CHAR_CATEGORY_ENCLOSING_MARK

CHAR_CATEGORY_COMBINING_SPACING_MARK

CHAR_CATEGORY_DECIMAL_DIGIT_NUMBER

CHAR_CATEGORY_LETTER_NUMBER

CHAR_CATEGORY_OTHER_NUMBER

CHAR_CATEGORY_SPACE_SEPARATOR

CHAR_CATEGORY_LINE_SEPARATOR

CHAR_CATEGORY_PARAGRAPH_SEPARATOR

CHAR_CATEGORY_CONTROL_CHAR

CHAR_CATEGORY_FORMAT_CHAR

CHAR_CATEGORY_PRIVATE_USE_CHAR

CHAR_CATEGORY_SURROGATE

CHAR_CATEGORY_DASH_PUNCTUATION

CHAR_CATEGORY_START_PUNCTUATION

CHAR_CATEGORY_END_PUNCTUATION

CHAR_CATEGORY_CONNECTOR_PUNCTUATION

CHAR_CATEGORY_OTHER_PUNCTUATION

CHAR_CATEGORY_MATH_SYMBOL

CHAR_CATEGORY_CURRENCY_SYMBOL

CHAR_CATEGORY_MODIFIER_SYMBOL

CHAR_CATEGORY_OTHER_SYMBOL

CHAR_CATEGORY_INITIAL_PUNCTUATION

CHAR_CATEGORY_FINAL_PUNCTUATION

CHAR_CATEGORY_CHAR_CATEGORY_COUNT

CHAR_DIRECTION_LEFT_TO_RIGHT

CHAR_DIRECTION_RIGHT_TO_LEFT

CHAR_DIRECTION_EUROPEAN_NUMBER

CHAR_DIRECTION_EUROPEAN_NUMBER_SEPARATOR

CHAR_DIRECTION_EUROPEAN_NUMBER_TERMINATOR

CHAR_DIRECTION_ARABIC_NUMBER

CHAR_DIRECTION_COMMON_NUMBER_SEPARATOR

CHAR_DIRECTION_BLOCK_SEPARATOR

CHAR_DIRECTION_SEGMENT_SEPARATOR

CHAR_DIRECTION_WHITE_SPACE_NEUTRAL

CHAR_DIRECTION_OTHER_NEUTRAL

CHAR_DIRECTION_LEFT_TO_RIGHT_EMBEDDING

CHAR_DIRECTION_LEFT_TO_RIGHT_OVERRIDE

CHAR_DIRECTION_RIGHT_TO_LEFT_ARABIC

CHAR_DIRECTION_RIGHT_TO_LEFT_EMBEDDING

CHAR_DIRECTION_RIGHT_TO_LEFT_OVERRIDE

CHAR_DIRECTION_POP_DIRECTIONAL_FORMAT

CHAR_DIRECTION_DIR_NON_SPACING_MARK

CHAR_DIRECTION_BOUNDARY_NEUTRAL

CHAR_DIRECTION_FIRST_STRONG_ISOLATE

CHAR_DIRECTION_LEFT_TO_RIGHT_ISOLATE

CHAR_DIRECTION_RIGHT_TO_LEFT_ISOLATE

CHAR_DIRECTION_POP_DIRECTIONAL_ISOLATE

CHAR_DIRECTION_CHAR_DIRECTION_COUNT

BLOCK_CODE_NO_BLOCK

BLOCK_CODE_BASIC_LATIN

BLOCK_CODE_LATIN_1_SUPPLEMENT

BLOCK_CODE_LATIN_EXTENDED_A

BLOCK_CODE_LATIN_EXTENDED_B

BLOCK_CODE_IPA_EXTENSIONS

BLOCK_CODE_SPACING_MODIFIER_LETTERS

BLOCK_CODE_COMBINING_DIACRITICAL_MARKS

BLOCK_CODE_GREEK

BLOCK_CODE_CYRILLIC

BLOCK_CODE_ARMENIAN

BLOCK_CODE_HEBREW

BLOCK_CODE_ARABIC

BLOCK_CODE_SYRIAC

BLOCK_CODE_THAANA

BLOCK_CODE_DEVANAGARI

BLOCK_CODE_BENGALI

BLOCK_CODE_GURMUKHI

BLOCK_CODE_GUJARATI

BLOCK_CODE_ORIYA

BLOCK_CODE_TAMIL

BLOCK_CODE_TELUGU

BLOCK_CODE_KANNADA

BLOCK_CODE_MALAYALAM

BLOCK_CODE_SINHALA

BLOCK_CODE_THAI

BLOCK_CODE_LAO

BLOCK_CODE_TIBETAN

BLOCK_CODE_MYANMAR

BLOCK_CODE_GEORGIAN

BLOCK_CODE_HANGUL_JAMO

BLOCK_CODE_ETHIOPIC

BLOCK_CODE_CHEROKEE

BLOCK_CODE_UNIFIED_CANADIAN_ABORIGINAL_SYLLABICS

BLOCK_CODE_OGHAM

BLOCK_CODE_RUNIC

BLOCK_CODE_KHMER

BLOCK_CODE_MONGOLIAN

BLOCK_CODE_LATIN_EXTENDED_ADDITIONAL

BLOCK_CODE_GREEK_EXTENDED

BLOCK_CODE_GENERAL_PUNCTUATION

BLOCK_CODE_SUPERSCRIPTS_AND_SUBSCRIPTS

BLOCK_CODE_CURRENCY_SYMBOLS

BLOCK_CODE_COMBINING_MARKS_FOR_SYMBOLS

BLOCK_CODE_LETTERLIKE_SYMBOLS

BLOCK_CODE_NUMBER_FORMS

BLOCK_CODE_ARROWS

BLOCK_CODE_MATHEMATICAL_OPERATORS

BLOCK_CODE_MISCELLANEOUS_TECHNICAL

BLOCK_CODE_CONTROL_PICTURES

BLOCK_CODE_OPTICAL_CHARACTER_RECOGNITION

BLOCK_CODE_ENCLOSED_ALPHANUMERICS

BLOCK_CODE_BOX_DRAWING

BLOCK_CODE_BLOCK_ELEMENTS

BLOCK_CODE_GEOMETRIC_SHAPES

BLOCK_CODE_MISCELLANEOUS_SYMBOLS

BLOCK_CODE_DINGBATS

BLOCK_CODE_BRAILLE_PATTERNS

BLOCK_CODE_CJK_RADICALS_SUPPLEMENT

BLOCK_CODE_KANGXI_RADICALS

BLOCK_CODE_IDEOGRAPHIC_DESCRIPTION_CHARACTERS

BLOCK_CODE_CJK_SYMBOLS_AND_PUNCTUATION

BLOCK_CODE_HIRAGANA

BLOCK_CODE_KATAKANA

BLOCK_CODE_BOPOMOFO

BLOCK_CODE_HANGUL_COMPATIBILITY_JAMO

BLOCK_CODE_KANBUN

BLOCK_CODE_BOPOMOFO_EXTENDED

BLOCK_CODE_ENCLOSED_CJK_LETTERS_AND_MONTHS

BLOCK_CODE_CJK_COMPATIBILITY

BLOCK_CODE_CJK_UNIFIED_IDEOGRAPHS_EXTENSION_A

BLOCK_CODE_CJK_UNIFIED_IDEOGRAPHS

BLOCK_CODE_YI_SYLLABLES

BLOCK_CODE_YI_RADICALS

BLOCK_CODE_HANGUL_SYLLABLES

BLOCK_CODE_HIGH_SURROGATES

BLOCK_CODE_HIGH_PRIVATE_USE_SURROGATES

BLOCK_CODE_LOW_SURROGATES

BLOCK_CODE_PRIVATE_USE_AREA

BLOCK_CODE_PRIVATE_USE

BLOCK_CODE_CJK_COMPATIBILITY_IDEOGRAPHS

BLOCK_CODE_ALPHABETIC_PRESENTATION_FORMS

BLOCK_CODE_ARABIC_PRESENTATION_FORMS_A

BLOCK_CODE_COMBINING_HALF_MARKS

BLOCK_CODE_CJK_COMPATIBILITY_FORMS

BLOCK_CODE_SMALL_FORM_VARIANTS

BLOCK_CODE_ARABIC_PRESENTATION_FORMS_B

BLOCK_CODE_SPECIALS

BLOCK_CODE_HALFWIDTH_AND_FULLWIDTH_FORMS

BLOCK_CODE_OLD_ITALIC

BLOCK_CODE_GOTHIC

BLOCK_CODE_DESERET

BLOCK_CODE_BYZANTINE_MUSICAL_SYMBOLS

BLOCK_CODE_MUSICAL_SYMBOLS

BLOCK_CODE_MATHEMATICAL_ALPHANUMERIC_SYMBOLS

BLOCK_CODE_CJK_UNIFIED_IDEOGRAPHS_EXTENSION_B

BLOCK_CODE_CJK_COMPATIBILITY_IDEOGRAPHS_SUPPLEMENT

BLOCK_CODE_TAGS

BLOCK_CODE_CYRILLIC_SUPPLEMENT

BLOCK_CODE_CYRILLIC_SUPPLEMENTARY

BLOCK_CODE_TAGALOG

BLOCK_CODE_HANUNOO

BLOCK_CODE_BUHID

BLOCK_CODE_TAGBANWA

BLOCK_CODE_MISCELLANEOUS_MATHEMATICAL_SYMBOLS_A

BLOCK_CODE_SUPPLEMENTAL_ARROWS_A

BLOCK_CODE_SUPPLEMENTAL_ARROWS_B

BLOCK_CODE_MISCELLANEOUS_MATHEMATICAL_SYMBOLS_B

BLOCK_CODE_SUPPLEMENTAL_MATHEMATICAL_OPERATORS

BLOCK_CODE_KATAKANA_PHONETIC_EXTENSIONS

BLOCK_CODE_VARIATION_SELECTORS

BLOCK_CODE_SUPPLEMENTARY_PRIVATE_USE_AREA_A

BLOCK_CODE_SUPPLEMENTARY_PRIVATE_USE_AREA_B

BLOCK_CODE_LIMBU

BLOCK_CODE_TAI_LE

BLOCK_CODE_KHMER_SYMBOLS

BLOCK_CODE_PHONETIC_EXTENSIONS

BLOCK_CODE_MISCELLANEOUS_SYMBOLS_AND_ARROWS

BLOCK_CODE_YIJING_HEXAGRAM_SYMBOLS

BLOCK_CODE_LINEAR_B_SYLLABARY

BLOCK_CODE_LINEAR_B_IDEOGRAMS

BLOCK_CODE_AEGEAN_NUMBERS

BLOCK_CODE_UGARITIC

BLOCK_CODE_SHAVIAN

BLOCK_CODE_OSMANYA

BLOCK_CODE_CYPRIOT_SYLLABARY

BLOCK_CODE_TAI_XUAN_JING_SYMBOLS

BLOCK_CODE_VARIATION_SELECTORS_SUPPLEMENT

BLOCK_CODE_ANCIENT_GREEK_MUSICAL_NOTATION

BLOCK_CODE_ANCIENT_GREEK_NUMBERS

BLOCK_CODE_ARABIC_SUPPLEMENT

BLOCK_CODE_BUGINESE

BLOCK_CODE_CJK_STROKES

BLOCK_CODE_COMBINING_DIACRITICAL_MARKS_SUPPLEMENT

BLOCK_CODE_COPTIC

BLOCK_CODE_ETHIOPIC_EXTENDED

BLOCK_CODE_ETHIOPIC_SUPPLEMENT

BLOCK_CODE_GEORGIAN_SUPPLEMENT

BLOCK_CODE_GLAGOLITIC

BLOCK_CODE_KHAROSHTHI

BLOCK_CODE_MODIFIER_TONE_LETTERS

BLOCK_CODE_NEW_TAI_LUE

BLOCK_CODE_OLD_PERSIAN

BLOCK_CODE_PHONETIC_EXTENSIONS_SUPPLEMENT

BLOCK_CODE_SUPPLEMENTAL_PUNCTUATION

BLOCK_CODE_SYLOTI_NAGRI

BLOCK_CODE_TIFINAGH

BLOCK_CODE_VERTICAL_FORMS

BLOCK_CODE_NKO

BLOCK_CODE_BALINESE

BLOCK_CODE_LATIN_EXTENDED_C

BLOCK_CODE_LATIN_EXTENDED_D

BLOCK_CODE_PHAGS_PA

BLOCK_CODE_PHOENICIAN

BLOCK_CODE_CUNEIFORM

BLOCK_CODE_CUNEIFORM_NUMBERS_AND_PUNCTUATION

BLOCK_CODE_COUNTING_ROD_NUMERALS

BLOCK_CODE_SUNDANESE

BLOCK_CODE_LEPCHA

BLOCK_CODE_OL_CHIKI

BLOCK_CODE_CYRILLIC_EXTENDED_A

BLOCK_CODE_VAI

BLOCK_CODE_CYRILLIC_EXTENDED_B

BLOCK_CODE_SAURASHTRA

BLOCK_CODE_KAYAH_LI

BLOCK_CODE_REJANG

BLOCK_CODE_CHAM

BLOCK_CODE_ANCIENT_SYMBOLS

BLOCK_CODE_PHAISTOS_DISC

BLOCK_CODE_LYCIAN

BLOCK_CODE_CARIAN

BLOCK_CODE_LYDIAN

BLOCK_CODE_MAHJONG_TILES

BLOCK_CODE_DOMINO_TILES

BLOCK_CODE_SAMARITAN

BLOCK_CODE_UNIFIED_CANADIAN_ABORIGINAL_SYLLABICS_EXTENDED

BLOCK_CODE_TAI_THAM

BLOCK_CODE_VEDIC_EXTENSIONS

BLOCK_CODE_LISU

BLOCK_CODE_BAMUM

BLOCK_CODE_COMMON_INDIC_NUMBER_FORMS

BLOCK_CODE_DEVANAGARI_EXTENDED

BLOCK_CODE_HANGUL_JAMO_EXTENDED_A

BLOCK_CODE_JAVANESE

BLOCK_CODE_MYANMAR_EXTENDED_A

BLOCK_CODE_TAI_VIET

BLOCK_CODE_MEETEI_MAYEK

BLOCK_CODE_HANGUL_JAMO_EXTENDED_B

BLOCK_CODE_IMPERIAL_ARAMAIC

BLOCK_CODE_OLD_SOUTH_ARABIAN

BLOCK_CODE_AVESTAN

BLOCK_CODE_INSCRIPTIONAL_PARTHIAN

BLOCK_CODE_INSCRIPTIONAL_PAHLAVI

BLOCK_CODE_OLD_TURKIC

BLOCK_CODE_RUMI_NUMERAL_SYMBOLS

BLOCK_CODE_KAITHI

BLOCK_CODE_EGYPTIAN_HIEROGLYPHS

BLOCK_CODE_ENCLOSED_ALPHANUMERIC_SUPPLEMENT

BLOCK_CODE_ENCLOSED_IDEOGRAPHIC_SUPPLEMENT

BLOCK_CODE_CJK_UNIFIED_IDEOGRAPHS_EXTENSION_C

BLOCK_CODE_MANDAIC

BLOCK_CODE_BATAK

BLOCK_CODE_ETHIOPIC_EXTENDED_A

BLOCK_CODE_BRAHMI

BLOCK_CODE_BAMUM_SUPPLEMENT

BLOCK_CODE_KANA_SUPPLEMENT

BLOCK_CODE_PLAYING_CARDS

BLOCK_CODE_MISCELLANEOUS_SYMBOLS_AND_PICTOGRAPHS

BLOCK_CODE_EMOTICONS

BLOCK_CODE_TRANSPORT_AND_MAP_SYMBOLS

BLOCK_CODE_ALCHEMICAL_SYMBOLS

BLOCK_CODE_CJK_UNIFIED_IDEOGRAPHS_EXTENSION_D

BLOCK_CODE_ARABIC_EXTENDED_A

BLOCK_CODE_ARABIC_MATHEMATICAL_ALPHABETIC_SYMBOLS

BLOCK_CODE_CHAKMA

BLOCK_CODE_MEETEI_MAYEK_EXTENSIONS

BLOCK_CODE_MEROITIC_CURSIVE

BLOCK_CODE_MEROITIC_HIEROGLYPHS

BLOCK_CODE_MIAO

BLOCK_CODE_SHARADA

BLOCK_CODE_SORA_SOMPENG

BLOCK_CODE_SUNDANESE_SUPPLEMENT

BLOCK_CODE_TAKRI

BLOCK_CODE_BASSA_VAH

BLOCK_CODE_CAUCASIAN_ALBANIAN

BLOCK_CODE_COPTIC_EPACT_NUMBERS

BLOCK_CODE_COMBINING_DIACRITICAL_MARKS_EXTENDED

BLOCK_CODE_DUPLOYAN

BLOCK_CODE_ELBASAN

BLOCK_CODE_GEOMETRIC_SHAPES_EXTENDED

BLOCK_CODE_GRANTHA

BLOCK_CODE_KHOJKI

BLOCK_CODE_KHUDAWADI

BLOCK_CODE_LATIN_EXTENDED_E

BLOCK_CODE_LINEAR_A

BLOCK_CODE_MAHAJANI

BLOCK_CODE_MANICHAEAN

BLOCK_CODE_MENDE_KIKAKUI

BLOCK_CODE_MODI

BLOCK_CODE_MRO

BLOCK_CODE_MYANMAR_EXTENDED_B

BLOCK_CODE_NABATAEAN

BLOCK_CODE_OLD_NORTH_ARABIAN

BLOCK_CODE_OLD_PERMIC

BLOCK_CODE_ORNAMENTAL_DINGBATS

BLOCK_CODE_PAHAWH_HMONG

BLOCK_CODE_PALMYRENE

BLOCK_CODE_PAU_CIN_HAU

BLOCK_CODE_PSALTER_PAHLAVI

BLOCK_CODE_SHORTHAND_FORMAT_CONTROLS

BLOCK_CODE_SIDDHAM

BLOCK_CODE_SINHALA_ARCHAIC_NUMBERS

BLOCK_CODE_SUPPLEMENTAL_ARROWS_C

BLOCK_CODE_TIRHUTA

BLOCK_CODE_WARANG_CITI

BLOCK_CODE_COUNT

BLOCK_CODE_INVALID_CODE

BPT_NONE

BPT_OPEN

BPT_CLOSE

BPT_COUNT

EA_NEUTRAL

EA_AMBIGUOUS

EA_HALFWIDTH

EA_FULLWIDTH

EA_NARROW

EA_WIDE

EA_COUNT

UNICODE_CHAR_NAME

UNICODE_10_CHAR_NAME

EXTENDED_CHAR_NAME

CHAR_NAME_ALIAS

CHAR_NAME_CHOICE_COUNT

SHORT_PROPERTY_NAME

LONG_PROPERTY_NAME

PROPERTY_NAME_CHOICE_COUNT

DT_NONE

DT_CANONICAL

DT_COMPAT

DT_CIRCLE

DT_FINAL

DT_FONT

DT_FRACTION

DT_INITIAL

DT_ISOLATED

DT_MEDIAL

DT_NARROW

DT_NOBREAK

DT_SMALL

DT_SQUARE

DT_SUB

DT_SUPER

DT_VERTICAL

DT_WIDE

DT_COUNT

JT_NON_JOINING

JT_JOIN_CAUSING

JT_DUAL_JOINING

JT_LEFT_JOINING

JT_RIGHT_JOINING

JT_TRANSPARENT

JT_COUNT

JG_NO_JOINING_GROUP

JG_AIN

JG_ALAPH

JG_ALEF

JG_BEH

JG_BETH

JG_DAL

JG_DALATH_RISH

JG_E

JG_FEH

JG_FINAL_SEMKATH

JG_GAF

JG_GAMAL

JG_HAH

JG_TEH_MARBUTA_GOAL

JG_HAMZA_ON_HEH_GOAL

JG_HE

JG_HEH

JG_HEH_GOAL

JG_HETH

JG_KAF

JG_KAPH

JG_KNOTTED_HEH

JG_LAM

JG_LAMADH

JG_MEEM

JG_MIM

JG_NOON

JG_NUN

JG_PE

JG_QAF

JG_QAPH

JG_REH

JG_REVERSED_PE

JG_SAD

JG_SADHE

JG_SEEN

JG_SEMKATH

JG_SHIN

JG_SWASH_KAF

JG_SYRIAC_WAW

JG_TAH

JG_TAW

JG_TEH_MARBUTA

JG_TETH

JG_WAW

JG_YEH

JG_YEH_BARREE

JG_YEH_WITH_TAIL

JG_YUDH

JG_YUDH_HE

JG_ZAIN

JG_FE

JG_KHAPH

JG_ZHAIN

JG_BURUSHASKI_YEH_BARREE

JG_FARSI_YEH

JG_NYA

JG_ROHINGYA_YEH

JG_MANICHAEAN_ALEPH

JG_MANICHAEAN_AYIN

JG_MANICHAEAN_BETH

JG_MANICHAEAN_DALETH

JG_MANICHAEAN_DHAMEDH

JG_MANICHAEAN_FIVE

JG_MANICHAEAN_GIMEL

JG_MANICHAEAN_HETH

JG_MANICHAEAN_HUNDRED

JG_MANICHAEAN_KAPH

JG_MANICHAEAN_LAMEDH

JG_MANICHAEAN_MEM

JG_MANICHAEAN_NUN

JG_MANICHAEAN_ONE

JG_MANICHAEAN_PE

JG_MANICHAEAN_QOPH

JG_MANICHAEAN_RESH

JG_MANICHAEAN_SADHE

JG_MANICHAEAN_SAMEKH

JG_MANICHAEAN_TAW

JG_MANICHAEAN_TEN

JG_MANICHAEAN_TETH

JG_MANICHAEAN_THAMEDH

JG_MANICHAEAN_TWENTY

JG_MANICHAEAN_WAW

JG_MANICHAEAN_YODH

JG_MANICHAEAN_ZAYIN

JG_STRAIGHT_WAW

JG_COUNT

GCB_OTHER

GCB_CONTROL

GCB_CR

GCB_EXTEND

GCB_L

GCB_LF

GCB_LV

GCB_LVT

GCB_T

GCB_V

GCB_SPACING_MARK

GCB_PREPEND

GCB_REGIONAL_INDICATOR

GCB_COUNT

WB_OTHER

WB_ALETTER

WB_FORMAT

WB_KATAKANA

WB_MIDLETTER

WB_MIDNUM

WB_NUMERIC

WB_EXTENDNUMLET

WB_CR

WB_EXTEND

WB_LF

WB_MIDNUMLET

WB_NEWLINE

WB_REGIONAL_INDICATOR

WB_HEBREW_LETTER

WB_SINGLE_QUOTE

WB_DOUBLE_QUOTE

WB_COUNT

SB_OTHER

SB_ATERM

SB_CLOSE

SB_FORMAT

SB_LOWER

SB_NUMERIC

SB_OLETTER

SB_SEP

SB_SP

SB_STERM

SB_UPPER

SB_CR

SB_EXTEND

SB_LF

SB_SCONTINUE

SB_COUNT

LB_UNKNOWN

LB_AMBIGUOUS

LB_ALPHABETIC

LB_BREAK_BOTH

LB_BREAK_AFTER

LB_BREAK_BEFORE

LB_MANDATORY_BREAK

LB_CONTINGENT_BREAK

LB_CLOSE_PUNCTUATION

LB_COMBINING_MARK

LB_CARRIAGE_RETURN

LB_EXCLAMATION

LB_GLUE

LB_HYPHEN

LB_IDEOGRAPHIC

LB_INSEPARABLE

LB_INSEPERABLE

LB_INFIX_NUMERIC

LB_LINE_FEED

LB_NONSTARTER

LB_NUMERIC

LB_OPEN_PUNCTUATION

LB_POSTFIX_NUMERIC

LB_PREFIX_NUMERIC

LB_QUOTATION

LB_COMPLEX_CONTEXT

LB_SURROGATE

LB_SPACE

LB_BREAK_SYMBOLS

LB_ZWSPACE

LB_NEXT_LINE

LB_WORD_JOINER

LB_H2

LB_H3

LB_JL

LB_JT

LB_JV

LB_CLOSE_PARENTHESIS

LB_CONDITIONAL_JAPANESE_STARTER

LB_HEBREW_LETTER

LB_REGIONAL_INDICATOR

LB_COUNT

NT_NONE

NT_DECIMAL

NT_DIGIT

NT_NUMERIC

NT_COUNT

HST_NOT_APPLICABLE

HST_LEADING_JAMO

HST_VOWEL_JAMO

HST_TRAILING_JAMO

HST_LV_SYLLABLE

HST_LVT_SYLLABLE

HST_COUNT

NO_NUMERIC_VALUE

PROPERTY_IDS_UNARY_OPERATOR Since: 8.4

PROPERTY_ID_COMPAT_MATH_START Since: 8.4

PROPERTY_ID_COMPAT_MATH_CONTINUE Since: 8.4

Methods

static bool|null
hasBinaryProperty(int|string $codepoint, int $property)

Check a binary Unicode property for a code point

static array|null
charAge(int|string $codepoint)

No description

static int|null
charDigitValue(int|string $codepoint)

No description

static int|null
charDirection(int|string $codepoint)

Get bidirectional category value for a code point

static int|null
charFromName(string $name, int $type = IntlChar::UNICODE_CHAR_NAME)

No description

static string|int|null
charMirror(int|string $codepoint)

No description

static string|null
charName(int|string $codepoint, int $type = IntlChar::UNICODE_CHAR_NAME)

Retrieve the name of a Unicode character

static int|null
charType(int|string $codepoint)

Get the general category value for a code point

static string|null
chr(mixed $codepoint)

Return Unicode character by code point value

static int|false|null
digit(int|string $codepoint, int $base = 10)

Get the decimal digit value of a code point for a given radix

static 
enumCharNames(int|string $start, int|string $end, callable $callback, int $type = IntlChar::UNICODE_CHAR_NAME)

Enumerate all assigned Unicode characters within a range

static void
enumCharTypes(callable $callback)

Enumerate all code points with their Unicode general categories

static string|int|null
foldCase(int|string $codepoint, int $options = IntlChar::FOLD_CASE_DEFAULT)

Perform case folding on a code point

static int
forDigit(int $digit, int $base = 10)

Get character representation for a given digit and radix

static string|int|null
getBidiPairedBracket(int|string $codepoint)

Get the paired bracket character for a code point

static int|null
getBlockCode(int|string $codepoint)

Get the Unicode allocation block containing a code point

static int|null
getCombiningClass(int|string $codepoint)

Get the combining class of a code point

static string|false|null
getFC_NFKC_Closure(int|string $codepoint)

Get the FC_NFKC_Closure property for a code point

static int
getIntPropertyMaxValue(int $property)

Get the max value for a Unicode property

static int
getIntPropertyMinValue(int $property)

Get the min value for a Unicode property

static int|null
getIntPropertyValue(int|string $codepoint, int $property)

Get the value for a Unicode property for a code point

static float|null
getNumericValue(int|string $codepoint)

Get the numeric value for a Unicode code point

static int
getPropertyEnum(string $alias)

Get the property constant value for a given property name

static string|false
getPropertyName(int $property, int $type = IntlChar::LONG_PROPERTY_NAME)

Get the Unicode name for a property

static int
getPropertyValueEnum(int $property, string $name)

Get the property value for a given value name

static string|false
getPropertyValueName(int $property, int $value, int $type = IntlChar::LONG_PROPERTY_NAME)

Get the Unicode name for a property value

static array
getUnicodeVersion()

Get the Unicode version

static bool|null
isalnum(int|string $codepoint)

Check if code point is an alphanumeric character

static bool|null
isalpha(int|string $codepoint)

Check if code point is a letter character

static bool|null
isbase(int|string $codepoint)

Check if code point is a base character

static bool|null
isblank(int|string $codepoint)

Check if code point is a "blank" or "horizontal space" character

static bool|null
iscntrl(int|string $codepoint)

Check if code point is a control character

static bool|null
isdefined(int|string $codepoint)

Check whether the code point is defined

static bool|null
isdigit(int|string $codepoint)

Check if code point is a digit character

static bool|null
isgraph(int|string $codepoint)

Check if code point is a graphic character

static bool|null
isIDIgnorable(int|string $codepoint)

Check if code point is an ignorable character

static bool|null
isIDPart(int|string $codepoint)

Check if code point is permissible in an identifier

static bool|null
isIDStart(int|string $codepoint)

Check if code point is permissible as the first character in an identifier

static bool|null
isISOControl(int|string $codepoint)

Check if code point is an ISO control code

static bool|null
isJavaIDPart(int|string $codepoint)

Check if code point is permissible in a Java identifier

static bool|null
isJavaIDStart(int|string $codepoint)

Check if code point is permissible as the first character in a Java identifier

static bool|null
isJavaSpaceChar(int|string $codepoint)

Check if code point is a space character according to Java

static bool|null
islower(int|string $codepoint)

Check if code point is a lowercase letter

static bool|null
isMirrored(int|string $codepoint)

Check if code point has the Bidi_Mirrored property

static bool|null
isprint(int|string $codepoint)

Check if code point is a printable character

static bool|null
ispunct(int|string $codepoint)

Check if code point is punctuation character

static bool|null
isspace(int|string $codepoint)

Check if code point is a space character

static bool|null
istitle(int|string $codepoint)

Check if code point is a titlecase letter

static bool|null
isUAlphabetic(int|string $codepoint)

Check if code point has the Alphabetic Unicode property

static bool|null
isULowercase(int|string $codepoint)

Check if code point has the Lowercase Unicode property

static bool|null
isupper(int|string $codepoint)

Check if code point has the general category "Lu" (uppercase letter)

static bool|null
isUUppercase(int|string $codepoint)

Check if code point has the Uppercase Unicode property

static bool|null
isUWhiteSpace(int|string $codepoint)

Check if code point has the White_Space Unicode property

static bool|null
isWhitespace(int|string $codepoint)

Check if code point is a whitespace character according to ICU

static bool|null
isxdigit(int|string $codepoint)

Check if code point is a hexadecimal digit

static int|null
ord(int|string $character)

Return Unicode code point value of character

static string|int|null
tolower(int|string $codepoint)

Make Unicode character lowercase

static string|int|null
totitle(int|string $codepoint)

Make Unicode character titlecase

static string|int|null
toupper(int|string $codepoint)

Make Unicode character uppercase

Details

static bool|null hasBinaryProperty(int|string $codepoint, int $property)

Since: 7.0

Check a binary Unicode property for a code point

Parameters

int|string $codepoint

The integer codepoint value (e.g. 0x2603 for U+2603 SNOWMAN), or the character encoded as a UTF-8 string (e.g. "\u{2603}")

int $property

The Unicode property to lookup (see the IntlChar::PROPERTY_* constants).

Return Value

bool|null

Returns TRUE or FALSE according to the binary Unicode property value for codepoint. Also FALSE if property is out of bounds or if the Unicode version does not have data for the property at all, or not for this code point. Or NULL if codepoint is out of bounds.

static array|null charAge(int|string $codepoint)

Since: 7.0

No description

Parameters

int|string $codepoint

The integer codepoint value (e.g. 0x2603 for U+2603 SNOWMAN), or the character encoded as a UTF-8 string (e.g. "\u{2603}")

Return Value

array|null

The Unicode version number, as an array. For example, version 1.3.31.2 would be represented as [1, 3, 31, 2]. Or NULL if codepoint is out of bounds.

static int|null charDigitValue(int|string $codepoint)

Since: 7.0

No description

Parameters

int|string $codepoint

The integer codepoint value (e.g. 0x2603 for U+2603 SNOWMAN), or the character encoded as a UTF-8 string (e.g. "\u{2603}")

Return Value

int|null

The decimal digit value of codepoint, or -1 if it is not a decimal digit character. Or NULL if codepoint is out of bounds.

static int|null charDirection(int|string $codepoint)

Since: 7.0

Get bidirectional category value for a code point

Parameters

int|string $codepoint

The integer codepoint value (e.g. 0x2603 for U+2603 SNOWMAN), or the character encoded as a UTF-8 string (e.g. "\u{2603}")

Return Value

int|null

The bidirectional category value; one of the following constants:

  • IntlChar::CHAR_DIRECTION_LEFT_TO_RIGHT
  • IntlChar::CHAR_DIRECTION_RIGHT_TO_LEFT
  • IntlChar::CHAR_DIRECTION_EUROPEAN_NUMBER
  • IntlChar::CHAR_DIRECTION_EUROPEAN_NUMBER_SEPARATOR
  • IntlChar::CHAR_DIRECTION_EUROPEAN_NUMBER_TERMINATOR
  • IntlChar::CHAR_DIRECTION_ARABIC_NUMBER
  • IntlChar::CHAR_DIRECTION_COMMON_NUMBER_SEPARATOR
  • IntlChar::CHAR_DIRECTION_BLOCK_SEPARATOR
  • IntlChar::CHAR_DIRECTION_SEGMENT_SEPARATOR
  • IntlChar::CHAR_DIRECTION_WHITE_SPACE_NEUTRAL
  • IntlChar::CHAR_DIRECTION_OTHER_NEUTRAL
  • IntlChar::CHAR_DIRECTION_LEFT_TO_RIGHT_EMBEDDING
  • IntlChar::CHAR_DIRECTION_LEFT_TO_RIGHT_OVERRIDE
  • IntlChar::CHAR_DIRECTION_RIGHT_TO_LEFT_ARABIC
  • IntlChar::CHAR_DIRECTION_RIGHT_TO_LEFT_EMBEDDING
  • IntlChar::CHAR_DIRECTION_RIGHT_TO_LEFT_OVERRIDE
  • IntlChar::CHAR_DIRECTION_POP_DIRECTIONAL_FORMAT
  • IntlChar::CHAR_DIRECTION_DIR_NON_SPACING_MARK
  • IntlChar::CHAR_DIRECTION_BOUNDARY_NEUTRAL
  • IntlChar::CHAR_DIRECTION_FIRST_STRONG_ISOLATE
  • IntlChar::CHAR_DIRECTION_LEFT_TO_RIGHT_ISOLATE
  • IntlChar::CHAR_DIRECTION_RIGHT_TO_LEFT_ISOLATE
  • IntlChar::CHAR_DIRECTION_POP_DIRECTIONAL_ISOLATE
  • IntlChar::CHAR_DIRECTION_CHAR_DIRECTION_COUNT

Or NULL if codepoint is out of bounds.

static int|null charFromName(string $name, int $type = IntlChar::UNICODE_CHAR_NAME)

Since: 7.0

No description

Parameters

string $name

Full name of the Unicode character.

int $type

[optional]

Which set of names to use for the lookup. Can be any of these constants:

  • IntlChar::UNICODE_CHAR_NAME (default)
  • IntlChar::UNICODE_10_CHAR_NAME
  • IntlChar::EXTENDED_CHAR_NAME
  • IntlChar::CHAR_NAME_ALIAS
  • IntlChar::CHAR_NAME_CHOICE_COUNT

Return Value

int|null

The Unicode value of the code point with the given name (as an integer), or NULL if there is no such code point.

static string|int|null charMirror(int|string $codepoint)

No description

Parameters

int|string $codepoint

The integer codepoint value (e.g. 0x2603 for U+2603 SNOWMAN), or the character encoded as a UTF-8 string (e.g. "\u{2603}")

Return Value

string|int|null

Returns another Unicode code point that may serve as a mirror-image substitute, or codepoint itself if there is no such mapping or codepoint does not have the Bidi_Mirrored property. The return type will be integer unless the code point was passed as a UTF-8 string, in which case a string will be returned. Or NULL if codepoint will be out of bound.

static string|null charName(int|string $codepoint, int $type = IntlChar::UNICODE_CHAR_NAME)

Since: 7.0

Retrieve the name of a Unicode character

Parameters

int|string $codepoint

The integer codepoint value (e.g. 0x2603 for U+2603 SNOWMAN), or the character encoded as a UTF-8 string (e.g. "\u{2603}")

int $type

[optional] Which set of names to use for the lookup. Can be any of these constants:

  • IntlChar::UNICODE_CHAR_NAME (default)
  • IntlChar::UNICODE_10_CHAR_NAME
  • IntlChar::EXTENDED_CHAR_NAME
  • IntlChar::CHAR_NAME_ALIAS
  • IntlChar::CHAR_NAME_CHOICE_COUNT

Return Value

string|null

The corresponding name, or an empty string if there is no name for this character, or NULL if codepoint is out of bounds.

static int|null charType(int|string $codepoint)

Since: 7.0

Get the general category value for a code point

Parameters

int|string $codepoint

The integer codepoint value (e.g. 0x2603 for U+2603 SNOWMAN), or the character encoded as a UTF-8 string (e.g. "\u{2603}")

Return Value

int|null

Returns the general category type, which may be one of the following constants:

  • IntlChar::CHAR_CATEGORY_UNASSIGNED
  • IntlChar::CHAR_CATEGORY_GENERAL_OTHER_TYPES
  • IntlChar::CHAR_CATEGORY_UPPERCASE_LETTER
  • IntlChar::CHAR_CATEGORY_LOWERCASE_LETTER
  • IntlChar::CHAR_CATEGORY_TITLECASE_LETTER
  • IntlChar::CHAR_CATEGORY_MODIFIER_LETTER
  • IntlChar::CHAR_CATEGORY_OTHER_LETTER
  • IntlChar::CHAR_CATEGORY_NON_SPACING_MARK
  • IntlChar::CHAR_CATEGORY_ENCLOSING_MARK
  • IntlChar::CHAR_CATEGORY_COMBINING_SPACING_MARK
  • IntlChar::CHAR_CATEGORY_DECIMAL_DIGIT_NUMBER
  • IntlChar::CHAR_CATEGORY_LETTER_NUMBER
  • IntlChar::CHAR_CATEGORY_OTHER_NUMBER
  • IntlChar::CHAR_CATEGORY_SPACE_SEPARATOR
  • IntlChar::CHAR_CATEGORY_LINE_SEPARATOR
  • IntlChar::CHAR_CATEGORY_PARAGRAPH_SEPARATOR
  • IntlChar::CHAR_CATEGORY_CONTROL_CHAR
  • IntlChar::CHAR_CATEGORY_FORMAT_CHAR
  • IntlChar::CHAR_CATEGORY_PRIVATE_USE_CHAR
  • IntlChar::CHAR_CATEGORY_SURROGATE
  • IntlChar::CHAR_CATEGORY_DASH_PUNCTUATION
  • IntlChar::CHAR_CATEGORY_START_PUNCTUATION
  • IntlChar::CHAR_CATEGORY_END_PUNCTUATION
  • IntlChar::CHAR_CATEGORY_CONNECTOR_PUNCTUATION
  • IntlChar::CHAR_CATEGORY_OTHER_PUNCTUATION
  • IntlChar::CHAR_CATEGORY_MATH_SYMBOL
  • IntlChar::CHAR_CATEGORY_CURRENCY_SYMBOL
  • IntlChar::CHAR_CATEGORY_MODIFIER_SYMBOL
  • IntlChar::CHAR_CATEGORY_OTHER_SYMBOL
  • IntlChar::CHAR_CATEGORY_INITIAL_PUNCTUATION
  • IntlChar::CHAR_CATEGORY_FINAL_PUNCTUATION
  • IntlChar::CHAR_CATEGORY_CHAR_CATEGORY_COUNT

Or NULL if codepoint is out of bound.

static string|null chr(mixed $codepoint)

Since: 7.0

Return Unicode character by code point value

Parameters

mixed $codepoint

The integer codepoint value (e.g. 0x2603 for U+2603 SNOWMAN), or the character encoded as a UTF-8 string (e.g. "\u{2603}")

Return Value

string|null

A string containing the single character specified by the Unicode code point value. Or NULL if codepoint is out of bound.

static int|false|null digit(int|string $codepoint, int $base = 10)

Since: 7.0

Get the decimal digit value of a code point for a given radix

Parameters

int|string $codepoint

The integer codepoint value (e.g. 0x2603 for U+2603 SNOWMAN), or the character encoded as a UTF-8 string (e.g. "\u{2603}")

int $base

The radix (defaults to 10).

Return Value

int|false|null

Returns the numeric value represented by the character in the specified radix, or FALSE if there is no value or if the value exceeds the radix, or NULL if codepoint is out of bound.

static enumCharNames(int|string $start, int|string $end, callable $callback, int $type = IntlChar::UNICODE_CHAR_NAME)

Since: 7.0

Enumerate all assigned Unicode characters within a range

Parameters

int|string $start

The first code point in the enumeration range.

int|string $end

One more than the last code point in the enumeration range (the first one after the range).

callable $callback

The function that is to be called for each character name. The following three arguments will be passed into it:

  • integer $codepoint - The numeric code point value
  • integer $nameChoice - The same value as the nameChoice parameter below
  • string $name - The name of the character
int $type

[optional]

Selector for which kind of names to enumerate. Can be any of these constants:

  • IntlChar::UNICODE_CHAR_NAME (default)
  • IntlChar::UNICODE_10_CHAR_NAME
  • IntlChar::EXTENDED_CHAR_NAME
  • IntlChar::CHAR_NAME_ALIAS
  • IntlChar::CHAR_NAME_CHOICE_COUNT

static void enumCharTypes(callable $callback)

Since: 7.0

Enumerate all code points with their Unicode general categories

Parameters

callable $callback

The function that is to be called for each contiguous range of code points with the same general category. The following three arguments will be passed into it:

  • integer $start - The starting code point of the range
  • integer $end - The ending code point of the range
  • integer $name - The category type (one of the IntlChar::CHAR_CATEGORY_* constants)

Return Value

void

static string|int|null foldCase(int|string $codepoint, int $options = IntlChar::FOLD_CASE_DEFAULT)

Since: 7.0

Perform case folding on a code point

Parameters

int|string $codepoint

The integer codepoint value (e.g. 0x2603 for U+2603 SNOWMAN), or the character encoded as a UTF-8 string (e.g. "\u{2603}")

int $options

[optional] Either IntlChar::FOLD_CASE_DEFAULT (default) or IntlChar::FOLD_CASE_EXCLUDE_SPECIAL_I.

Return Value

string|int|null

Returns the Simple_Case_Folding of the code point, if any; otherwise the code point itself. Returns NULL if codepoint is out of bound.

static int forDigit(int $digit, int $base = 10)

Since: 7.0

Get character representation for a given digit and radix

Parameters

int $digit

The number to convert to a character.

int $base

[optional]

The radix (defaults to 10).

Return Value

int

The character representation (as a string) of the specified digit in the specified radix.

static string|int|null getBidiPairedBracket(int|string $codepoint)

Since: 7.0

Get the paired bracket character for a code point

Parameters

int|string $codepoint

The integer codepoint value (e.g. 0x2603 for U+2603 SNOWMAN), or the character encoded as a UTF-8 string (e.g. "\u{2603}")

Return Value

string|int|null

Returns the paired bracket code point, or codepoint itself if there is no such mapping. The return type will be integer unless the code point was passed as a UTF-8 string, in which case a string will be returned. Or NULL if codepoint is out of bound.

static int|null getBlockCode(int|string $codepoint)

Since: 7.0

Get the Unicode allocation block containing a code point

Parameters

int|string $codepoint

The integer codepoint value (e.g. 0x2603 for U+2603 SNOWMAN), or the character encoded as a UTF-8 string (e.g. "\u{2603}")

Return Value

int|null

Returns the block value for codepoint, or NULL if codepoint is out of bound. See the IntlChar::BLOCKCODE* constants for possible return values.

static int|null getCombiningClass(int|string $codepoint)

Since: 7.0

Get the combining class of a code point

Parameters

int|string $codepoint

The integer codepoint value (e.g. 0x2603 for U+2603 SNOWMAN), or the character encoded as a UTF-8 string (e.g. "\u{2603}")

Return Value

int|null

Returns the combining class of the character. Or NULL if codepoint is out of bound.

static string|false|null getFC_NFKC_Closure(int|string $codepoint)

Since: 7.0

Get the FC_NFKC_Closure property for a code point

Parameters

int|string $codepoint

The integer codepoint value (e.g. 0x2603 for U+2603 SNOWMAN), or the character encoded as a UTF-8 string (e.g. "\u{2603}")

Return Value

string|false|null

Returns the FC_NFKC_Closure property string for the codepoint, or an empty string if there is none, or NULL if codepoint is out of bound, or FALSE if there was an error.

static int getIntPropertyMaxValue(int $property)

Since: 7.0

Get the max value for a Unicode property

Parameters

int $property

The Unicode property to lookup (see the IntlChar::PROPERTY_* constants).

Return Value

int

The maximum value returned by {IntlChar for a Unicode property. <=0 if the property selector is out of range.

static int getIntPropertyMinValue(int $property)

Since: 7.0

Get the min value for a Unicode property

Parameters

int $property

The Unicode property to lookup (see the IntlChar::PROPERTY_* constants).

Return Value

int

The minimum value returned by {IntlChar for a Unicode property. 0 if the property selector is out of range.

static int|null getIntPropertyValue(int|string $codepoint, int $property)

Since: 7.0

Get the value for a Unicode property for a code point

Parameters

int|string $codepoint

The integer codepoint value (e.g. 0x2603 for U+2603 SNOWMAN), or the character encoded as a UTF-8 string (e.g. "\u{2603}")

int $property

The Unicode property to lookup (see the IntlChar::PROPERTY_* constants).

Return Value

int|null

Returns the numeric value that is directly the property value or, for enumerated properties, corresponds to the numeric value of the enumerated constant of the respective property value enumeration type.

Returns 0 or 1 (for FALSE/TRUE) for binary Unicode properties.

Returns a bit-mask for mask properties.

Returns 0 if property is out of bounds or if the Unicode version does not have data for the property at all, or not for this code point.

Returns NULL if codepoint is out of bound.

static float|null getNumericValue(int|string $codepoint)

Since: 7.0

Get the numeric value for a Unicode code point

Parameters

int|string $codepoint

The integer codepoint value (e.g. 0x2603 for U+2603 SNOWMAN), or the character encoded as a UTF-8 string (e.g. "\u{2603}")

Return Value

float|null

Numeric value of codepoint, or float(-123456789) if none is defined, or NULL if codepoint is out of bound.

static int getPropertyEnum(string $alias)

Since: 7.0

Get the property constant value for a given property name

Parameters

string $alias

The property name to be matched. The name is compared using "loose matching" as described in PropertyAliases.txt.

Return Value

int

Returns an IntlChar::PROPERTY_ constant value, or IntlChar::PROPERTY_INVALID_CODE if the given name does not match any property.

static string|false getPropertyName(int $property, int $type = IntlChar::LONG_PROPERTY_NAME)

Since: 7.0

Get the Unicode name for a property

Parameters

int $property

The Unicode property to lookup (see the IntlChar::PROPERTY_* constants).

IntlChar::PROPERTY_INVALID_CODE should not be used. Also, if property is out of range, FALSE is returned.

int $type

Selector for which name to get. If out of range, FALSE is returned.

All properties have a long name. Most have a short name, but some do not. Unicode allows for additional names; if present these will be returned by adding 1, 2, etc. to IntlChar::LONG_PROPERTY_NAME.

Return Value

string|false

Returns the name, or FALSE if either the property or the nameChoice is out of range.

If a given nameChoice returns FALSE, then all larger values of nameChoice will return FALSE, with one exception: if FALSE is returned for IntlChar::SHORT_PROPERTY_NAME, then IntlChar::LONG_PROPERTY_NAME (and higher) may still return a non-FALSE value.

static int getPropertyValueEnum(int $property, string $name)

Since: 7.0

Get the property value for a given value name

Parameters

int $property

The Unicode property to lookup (see the IntlChar::PROPERTY_* constants). If out of range, or this method doesn't work with the given value, IntlChar::PROPERTY_INVALID_CODE is returned

string $name

The value name to be matched. The name is compared using "loose matching" as described in PropertyValueAliases.txt.

Return Value

int

Returns the corresponding value integer, or IntlChar::PROPERTY_INVALID_CODE if the given name does not match any value of the given property, or if the property is invalid.

static string|false getPropertyValueName(int $property, int $value, int $type = IntlChar::LONG_PROPERTY_NAME)

Since: 7.0

Get the Unicode name for a property value

Parameters

int $property

The Unicode property to lookup (see the IntlChar::PROPERTY_* constants). If out of range, or this method doesn't work with the given value, FALSE is returned.

int $value

Selector for a value for the given property. If out of range, FALSE is returned.

In general, valid values range from 0 up to some maximum. There are a couple exceptions:

  • IntlChar::PROPERTY_BLOCK values begin at the non-zero value IntlChar::BLOCK_CODE_BASIC_LATIN
  • IntlChar::PROPERTY_CANONICAL_COMBINING_CLASS values are not contiguous and range from 0..240.
int $type

[optional]

Selector for which name to get. If out of range, FALSE is returned. All values have a long name. Most have a short name, but some do not. Unicode allows for additional names; if present these will be returned by adding 1, 2, etc. to IntlChar::LONG_PROPERTY_NAME.

Return Value

string|false

Returns the name, or FALSE if either the property or the nameChoice is out of range. If a given nameChoice returns FALSE, then all larger values of nameChoice will return FALSE, with one exception: if FALSE is returned for IntlChar::SHORT_PROPERTY_NAME, then IntlChar::LONG_PROPERTY_NAME (and higher) may still return a non-FALSE value.

static array getUnicodeVersion()

Since: 7.0

Get the Unicode version

Return Value

array

An array containing the Unicode version number.

static bool|null isalnum(int|string $codepoint)

Since: 7.0

Check if code point is an alphanumeric character

Parameters

int|string $codepoint

The integer codepoint value (e.g. 0x2603 for U+2603 SNOWMAN), or the character encoded as a UTF-8 string (e.g. "\u{2603}")

Return Value

bool|null

Returns TRUE if codepoint is an alphanumeric character, FALSE if not, NULL if codepoint is out of bound.

static bool|null isalpha(int|string $codepoint)

Since: 7.0

Check if code point is a letter character

Parameters

int|string $codepoint

The integer codepoint value (e.g. 0x2603 for U+2603 SNOWMAN), or the character encoded as a UTF-8 string (e.g. "\u{2603}")

Return Value

bool|null

Returns TRUE if codepoint is a letter character, FALSE if not, NULL if codepoint is out of bound.

static bool|null isbase(int|string $codepoint)

Since: 7.0

Check if code point is a base character

Parameters

int|string $codepoint

The integer codepoint value (e.g. 0x2603 for U+2603 SNOWMAN), or the character encoded as a UTF-8 string (e.g. "\u{2603}")

Return Value

bool|null

Returns TRUE if codepoint is a base character, FALSE if not, NULL if codepoint is out of bound.

static bool|null isblank(int|string $codepoint)

Since: 7.0

Check if code point is a "blank" or "horizontal space" character

Parameters

int|string $codepoint

The integer codepoint value (e.g. 0x2603 for U+2603 SNOWMAN), or the character encoded as a UTF-8 string (e.g. "\u{2603}")

Return Value

bool|null

Returns TRUE if codepoint is either a "blank" or "horizontal space" character, FALSE if not, NULL if codepoint is out of bound.

static bool|null iscntrl(int|string $codepoint)

Since: 7.0

Check if code point is a control character

Parameters

int|string $codepoint

The integer codepoint value (e.g. 0x2603 for U+2603 SNOWMAN), or the character encoded as a UTF-8 string (e.g. "\u{2603}")

Return Value

bool|null

Returns TRUE if codepoint is a control character, FALSE if not, NULL if codepoint is out of bound.

static bool|null isdefined(int|string $codepoint)

Since: 7.0

Check whether the code point is defined

Parameters

int|string $codepoint

The integer codepoint value (e.g. 0x2603 for U+2603 SNOWMAN), or the character encoded as a UTF-8 string (e.g. "\u{2603}")

Return Value

bool|null

Returns TRUE if codepoint is a defined character, FALSE if not, NULL if codepoint is out of bound.

static bool|null isdigit(int|string $codepoint)

Since: 7.0

Check if code point is a digit character

Parameters

int|string $codepoint

The integer codepoint value (e.g. 0x2603 for U+2603 SNOWMAN), or the character encoded as a UTF-8 string (e.g. "\u{2603}")

Return Value

bool|null

Returns TRUE if codepoint is a digit character, FALSE if not, NULL if codepoint is out of bound.

static bool|null isgraph(int|string $codepoint)

Since: 7.0

Check if code point is a graphic character

Parameters

int|string $codepoint

The integer codepoint value (e.g. 0x2603 for U+2603 SNOWMAN), or the character encoded as a UTF-8 string (e.g. "\u{2603}")

Return Value

bool|null

Returns TRUE if codepoint is a "graphic" character, FALSE if not, NULL if codepoint is out of bound.

static bool|null isIDIgnorable(int|string $codepoint)

Since: 7.0

Check if code point is an ignorable character

Parameters

int|string $codepoint

The integer codepoint value (e.g. 0x2603 for U+2603 SNOWMAN), or the character encoded as a UTF-8 string (e.g. "\u{2603}")

Return Value

bool|null

Returns TRUE if codepoint is ignorable in identifiers, FALSE if not, NULL if codepoint is out of bound.

static bool|null isIDPart(int|string $codepoint)

Since: 7.0

Check if code point is permissible in an identifier

Parameters

int|string $codepoint

The integer codepoint value (e.g. 0x2603 for U+2603 SNOWMAN), or the character encoded as a UTF-8 string (e.g. "\u{2603}")

Return Value

bool|null

Returns TRUE if codepoint is the code point may occur in an identifier, FALSE if not, NULL if codepoint is out of bound.

static bool|null isIDStart(int|string $codepoint)

Since: 7.0

Check if code point is permissible as the first character in an identifier

Parameters

int|string $codepoint

The integer codepoint value (e.g. 0x2603 for U+2603 SNOWMAN), or the character encoded as a UTF-8 string (e.g. "\u{2603}")

Return Value

bool|null

Returns TRUE if codepoint may start an identifier, FALSE if not, NULL if codepoint is out of bound.

static bool|null isISOControl(int|string $codepoint)

Since: 7.0

Check if code point is an ISO control code

Parameters

int|string $codepoint

The integer codepoint value (e.g. 0x2603 for U+2603 SNOWMAN), or the character encoded as a UTF-8 string (e.g. "\u{2603}")

Return Value

bool|null

Returns TRUE if codepoint is an ISO control code, FALSE if not, NULL if codepoint is out of bound.

static bool|null isJavaIDPart(int|string $codepoint)

Since: 7.0

Check if code point is permissible in a Java identifier

Parameters

int|string $codepoint

The integer codepoint value (e.g. 0x2603 for U+2603 SNOWMAN), or the character encoded as a UTF-8 string (e.g. "\u{2603}")

Return Value

bool|null

Returns TRUE if codepoint may occur in a Java identifier, FALSE if not, NULL if codepoint is out of bound.

static bool|null isJavaIDStart(int|string $codepoint)

Since: 7.0

Check if code point is permissible as the first character in a Java identifier

Parameters

int|string $codepoint

The integer codepoint value (e.g. 0x2603 for U+2603 SNOWMAN), or the character encoded as a UTF-8 string (e.g. "\u{2603}")

Return Value

bool|null

Returns TRUE if codepoint may start a Java identifier, FALSE if not, NULL if codepoint is out of bound.

static bool|null isJavaSpaceChar(int|string $codepoint)

Since: 7.0

Check if code point is a space character according to Java

Parameters

int|string $codepoint

The integer codepoint value (e.g. 0x2603 for U+2603 SNOWMAN), or the character encoded as a UTF-8 string (e.g. "\u{2603}")

Return Value

bool|null

Returns TRUE if codepoint is a space character according to Java, FALSE if not, NULL if codepoint is out of bound.

static bool|null islower(int|string $codepoint)

Since: 7.0

Check if code point is a lowercase letter

Parameters

int|string $codepoint

The integer codepoint value (e.g. 0x2603 for U+2603 SNOWMAN), or the character encoded as a UTF-8 string (e.g. "\u{2603}")

Return Value

bool|null

Returns TRUE if codepoint is an Ll lowercase letter, FALSE if not, NULL if codepoint is out of bound.

static bool|null isMirrored(int|string $codepoint)

Since: 7.0

Check if code point has the Bidi_Mirrored property

Parameters

int|string $codepoint

The integer codepoint value (e.g. 0x2603 for U+2603 SNOWMAN), or the character encoded as a UTF-8 string (e.g. "\u{2603}")

Return Value

bool|null

Returns TRUE if codepoint has the Bidi_Mirrored property, FALSE if not, NULL if codepoint is out of bound.

static bool|null isprint(int|string $codepoint)

Since: 7.0

Check if code point is a printable character

Parameters

int|string $codepoint

The integer codepoint value (e.g. 0x2603 for U+2603 SNOWMAN), or the character encoded as a UTF-8 string (e.g. "\u{2603}")

Return Value

bool|null

Returns TRUE if codepoint is a printable character, FALSE if not, NULL if codepoint is out of bound.

static bool|null ispunct(int|string $codepoint)

Since: 7.0

Check if code point is punctuation character

Parameters

int|string $codepoint

The integer codepoint value (e.g. 0x2603 for U+2603 SNOWMAN), or the character encoded as a UTF-8 string (e.g. "\u{2603}")

Return Value

bool|null

Returns TRUE if codepoint is a punctuation character, FALSE if not, NULL if codepoint is out of bound.

static bool|null isspace(int|string $codepoint)

Since: 7.0

Check if code point is a space character

Parameters

int|string $codepoint

The integer codepoint value (e.g. 0x2603 for U+2603 SNOWMAN), or the character encoded as a UTF-8 string (e.g. "\u{2603}")

Return Value

bool|null

Returns TRUE if codepoint is a space character, FALSE if not, NULL if codepoint is out of bound.

static bool|null istitle(int|string $codepoint)

Since: 7.0

Check if code point is a titlecase letter

Parameters

int|string $codepoint

The integer codepoint value (e.g. 0x2603 for U+2603 SNOWMAN), or the character encoded as a UTF-8 string (e.g. "\u{2603}")

Return Value

bool|null

Returns TRUE if codepoint is a titlecase letter, FALSE if not, NULL if codepoint is out of bound.

static bool|null isUAlphabetic(int|string $codepoint)

Since: 7.0

Check if code point has the Alphabetic Unicode property

Parameters

int|string $codepoint

The integer codepoint value (e.g. 0x2603 for U+2603 SNOWMAN), or the character encoded as a UTF-8 string (e.g. "\u{2603}")

Return Value

bool|null

Returns TRUE if codepoint has the Alphabetic Unicode property, FALSE if not, NULL if codepoint is out of bound.

static bool|null isULowercase(int|string $codepoint)

Since: 7.0

Check if code point has the Lowercase Unicode property

Parameters

int|string $codepoint

The integer codepoint value (e.g. 0x2603 for U+2603 SNOWMAN), or the character encoded as a UTF-8 string (e.g. "\u{2603}")

Return Value

bool|null

Returns TRUE if codepoint has the Lowercase Unicode property, FALSE if not, NULL if codepoint is out of bound.

static bool|null isupper(int|string $codepoint)

Since: 7.0

Check if code point has the general category "Lu" (uppercase letter)

Parameters

int|string $codepoint

The integer codepoint value (e.g. 0x2603 for U+2603 SNOWMAN), or the character encoded as a UTF-8 string (e.g. "\u{2603}")

Return Value

bool|null

Returns TRUE if codepoint is an Lu uppercase letter, FALSE if not, NULL if codepoint is out of bound.

static bool|null isUUppercase(int|string $codepoint)

Since: 7.0

Check if code point has the Uppercase Unicode property

Parameters

int|string $codepoint

The integer codepoint value (e.g. 0x2603 for U+2603 SNOWMAN), or the character encoded as a UTF-8 string (e.g. "\u{2603}")

Return Value

bool|null

Returns TRUE if codepoint has the Uppercase Unicode property, FALSE if not, NULL if codepoint is out of bound.

static bool|null isUWhiteSpace(int|string $codepoint)

Since: 7.0

Check if code point has the White_Space Unicode property

Parameters

int|string $codepoint

The integer codepoint value (e.g. 0x2603 for U+2603 SNOWMAN), or the character encoded as a UTF-8 string (e.g. "\u{2603}")

Return Value

bool|null

Returns TRUE if codepoint has the White_Space Unicode property, FALSE if not, NULL if codepoint is out of bound.

static bool|null isWhitespace(int|string $codepoint)

Since: 7.0

Check if code point is a whitespace character according to ICU

Parameters

int|string $codepoint

The integer codepoint value (e.g. 0x2603 for U+2603 SNOWMAN), or the character encoded as a UTF-8 string (e.g. "\u{2603}")

Return Value

bool|null

Returns TRUE if codepoint is a whitespace character according to ICU, FALSE if not, NULL if codepoint is out of bound.

static bool|null isxdigit(int|string $codepoint)

Since: 7.0

Check if code point is a hexadecimal digit

Parameters

int|string $codepoint

The integer codepoint value (e.g. 0x2603 for U+2603 SNOWMAN), or the character encoded as a UTF-8 string (e.g. "\u{2603}")

Return Value

bool|null

Returns TRUE if codepoint is a hexadecimal character, FALSE if not, NULL if codepoint is out of bound.

static int|null ord(int|string $character)

Since: 7.0

Return Unicode code point value of character

Parameters

int|string $character

A Unicode character.

Return Value

int|null

Returns the Unicode code point value as an integer, NULL if codepoint is out of bound.

static string|int|null tolower(int|string $codepoint)

Since: 7.0

Make Unicode character lowercase

Parameters

int|string $codepoint

The integer codepoint value (e.g. 0x2603 for U+2603 SNOWMAN), or the character encoded as a UTF-8 string (e.g. "\u{2603}")

Return Value

string|int|null

Returns the Simple_Lowercase_Mapping of the code point, if any; otherwise the code point itself. The return type will be integer unless the code point was passed as a UTF-8 string, in which case a string will be returned. Or NULL if codepoint is out of bound.

static string|int|null totitle(int|string $codepoint)

Since: 7.0

Make Unicode character titlecase

Parameters

int|string $codepoint

The integer codepoint value (e.g. 0x2603 for U+2603 SNOWMAN), or the character encoded as a UTF-8 string (e.g. "\u{2603}")

Return Value

string|int|null

Returns the Simple_Titlecase_Mapping of the code point, if any; otherwise the code point itself. The return type will be integer unless the code point was passed as a UTF-8 string, in which case a string will be returned. Or NULL if codepoint is out of bound.

static string|int|null toupper(int|string $codepoint)

Since: 7.0

Make Unicode character uppercase

Parameters

int|string $codepoint

The integer codepoint value (e.g. 0x2603 for U+2603 SNOWMAN), or the character encoded as a UTF-8 string (e.g. "\u{2603}")

Return Value

string|int|null

Returns the Simple_Uppercase_Mapping of the code point, if any; otherwise the code point itself. The return type will be integer unless the code point was passed as a UTF-8 string, in which case a string will be returned. Or NULL if codepoint is out of bound.