System.Globalization.CharUnicodeInfo.GetUnicodeCategory Method

Gets the Unicode category of the specified character.

Syntax

public static UnicodeCategory GetUnicodeCategory (char ch)

Parameters

ch
The Unicode character for which to get the Unicode category.

Returns

A System.Globalization.UnicodeCategory value indicating the category of the specified character.

Remarks

The Unicode characters are divided into categories. A character's category is one of its properties. For example, a character might be an uppercase letter, a lowercase letter, a decimal digit number, a letter number, a connector punctuation, a math symbol, or a currency symbol. The System.Globalization.UnicodeCategory class returns the category of a Unicode character. For more information on Unicode characters, see the tp://go.microsoft.com/fwlink/?linkid=37123.

The CharUnicodeInfo.GetUnicodeCategory(char) method assumes that ch corresponds to a single linguistic character and returns its category. This means that, for surrogate pairs, it returns UnicodeCategory.Surrogate instead of the category to which the surrogate belongs. For example, the Ugaritic alphabet occupies code points U+10380 to U+1039F. The following example uses the char.ConvertFromUtf32(int) method to instantiate a string that represents UGARITIC LETTER ALPA (U+10380), which is the first letter of the Ugaritic alphabet. As the output from the example shows, the char.IsNumber(char) method returns false if it is passed either the high surrogate or the low surrogate of this character.

code reference: System.Globalization.CharUnicodeInfo.GetUnicodeCategory#1

Note that CharUnicodeInfo.GetUnicodeCategory(char) does not always return the same System.Globalization.UnicodeCategory value as the char.GetUnicodeCategory(char) method when passed a particular character as a parameter. The CharUnicodeInfo.GetUnicodeCategory(char) method is designed to reflect the current version of the Unicode standard. In contrast, although the char.GetUnicodeCategory(char) method usually reflects the current version of the Unicode standard, it might return a character's category based on a previous version of the standard, or it might return a category that differs from the current standard to preserve backward compatibility.

Requirements

Namespace: System.Globalization
Assembly: mscorlib (in mscorlib.dll)
Assembly Versions: 2.0.0.0, 4.0.0.0
Since: .NET 2.0