System.Text.RegularExpressions.Regex.Split Method

Splits an input string into an array of substrings at the positions defined by a regular expression pattern specified in the System.Text.RegularExpressions.Regex constructor.

Syntax

public string[] Split (string input)

Parameters

input
The string to split.

Returns

An array of strings.

Remarks

The Regex.Split(string) method is similar to the string.Split(char[]) method, except that Regex.Split(string) splits the string at a delimiter determined by a regular expression instead of a set of characters. The string is split as many times as possible. If no delimiter is found, the return value contains one element whose value is the original input string.
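The no-delimiter case can be checked with a minimal sketch like the following. The class name NoDelimiterExample and the helper SplitOnHyphen are illustrative names, not part of the library.

```csharp
using System;
using System.Text.RegularExpressions;

// Illustrative sketch; NoDelimiterExample and SplitOnHyphen are hypothetical names.
public static class NoDelimiterExample
{
    public static string[] SplitOnHyphen(string input)
    {
        // The pattern is supplied to the Regex constructor, not to Split.
        Regex regex = new Regex("-");
        return regex.Split(input);
    }

    public static void Main()
    {
        // "plumpear" contains no hyphen, so the array holds the input unchanged.
        string[] parts = SplitOnHyphen("plumpear");
        Console.WriteLine(parts.Length);   // 1
        Console.WriteLine(parts[0]);       // plumpear
    }
}
```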

If multiple matches are adjacent to one another, an empty string is inserted into the array. For example, splitting a string on a single hyphen causes the returned array to include an empty string in the position where two adjacent hyphens are found, as the following code shows.

code reference: System.Text.RegularExpressions.Regex.Split#1
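The referenced sample is not reproduced on this page. A minimal sketch of the behavior it describes, assuming the input "plum--pear" and using the illustrative names AdjacentDelimiters and SplitOnHyphen, might look like this:

```csharp
using System;
using System.Text.RegularExpressions;

// Illustrative sketch; AdjacentDelimiters and SplitOnHyphen are hypothetical names.
public static class AdjacentDelimiters
{
    public static string[] SplitOnHyphen(string input)
    {
        Regex regex = new Regex("-");
        return regex.Split(input);
    }

    public static void Main()
    {
        // The two adjacent hyphens produce an empty string between them.
        foreach (string part in SplitOnHyphen("plum--pear"))
        {
            Console.WriteLine("'{0}'", part);
        }
        // Prints:
        //   'plum'
        //   ''
        //   'pear'
    }
}
```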

If a match is found at the beginning or the end of the input string, an empty string is included at the beginning or the end of the returned array. The following example uses the regular expression pattern \d+ to split an input string on numeric characters. Because the string begins and ends with matching numeric characters, the value of the first and last element of the returned array is string.Empty.

code reference: System.Text.RegularExpressions.Regex.Split#21
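The referenced sample is not reproduced on this page. The following sketch illustrates the same behavior under an assumed input string; the names EdgeMatches and SplitOnDigits are illustrative.

```csharp
using System;
using System.Text.RegularExpressions;

// Illustrative sketch; EdgeMatches and SplitOnDigits are hypothetical names.
public static class EdgeMatches
{
    public static string[] SplitOnDigits(string input)
    {
        // \d+ matches one or more consecutive decimal digits.
        Regex regex = new Regex(@"\d+");
        return regex.Split(input);
    }

    public static void Main()
    {
        // The input starts and ends with digits, so the first and last
        // elements of the result are empty strings.
        string[] parts = SplitOnDigits("123ABCDE456FGHIJKL789MNOPQ012");
        for (int i = 0; i < parts.Length; i++)
        {
            Console.WriteLine("{0}: '{1}'", i, parts[i]);
        }
        // Prints:
        //   0: ''
        //   1: 'ABCDE'
        //   2: 'FGHIJKL'
        //   3: 'MNOPQ'
        //   4: ''
    }
}
```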

If capturing parentheses are used in a Regex.Split(string) expression, any captured text is included in the resulting string array. For example, if you split the string "plum-pear" on a hyphen placed within capturing parentheses, the returned array includes a string element that contains the hyphen.

code reference: System.Text.RegularExpressions.Regex.Split#2
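The referenced sample is not reproduced on this page. A minimal sketch of the "plum-pear" case described above, using the illustrative names CapturedDelimiter and SplitKeepingHyphen, could be:

```csharp
using System;
using System.Text.RegularExpressions;

// Illustrative sketch; CapturedDelimiter and SplitKeepingHyphen are hypothetical names.
public static class CapturedDelimiter
{
    public static string[] SplitKeepingHyphen(string input)
    {
        // The parentheses capture the hyphen, so it is kept in the result.
        Regex regex = new Regex("(-)");
        return regex.Split(input);
    }

    public static void Main()
    {
        foreach (string part in SplitKeepingHyphen("plum-pear"))
        {
            Console.WriteLine("'{0}'", part);
        }
        // Prints:
        //   'plum'
        //   '-'
        //   'pear'
    }
}
```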

However, when the regular expression pattern includes multiple sets of capturing parentheses, the behavior of this method depends on the version of the .NET Framework. In the .NET Framework 1.0 and 1.1, if a match is not found within the first set of capturing parentheses, captured text from additional capturing parentheses is not included in the returned array. Starting with the .NET Framework 2.0, all captured text is also added to the returned array. For example, the following code uses two sets of capturing parentheses to extract the elements of a date, including the date delimiters, from a date string. The first set of capturing parentheses captures the hyphen, and the second set captures the forward slash. If the example code is compiled and run under the .NET Framework 1.0 or 1.1, it excludes the slash characters; if it is compiled and run under the .NET Framework 2.0 or later versions, it includes them.

code reference: System.Text.RegularExpressions.Regex.Split#3
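The referenced sample is not reproduced on this page. The sketch below assumes the date string "07/14/2007" and the pattern (-)|(/); the names DateDelimiters and SplitDate are illustrative. The output shown in the comments is the .NET Framework 2.0 and later behavior.

```csharp
using System;
using System.Text.RegularExpressions;

// Illustrative sketch; DateDelimiters and SplitDate are hypothetical names.
public static class DateDelimiters
{
    public static string[] SplitDate(string input)
    {
        // Group 1 captures a hyphen, group 2 captures a forward slash.
        Regex regex = new Regex(@"(-)|(/)");
        return regex.Split(input);
    }

    public static void Main()
    {
        // Only the second group matches for this input. Under .NET Framework 2.0
        // and later its captures are still included in the result.
        foreach (string part in SplitDate("07/14/2007"))
        {
            Console.WriteLine("'{0}'", part);
        }
        // Prints (2.0 and later):
        //   '07'
        //   '/'
        //   '14'
        //   '/'
        //   '2007'
    }
}
```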

If the regular expression can match the empty string, Regex.Split(string) will split the string into an array of single-character strings because the empty string delimiter can be found at every location. For example:

code reference: System.Text.RegularExpressions.Regex.Split#11

Note that the returned array also includes an empty string at the beginning and end of the array.
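The referenced sample is not reproduced on this page. As a sketch of the empty-match case, the following code assumes an empty pattern and the input "characters"; the names EmptyPattern and SplitEverywhere are illustrative.

```csharp
using System;
using System.Text.RegularExpressions;

// Illustrative sketch; EmptyPattern and SplitEverywhere are hypothetical names.
public static class EmptyPattern
{
    public static string[] SplitEverywhere(string input)
    {
        // An empty pattern matches the empty string at every position.
        Regex regex = new Regex("");
        return regex.Split(input);
    }

    public static void Main()
    {
        string[] parts = SplitEverywhere("characters");
        // Ten single-character strings plus an empty string at each end.
        Console.WriteLine(parts.Length);                  // 12
        Console.WriteLine("{" + String.Join(", ", parts) + "}");
        // {, c, h, a, r, a, c, t, e, r, s, }
    }
}
```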

The System.Text.RegularExpressions.RegexMatchTimeoutException exception is thrown if the execution time of the split operation exceeds the time-out interval specified by the Regex(string, RegexOptions, TimeSpan) constructor. If you do not set a time-out interval when you call the constructor, the exception is thrown if the operation exceeds any time-out value established for the application domain in which the System.Text.RegularExpressions.Regex object is created. If no time-out is defined in the System.Text.RegularExpressions.Regex constructor call or in the application domain's properties, or if the time-out value is Regex.InfiniteMatchTimeout, no exception is thrown.
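One way to handle the time-out is sketched below. The one-second interval, the fallback of returning the unsplit input, and the names TimeoutSplit and SplitWithTimeout are assumptions made for illustration; an application may prefer to rethrow or retry with a longer interval.

```csharp
using System;
using System.Text.RegularExpressions;

// Illustrative sketch; TimeoutSplit and SplitWithTimeout are hypothetical names.
public static class TimeoutSplit
{
    public static string[] SplitWithTimeout(string input, string pattern, TimeSpan timeout)
    {
        // The time-out interval is set on the Regex instance by the constructor.
        Regex regex = new Regex(pattern, RegexOptions.None, timeout);
        try
        {
            return regex.Split(input);
        }
        catch (RegexMatchTimeoutException ex)
        {
            // Assumed fallback for this sketch: report the time-out and
            // return the input as a single element.
            Console.WriteLine("Split timed out after {0}.", ex.MatchTimeout);
            return new string[] { input };
        }
    }

    public static void Main()
    {
        string[] parts = SplitWithTimeout("a,b,c", ",", TimeSpan.FromSeconds(1));
        Console.WriteLine(String.Join("|", parts));   // a|b|c
    }
}
```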

Requirements

Namespace: System.Text.RegularExpressions
Assembly: System (in System.dll)
Assembly Versions: 1.0.5000.0, 2.0.0.0, 4.0.0.0