.

Java 1.6Language-Neutral

.

chardescriptionPattern.
compile("\\s")
StringTokenizer; Properties (file format)StreamTokenizer;
String.trim()*
Scanner;
Character.isWhitespace()
Unicode 5.0
"whitespace"
Unicode 5.0
"pattern whitespace"
is also unicode control character?

.

(00-08)...x +

.

\u0009character tabulationxxxxxx +

.

\u000aline feedxxxxxx +

.

\u000bline tabulationxxxxx +

.

\u000cform feedxxxxxx +

.

\u000dcarriage returnxxxxxx +

.

(0e-1b)...x +

.

\u001cinformation separator 4xx +

.

\u001dinformation separator 3xx +

.

\u001einformation separator 2xx +

.

\u001finformation separator 1xx +

.

\u0020spacexxxxxx

.

\u0085next linexx +

.

\u00a0non-breaking spacex

.

\u1680ogham space markxx

.

\u180emongolian vowel separatorxx

.

\u2000en quadxx

.

\u2001em quadxx

.

\u2002en spacexx

.

\u2003em spacexx

.

\u2004three-per-em spacexx

.

\u2005four-per-em spacexx

.

\u2006six-per-em spacexx

.

\u2007figure spacex

.

\u2008punctuation spacexx

.

\u2009thin spacexx

.

\u200ahair spacexx

.

\u200bzero-width spacex

.

\u200czero-width non-joiner

.

\u200dzero-width joiner

.

\u200eleft-to-right markx

.

\u200fright-to-left markx

.

\u2028line separatorxxx

.

\u2029paragraph separatorxxx

.

\u202fnarrow non-breaking spacex

.

\u205fmedium mathematical spacexx

.

\u3000ideographic spacexx

.

compiled by Kevin Bourrillion, http://smallwig.blogspot.com