Class WordTokenization

java.lang.Object
org.biojava.utils.Unchangeable
org.biojava.bio.seq.io.WordTokenization
All Implemented Interfaces:
Serializable, Annotatable, SymbolTokenization, Changeable
Direct Known Subclasses:
CrossProductTokenization, DoubleTokenization, IntegerTokenization, NameTokenization, SubIntegerTokenization

public abstract class WordTokenization extends Unchangeable implements SymbolTokenization, Serializable
Base class for tokenizations which accept whitespace-separated "words". Input is split at whitespace, except where the whitespace is enclosed in double quotes (""), parentheses (), or square brackets [].
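The splitting rule described above can be illustrated with a small standalone sketch. This is not the BioJava implementation: the class name WordSplitSketch and the method splitWords are hypothetical, and the choice to keep the enclosing quotes or brackets as part of each word is an assumption made only for illustration.

```java
import java.util.ArrayList;
import java.util.List;

// Hypothetical illustration only; not part of org.biojava.bio.seq.io.
public class WordSplitSketch {

    // Splits text at whitespace, except for whitespace enclosed in
    // double quotes, parentheses, or square brackets.
    // Assumption: the enclosing delimiters stay in the returned word.
    // Simplification: bracket types are counted together, so a mismatched
    // pair such as "(a b]" is not reported as an error.
    public static List<String> splitWords(String text) {
        List<String> words = new ArrayList<>();
        StringBuilder current = new StringBuilder();
        boolean inQuotes = false; // inside a "..." span
        int depth = 0;            // nesting depth of ( ) and [ ]

        for (int i = 0; i < text.length(); i++) {
            char c = text.charAt(i);
            if (c == '"') {
                inQuotes = !inQuotes;
            } else if (!inQuotes) {
                if (c == '(' || c == '[') {
                    depth++;
                } else if ((c == ')' || c == ']') && depth > 0) {
                    depth--;
                }
            }

            boolean enclosed = inQuotes || depth > 0;
            if (Character.isWhitespace(c) && !enclosed) {
                // Unenclosed whitespace ends the current word, if any.
                if (current.length() > 0) {
                    words.add(current.toString());
                    current.setLength(0);
                }
            } else {
                current.append(c);
            }
        }
        if (current.length() > 0) {
            words.add(current.toString());
        }
        return words;
    }

    public static void main(String[] args) {
        // Prints: [alpha, "two words", (a b), [c d], omega]
        System.out.println(splitWords("alpha \"two words\" (a b) [c d] omega"));
    }
}
```

Under these assumptions the input yields five words, with the quoted and bracketed spans each kept whole. A concrete subclass would typically go on to map each such word to a symbol; how it does so is outside the scope of this sketch.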
Since:
1.2
Author:
Thomas Down, Greg Cox, Keith James
See Also: