charset-0.3.7.1: Fast unicode character sets based on complemented PATRICIA tries

Copyright(c) Edward Kmett 2010-2012
LicenseBSD3
Maintainerekmett@gmail.com
Stabilityexperimental
PortabilityDeriveDataTypeable
Safe HaskellSafe
LanguageHaskell98

Data.CharSet.Unicode.Category

Contents

Description

Provides unicode general categories, which are typically connoted by \p{Ll} or \p{Modifier_Letter}. Lookups can be constructed using categories or individual character sets can be used directly.

A case, _ and - insensitive lookup is provided by lookupCategory and can be used to provide behavior similar to that of Perl or PCRE.

Synopsis

Unicode General Category

Lookup

CharSets by Category

Letter

Letter&

Mark

Separator

Symbol

Number

Punctuation

Other