nlp
1.2.0
nlp
/
com.londogard.nlp.tokenizer
/
TokenizerSpecialTokens
Tokenizer
Special
Tokens
public
class
TokenizerSpecialTokens
Content copied to clipboard
Special tokens that is usable to Machine Learning.
Functions
Properties
Functions
get
All
Caps
Link copied to clipboard
public
final
Character
getAllCaps
(
)
Content copied to clipboard
get
BOS
Link copied to clipboard
public
final
Character
getBOS
(
)
Content copied to clipboard
get
BOW
Link copied to clipboard
public
final
Character
getBOW
(
)
Content copied to clipboard
get
Char
Repetition
Link copied to clipboard
public
final
Character
getCharRepetition
(
)
Content copied to clipboard
get
EOS
Link copied to clipboard
public
final
Character
getEOS
(
)
Content copied to clipboard
get
Number
Link copied to clipboard
public
final
Character
getNumber
(
)
Content copied to clipboard
get
Number
Pattern
Link copied to clipboard
public
final
Regex
getNumberPattern
(
)
Content copied to clipboard
get
Number
Str
Link copied to clipboard
public
final
String
getNumberStr
(
)
Content copied to clipboard
get
Pad
Link copied to clipboard
public
final
Character
getPad
(
)
Content copied to clipboard
get
Space
Link copied to clipboard
public
final
Character
getSpace
(
)
Content copied to clipboard
get
Start
Of
Word
Link copied to clipboard
public
final
Character
getStartOfWord
(
)
Content copied to clipboard
get
Upper
Link copied to clipboard
public
final
Character
getUpper
(
)
Content copied to clipboard
get
Word
Repetition
Link copied to clipboard
public
final
Character
getWordRepetition
(
)
Content copied to clipboard
Properties
All
Caps
Link copied to clipboard
private
final
Character
AllCaps
Content copied to clipboard
BOS
Link copied to clipboard
private
final
Character
BOS
Content copied to clipboard
BOW
Link copied to clipboard
private
final
Character
BOW
Content copied to clipboard
Char
Repetition
Link copied to clipboard
private
final
Character
CharRepetition
Content copied to clipboard
EOS
Link copied to clipboard
private
final
Character
EOS
Content copied to clipboard
INSTANCE
Link copied to clipboard
public
final
static
TokenizerSpecialTokens
INSTANCE
Content copied to clipboard
Number
Link copied to clipboard
private
final
Character
Number
Content copied to clipboard
Number
Pattern
Link copied to clipboard
private
final
Regex
NumberPattern
Content copied to clipboard
Number
Str
Link copied to clipboard
private
final
String
NumberStr
Content copied to clipboard
Pad
Link copied to clipboard
private
final
Character
Pad
Content copied to clipboard
Space
Link copied to clipboard
private
final
Character
Space
Content copied to clipboard
Start
Of
Word
Link copied to clipboard
private
final
Character
StartOfWord
Content copied to clipboard
Upper
Link copied to clipboard
private
final
Character
Upper
Content copied to clipboard
Word
Repetition
Link copied to clipboard
private
final
Character
WordRepetition
Content copied to clipboard