Bpe Embeddings
BPEEmbeddings are subword embeddings that embeds SentencePieceTokenizer tokenized data. Studies show that performance are on par with GloVe (+-5%) while only using few MB's of data rather than GB's. Supports 275 languages through bpemb.
Constructors
Link copied to clipboard
public BpeEmbeddings BpeEmbeddings( Path filePath, Integer dimensions, Character delimiter, Tokenizer tokenizer)
Content copied to clipboard
Types
Functions
Link copied to clipboard
Link copied to clipboard
Link copied to clipboard
Link copied to clipboard
Link copied to clipboard
Link copied to clipboard
Link copied to clipboard
Link copied to clipboard
Link copied to clipboard
Link copied to clipboard
public List<NDArray<Float, D1>> traverseVectorsOrNull(List<String> words)
Content copied to clipboard