KotlinNLP/NeuralTokenizer

NeuralTokenizer

NeuralTokenizer is a simple-to-use text tokenizer built on the neural networks of the SimpleDNN library.

NeuralTokenizer is part of KotlinNLP.

Getting Started

Import with Maven

<dependency>
    <groupId>com.kotlinnlp</groupId>
    <artifactId>neuraltokenizer</artifactId>
    <version>0.4.8</version>
</dependency>
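
For Gradle builds, the same coordinates can be declared as in the sketch below. It uses the Gradle Kotlin DSL and assumes that the artifact is resolved from Maven Central and that a plugin providing the `implementation` configuration (for example the Kotlin JVM plugin) is already applied.

```kotlin
// build.gradle.kts (sketch; assumes the Kotlin JVM or Java plugin is applied)
repositories {
    // Assumption: the artifact is published on Maven Central.
    mavenCentral()
}

dependencies {
    // Same groupId, artifactId and version as the Maven snippet above.
    implementation("com.kotlinnlp:neuraltokenizer:0.4.8")
}
```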

Examples

Try NeuralTokenizer by running the files in the examples folder.

To run the examples you need the training and test datasets, which you can find here

Model Serialization

The whole neural model is contained in a single class, which provides simple dump() and load() methods to serialize it and load it back later.
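
The dump/load round trip can be illustrated with a minimal, self-contained sketch. `ToyModel` below is a hypothetical stand-in, not the NeuralTokenizer model class, and the stream-based signatures of `dump()` and `load()` are assumptions made for illustration; check the library sources for the actual class name and parameters.

```kotlin
import java.io.ByteArrayInputStream
import java.io.ByteArrayOutputStream
import java.io.InputStream
import java.io.ObjectInputStream
import java.io.ObjectOutputStream
import java.io.OutputStream
import java.io.Serializable

// Hypothetical stand-in for a model class; NOT the NeuralTokenizer API.
class ToyModel(val language: String, val weights: DoubleArray) : Serializable {

    companion object {
        private const val serialVersionUID: Long = 1L

        // Reads a model previously written by dump() and closes the stream.
        fun load(input: InputStream): ToyModel =
            ObjectInputStream(input).use { it.readObject() as ToyModel }
    }

    // Writes the whole model to the given stream and closes it.
    fun dump(output: OutputStream) {
        ObjectOutputStream(output).use { it.writeObject(this) }
    }
}

// Serializes the model to an in-memory buffer and loads it back.
fun roundTrip(model: ToyModel): ToyModel {
    val buffer = ByteArrayOutputStream()
    model.dump(buffer)
    return ToyModel.load(ByteArrayInputStream(buffer.toByteArray()))
}

fun main() {
    val original = ToyModel("en", doubleArrayOf(0.1, -0.2, 0.3))
    val restored = roundTrip(original)
    println(restored.language == original.language)
    println(restored.weights.contentEquals(original.weights))
}
```

In practice the in-memory buffer would be replaced by a `FileOutputStream` when saving and a `FileInputStream` when loading, so that a trained model can be reused across runs.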

License

This software is released under the terms of the Mozilla Public License, v. 2.0.

Contributions

We greatly appreciate bug reports and contributions, which can be made by filing an issue or opening a pull request on the GitHub page.
