Project description

Maximum Likelihood fit for N-grams

A small library for quickly computing Maximum Likelihood estimates for N-gram models and for training Neural Network N-gram models.

Installation

pip install ngram-ml

Usage

from ngram_ml import *

Example

Maximum Likelihood Estimator Example

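# tokens: the training corpus, assumed here to be a list of tokenized sentences (lists of words)
mle = NGramMLEstimator(sentences=tokens, n_grams=2, label_smoothing=1)
# Cross-entropy on the training corpus, then on one explicitly given sentence
mle.calculate_cross_entropy(tokens)
mle.calculate_cross_entropy([['<S>', 'the', 'cat', 'sat', 'on', 'the', 'mat', '</S>']])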
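
To make the quantities in the calls above concrete, here is a minimal sketch of a bigram maximum likelihood estimate with additive smoothing, and of the cross-entropy computed from it. It is written from scratch with the standard library and is not ngram_ml's implementation: the helper names (fit_bigram_counts, smoothed_prob, cross_entropy), the smoothing constant k, and the choice of log base 2 are assumptions made for illustration, and the library's label_smoothing argument may be defined differently.

```python
import math
from collections import Counter


def fit_bigram_counts(sentences):
    """Count bigrams and their one-word contexts over tokenized sentences."""
    bigram_counts = Counter()
    context_counts = Counter()
    vocab = set()
    for sentence in sentences:
        vocab.update(sentence)
        for prev, word in zip(sentence, sentence[1:]):
            bigram_counts[(prev, word)] += 1
            context_counts[prev] += 1
    return bigram_counts, context_counts, vocab


def smoothed_prob(prev, word, bigram_counts, context_counts, vocab_size, k=1.0):
    """Add-k estimate of P(word | prev): (count + k) / (context count + k * |V|)."""
    return (bigram_counts[(prev, word)] + k) / (context_counts[prev] + k * vocab_size)


def cross_entropy(sentences, bigram_counts, context_counts, vocab_size, k=1.0):
    """Average negative log2-probability per predicted token (bits per token)."""
    total_log_prob = 0.0
    n_predictions = 0
    for sentence in sentences:
        for prev, word in zip(sentence, sentence[1:]):
            p = smoothed_prob(prev, word, bigram_counts, context_counts, vocab_size, k)
            total_log_prob += math.log2(p)
            n_predictions += 1
    return -total_log_prob / n_predictions


# Toy corpus (hypothetical data, in the same shape as the sentence shown above).
corpus = [
    ['<S>', 'the', 'cat', 'sat', 'on', 'the', 'mat', '</S>'],
    ['<S>', 'the', 'dog', 'sat', 'on', 'the', 'rug', '</S>'],
]
bigram_counts, context_counts, vocab = fit_bigram_counts(corpus)
p_cat = smoothed_prob('the', 'cat', bigram_counts, context_counts, len(vocab), k=1.0)
h = cross_entropy(corpus, bigram_counts, context_counts, len(vocab), k=1.0)
```

With k=0 the estimate reduces to the plain relative frequency, which gives unseen bigrams probability zero, so the logarithm in cross_entropy fails for any sentence that contains one. A positive k avoids this at the cost of shifting probability mass away from observed bigrams.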

# Generate sentences from a starting word, passed as a tuple of word indices
mle.generate_sentence(30, initial_pre_seq=(mle.word_to_idx['pencil'],))
mle.generate_most_probable_sentence(30, initial_pre_seq=(mle.word_to_idx['book'],))
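
Judging only by the method names, the two calls above differ in how the next word is chosen: one samples, the other always takes the most likely continuation. The sketch below illustrates that distinction for a bigram model using the standard library. It is an assumption about the general technique, not the library's code: build_successors, generate, the greedy flag, and the stop-at-'</S>' rule are hypothetical names and choices, and this version works on words rather than word indices.

```python
import random
from collections import Counter, defaultdict


def build_successors(sentences):
    """For each word, count the words that follow it in the corpus."""
    successors = defaultdict(Counter)
    for sentence in sentences:
        for prev, word in zip(sentence, sentence[1:]):
            successors[prev][word] += 1
    return successors


def generate(successors, start, max_len, greedy=False, rng=None):
    """Extend [start] one word at a time, up to max_len words in total."""
    rng = rng or random.Random()
    sentence = [start]
    while len(sentence) < max_len:
        options = successors.get(sentence[-1])
        if not options:
            break  # the last word was never seen as a context
        if greedy:
            # Always take the most frequent successor.
            next_word = options.most_common(1)[0][0]
        else:
            # Sample a successor in proportion to its count.
            words = list(options)
            weights = [options[w] for w in words]
            next_word = rng.choices(words, weights=weights, k=1)[0]
        sentence.append(next_word)
        if next_word == '</S>':
            break  # end-of-sentence marker reached
    return sentence


# Toy corpus (hypothetical data).
corpus = [
    ['<S>', 'the', 'cat', 'sat', '</S>'],
    ['<S>', 'the', 'cat', 'ran', '</S>'],
    ['<S>', 'the', 'cat', 'sat', '</S>'],
]
successors = build_successors(corpus)
greedy_sentence = generate(successors, '<S>', max_len=30, greedy=True)
sampled_sentence = generate(successors, '<S>', max_len=30, rng=random.Random(0))
```

Greedy decoding is deterministic and can cycle on larger corpora when the most likely successors form a loop, which is one reason a length cap such as the 30 in the calls above is useful.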

Neural Network Example

# Build the training data from the tokenized sentences, then fit the network
dataset = NGramDataset(sentences=tokens, n_grams=2)
NN = NGramNeuralNet(n_grams=2, in_size=dataset.n_unique_words, embed_size=200)
NN.train(dataset.x, dataset.y, n_epochs=100, lr=0.01)
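
For readers who want to see what such a model does internally, here is a minimal pure-Python sketch of a neural bigram model: each context word is mapped to an embedding, a linear layer turns the embedding into one score per vocabulary word, and a softmax with cross-entropy loss is minimized by stochastic gradient descent. This is an illustration under stated assumptions, not ngram_ml's architecture or training loop: build_pairs, softmax, train_bigram_net, the weight initialization, and the per-example updates are all hypothetical, and the library may well use a tensor framework instead.

```python
import math
import random


def build_pairs(sentences, n_grams=2):
    """Map words to indices and emit (context indices, target index) pairs."""
    vocab = sorted({w for s in sentences for w in s})
    word_to_idx = {w: i for i, w in enumerate(vocab)}
    xs, ys = [], []
    for s in sentences:
        ids = [word_to_idx[w] for w in s]
        for i in range(n_grams - 1, len(ids)):
            xs.append(tuple(ids[i - n_grams + 1:i]))
            ys.append(ids[i])
    return word_to_idx, xs, ys


def softmax(logits):
    """Numerically stable softmax over a list of scores."""
    m = max(logits)
    exps = [math.exp(z - m) for z in logits]
    total = sum(exps)
    return [e / total for e in exps]


def train_bigram_net(xs, ys, vocab_size, embed_size=8, n_epochs=200, lr=0.1, seed=0):
    """Embedding -> linear -> softmax, trained with per-example SGD (bigrams only)."""
    rng = random.Random(seed)
    embed = [[rng.uniform(-0.5, 0.5) for _ in range(embed_size)] for _ in range(vocab_size)]
    out = [[rng.uniform(-0.5, 0.5) for _ in range(embed_size)] for _ in range(vocab_size)]
    losses = []
    for _ in range(n_epochs):
        total = 0.0
        for (ctx,), target in zip(xs, ys):
            h = embed[ctx][:]  # embedding of the single context word
            logits = [sum(w * v for w, v in zip(row, h)) for row in out]
            probs = softmax(logits)
            total += -math.log(probs[target])
            # Gradient of the loss w.r.t. the logits is probs - one_hot(target).
            grad_logits = probs[:]
            grad_logits[target] -= 1.0
            # Backpropagate to the embedding before the output weights change.
            grad_h = [sum(grad_logits[j] * out[j][d] for j in range(vocab_size))
                      for d in range(embed_size)]
            for j in range(vocab_size):
                for d in range(embed_size):
                    out[j][d] -= lr * grad_logits[j] * h[d]
            for d in range(embed_size):
                embed[ctx][d] -= lr * grad_h[d]
        losses.append(total / len(xs))  # mean loss over this epoch
    return embed, out, losses


# Toy corpus (hypothetical data).
corpus = [
    ['<S>', 'the', 'cat', 'sat', 'on', 'the', 'mat', '</S>'],
    ['<S>', 'the', 'dog', 'sat', 'on', 'the', 'rug', '</S>'],
]
word_to_idx, xs, ys = build_pairs(corpus, n_grams=2)
embed, out, losses = train_bigram_net(xs, ys, vocab_size=len(word_to_idx))
```

The per-epoch mean loss in losses should fall as training proceeds. It cannot reach zero on this corpus, because 'the' is followed by four different words and no model can predict all of them with certainty.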

Project details

Development Status
- 2 - Pre-Alpha
Intended Audience
- Developers
- Education
License
- OSI Approved :: MIT License
Operating System
- OS Independent
Programming Language
- Python :: 3

Release history

- 0.1.0 (this version): Mar 16, 2023
- 0.0.1: Mar 14, 2023

Download files

Source Distribution

ngram_ml-0.1.0.tar.gz (9.5 kB)

Uploaded Mar 16, 2023 Source

Built Distribution

ngram_ml-0.1.0-py3-none-any.whl (4.1 kB)

Uploaded Mar 16, 2023 Python 3

Hashes for ngram_ml-0.1.0.tar.gz

Algorithm	Hash digest
SHA256	`3a090be042570016fea27ccb9b3a6a24db1757ada26762592076df2438ee8621`
MD5	`8a719fde046623dce59542def441b10f`
BLAKE2b-256	`a93df0af05129c8b41ff6d3692a621327765e36479304bf17d15334c18bb0e3f`

Hashes for ngram_ml-0.1.0-py3-none-any.whl

Algorithm	Hash digest
SHA256	`4ece709a1220e898542347cefc56a89f1f232954d995cf6ee20ce1e0510f60ba`
MD5	`76981d96c158cb6cd42d8968c1b531ba`
BLAKE2b-256	`abe4b0f207997dbe1c81508c87da625d807de6373bf4bec28c1f17fb5ee401cb`