
py-inferout

Distributed scale-out framework for ML model serving/inference

This project is under development and is not intended for production use.

Install

It is available on PyPI:

$ pip install inferout

If you don't have the pip command, use python -m pip instead:

$ python -m pip install inferout

Usage (Quick example)

  • First of all, we need an ML model to serve. It can be of any kind, built with any framework (PyTorch, TensorFlow, Rasa, etc.), as long as it can be loaded in Python 3.7+.

  • We need to implement two interfaces:

    • serving_engine - teaches inferout how to load, serve (run inference with), and unload models of a specific kind/use case. example
    • storage_engine - teaches inferout how to fetch/download/locate the models. example
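
    The exact hook names and signatures inferout expects are defined by the example modules linked above; the sketch below is purely illustrative, and every function name in it is an assumption rather than inferout's real API.

    # Illustrative sketch only -- take the real interface from the linked examples.
    import os
    import pickle

    MODEL_DIR = "/tmp/nlu_models"  # matches the directory used later in this walkthrough

    # storage_engine idea: resolve a model's "path" parameter to a local file
    def locate_model(parameters):
        return os.path.join(MODEL_DIR, parameters["path"])

    # serving_engine idea: load, infer with, and unload a model
    def load_model(local_path):
        with open(local_path, "rb") as f:
            return pickle.load(f)  # a real engine would call its framework's loader here

    def infer(model, input_data):
        # input_data has the shape sent to the inferencing API, e.g. {"query": "Hi"}
        return {"output": model(input_data["query"])}

    def unload_model(model):
        del model  # release whatever resources the model holds
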
  • Get a Redis server ready. Yes, inferout requires Redis: it uses Redis to store metadata and to pass messages between the different components and nodes. https://redis.io/download
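
    To confirm Redis is reachable before going further, a quick check with the redis-py client works; this assumes a local Redis on the default port, using database 10 to match the --redis-url used below.

    import redis

    # redis:///10 means localhost:6379, database 10
    r = redis.Redis.from_url("redis://localhost:6379/10")
    print(r.ping())  # True if the server is reachable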

  • Bootstrap the cluster - this creates the minimum metadata required to run a cluster. All you need is a cluster name and a Redis URL.

    $ inferout bootstrap_cluster --cluster-name my_rasa_nlu --redis-url redis:///10
    
  • Launch the worker

    $ inferout worker --cluster-name my_rasa_nlu --redis-url redis:///10 --storage-engines "my_rasa_nlu.storage_engine" --serving-engines "my_rasa_nlu.serving_engine"
    

    You can run multiple workers for a single cluster. To run multiple workers on a single machine (for development and testing), use a different port number for each worker. Try "inferout worker -h" for more details.

    All we need to ensure is worker availability and connectivity (the serving API port) between nodes; everything else (replicating models, distributing them to workers, and routing inference requests intelligently) is taken care of by the inferout framework.

  • Create namespace

    $ curl -XPUT localhost:9500/namespaces/default -d '{"settings":{"storage_engine":"my_rasa_nlu.storage_engine","serving_engine":"my_rasa_nlu.serving_engine"}}'
    
  • Create model

    $ curl -XPUT localhost:9500/namespaces/default/models/mymodel1 -d '{"parameters":{"path":"nlu-20210726-153112.tar.gz"}}'
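
    If you prefer Python over curl, the same two registration calls can be made with the requests library; this sketch assumes the worker's management API is listening on localhost:9500 as above and accepts a JSON body, as the curl commands suggest.

    import requests

    base = "http://localhost:9500"

    # create the namespace with its storage/serving engines
    requests.put(f"{base}/namespaces/default", json={
        "settings": {
            "storage_engine": "my_rasa_nlu.storage_engine",
            "serving_engine": "my_rasa_nlu.serving_engine",
        },
    })

    # register the model under that namespace
    requests.put(f"{base}/namespaces/default/models/mymodel1", json={
        "parameters": {"path": "nlu-20210726-153112.tar.gz"},
    })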
    

    Wondering how to get this model file and where to place it for testing? Install open-source Rasa using pip: https://rasa.com/docs/rasa/installation

    $ pip3 install rasa
    

    Initialize a Rasa project:

    $ rasa init
    

    Train your NLU model:

    $ rasa train nlu
    

    Make the required directory and copy the model:

    $ mkdir /tmp/nlu_models
    $ ls models
    $ cp models/nlu-*.tar.gz /tmp/nlu_models/
    
  • Query your model (inference)

    $ curl -XPOST localhost:9510/default/mymodel1 -d '{"input_data":{"query":"Hi"}}'
    

    Did you notice the change in port number? Yes: for namespace and model creation we used port 9500, but this request went to 9510. An inferout worker provides two API services:

    • management API - create/update/delete/inspect models, namespaces, and workers
    • inferencing API - to query models
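
    The inferencing call can also be made from Python in the same way; this assumes the worker's inferencing API is on localhost:9510 as above.

    import requests

    resp = requests.post(
        "http://localhost:9510/default/mymodel1",
        json={"input_data": {"query": "Hi"}},
    )
    print(resp.json())  # the model's inference result
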
  • What next? Explore the other management APIs; for now, the API endpoints can be found in the source code.

