
MLflow Knative Deployment Plugin

MLflow plugin adding a Knative deployment client to MLflow CLI and Python API.

Note: MLServer (V2 Inference API) is enabled for all Docker builds.

Requirements

  • Python 3.10+
  • MLflow 2+
  • Docker

The target Kubernetes cluster must be running Knative 1.10+.

Installation

pip install mlflow-knative

Getting Started

The plugin adds support for a knative target scheme to the mlflow deployments CLI.

A Kubernetes context is used to define the Knative target as knative:/<context>.
You can list available contexts with kubectl config get-contexts.

Note: Passing only the base knative:/ (context omitted) as target lets the Kubernetes client pick a default configuration, which may be in-cluster. Kubernetes RBAC must be configured to allow the pod running this deployment client to create, list, update and delete Knative services.
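
As a minimal sketch of the two target forms with the Python API (the context name my-cluster is hypothetical):

from mlflow.deployments import get_deploy_client

# Explicit Kubernetes context, as listed by `kubectl config get-contexts`.
client = get_deploy_client("knative:/my-cluster")

# Base target: the Kubernetes client picks a default configuration,
# which may be the in-cluster configuration when running inside a pod.
in_cluster_client = get_deploy_client("knative:/")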

Make sure Docker is running locally if you intend to create or update a deployment, as it is required to build an image from the MLflow model.

Setting the image_repository config key is required to make the Docker image of the model available for deployment by Knative. You may also provide an image tag with the image_tag config key (defaults to latest).

mlflow deployments create \
  --target knative:/<context> \
  --name <deployment-name> \
  --model-uri models:/<model-name>/<model-version> \
  --config image_repository=<image-repository-URI> \
  --config image_tag=<image-tag>

The plugin provides detailed target help.

mlflow deployments help --target knative

All features are also available as a Python API deployment client.

from mlflow.deployments import get_deploy_client

client = get_deploy_client("knative:/my-cluster")
client.create_deployment(
	"hello-world",
	"models:/hello-world/1",
	config={
		"image_repository": "hello-world"
	}
)
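
The other deployment operations follow MLflow's standard deployment client interface; a brief sketch reusing the client above, with illustrative names and versions:

client.list_deployments()
client.get_deployment("hello-world")
client.update_deployment(
	"hello-world",
	model_uri="models:/hello-world/2",
	config={
		"image_repository": "hello-world",
		"image_tag": "2"
	}
)
client.delete_deployment("hello-world")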

Using a Private Image Repository

To use a private Docker image repository, run docker login before running the deployment client, then use the full repository URI as the value of the image_repository config key.

docker login --username <username> --password-stdin <private-repository-URL>

# If using AWS ECR:
aws ecr get-login-password | docker login --username AWS --password-stdin <private-repository-URL>

mlflow deployments create \
  --target knative:/<context> \
  --name <deployment-name> \
  --model-uri models:/<model-name>/<model-version> \
  --config image_repository=<image-repository-URI>  # e.g.: 000000000000.dkr.ecr.eu-west-3.amazonaws.com/model-name

Using a Remote MLflow Model Registry

Set an environment variable as export MLFLOW_TRACKING_URI=<tracking-server-uri> to use a remote MLflow model registry. This also works with a private model registry secured with OAuth 2, using the MLflow OIDC Client Plugin.
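
In Python, the equivalent is to set the variable before creating the deployment client; a minimal sketch with a placeholder tracking server URI and context name:

import os

from mlflow.deployments import get_deploy_client

# Point MLflow at the remote tracking server / model registry.
os.environ["MLFLOW_TRACKING_URI"] = "https://mlflow.example.com"

client = get_deploy_client("knative:/my-cluster")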

Knative Service Configuration

The deployment client can use any available namespace on the target cluster by setting the namespace config key. The default value is default.

mlflow deployments create \
  --target knative:/<context> \
  --name <deployment-name> \
  --model-uri models:/<model-name>/<model-version> \
  --config image_repository=<image-repository-URI> \
  --config namespace=<my-namespace>

To deploy a Knative service with a custom templated manifest, set the service_template config key. The value is a path to the YAML manifest you will be using.

mlflow deployments create \
  --target knative:/<context> \
  --name <deployment-name> \
  --model-uri models:/<model-name>/<model-version> \
  --config image_repository=<image-repository-URI> \
  --config service_template=<path/to/manifest>

The templated values $name, $namespace and $image are replaced respectively with the deployment name, the provided namespace (or "default"), and the image determined from the provided image repository and tag.
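
For illustration only, the $-style values behave like Python string.Template placeholders; the manifest below is a hypothetical, minimal Knative service definition, not the plugin's built-in template:

from string import Template

# Hypothetical minimal Knative service manifest using the same placeholders.
manifest_template = Template("""\
apiVersion: serving.knative.dev/v1
kind: Service
metadata:
  name: $name
  namespace: $namespace
spec:
  template:
    spec:
      containers:
        - image: $image
""")

rendered = manifest_template.substitute(
	name="hello-world",
	namespace="default",
	image="hello-world:latest"
)
print(rendered)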

Caching Behavior

On deployment creation, the plugin always builds and pushes a new Docker image for the model.
On deployment update, the plugin skips the image build and push step if both:

  • an image with the expected tag already exists in the repository, and
  • the MLflow run ID of the current deployment matches the run ID of the model to deploy (identified by its URI in the model registry).

Otherwise, the plugin builds and pushes a new image to the repository with the specified tag.

We recommend using immutable tags rather than relying on the default latest tag.
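
For example, tagging the image with the model version instead of latest keeps tags immutable; a minimal sketch reusing the Python client from above (names and version are illustrative):

client.create_deployment(
	"hello-world",
	"models:/hello-world/1",
	config={
		"image_repository": "hello-world",
		"image_tag": "1"
	}
)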

Updating a deployment always attempts to patch the corresponding Knative service and relies on Knative's handling of the changes to determine whether a new revision is required.
The generation service metadata property is bumped when a new revision is created, which can be useful for comparing service state before and after an update.
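
As a sketch of that comparison, assuming get_deployment returns a dictionary exposing the service's generation (the field layout shown here is an assumption, not documented plugin behavior):

before = client.get_deployment("hello-world")

client.update_deployment(
	"hello-world",
	model_uri="models:/hello-world/2",
	config={"image_repository": "hello-world"}
)

# "generation" as a top-level key is an assumption about the returned value.
after = client.get_deployment("hello-world")
if after.get("generation") != before.get("generation"):
	print("The update created a new Knative revision.")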

License

This project is licensed under the terms of the MIT license.

A yzr Free and Open Source project.

