
QoA4ML - Quality of Analytics for ML

Source code

https://github.com/rdsea/QoA4ML

Monitoring Client

QoA Client: an object that observes metrics, generates metric reports, and sends them to the Observation service via a list of connectors (e.g., messaging connector: RabbitMQ).

Developers only need to initialize a QoAClient once at the beginning and then use it to observe/evaluate metrics through self-instrumentation (calling its functions) at the right places in the source code.

  • To instantiate a QoA Client, developers can specify a configuration file path, pass the configuration as a dictionary, or give the URL of a registration service from which the client can fetch its configuration.

The configuration contains information about the client and its connectors in the form of a dictionary.

Example:

clientConf = { 
    "client":{
        "userID": "aaltosea1",
        "instance_name": "ML02",
        "stageID": "ML",
        "method": "REST",
        "application": "test",
        "role": "ml"
    },
    "connector":{
        "amqp_connector":{
            "class": "amqp",
            "conf":{
                "end_point": "localhost",
                "exchange_name": "qoa4ml",
                "exchange_type": "topic",
                "out_routing_key": "qoa.report.ml"
            }
        }
    }
}
qoaClient = QoaClient(config_dict=clientConf)

The connector is a dictionary containing one or more connector configurations (amqp, mqtt, kafka). If 'connector' is not defined, the developer must provide 'registration_url', which specifies the service where the client registers for monitoring. When set, the client registers with that service and receives its connector configuration, for example: "http://localhost:5001/registration".
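A minimal sketch of this registration path, assuming QoaClient accepts a registration_url keyword (the parameter name is an assumption based on the description above):

# Sketch only: the registration_url keyword is assumed from the text above.
# The client registers with the service and receives its connector configuration.
qoaClient = QoaClient(
    config_dict={"client": clientConf["client"]},
    registration_url="http://localhost:5001/registration",
)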

  • Via this client, developers can call different monitoring probes to measure desired metrics and categorize them into data quality, service performance or inference quality.

    • By using our probes (e.g., observeErronous, observeMissing, and observeInferenceMetric), the metrics are automatically categorized in the quality report.
    • For unsupported or user-defined metrics, developers can report them via observeMetric, providing the metric's name and its expected category. For example: qoaClient.observeMetric(metric_name="image_width", value=200, category=1).
  • Category: metrics are categorized into the following groups:

    • 0 - Quality: Performance (metrics for evaluating service performance, e.g., response time, throughput)
    • 1 - Quality: Data (metrics for evaluating data quality, e.g., missing, duplicate, erroneous)
    • 2 - Quality: Inference (metrics for evaluating the quality of ML inference, measured from inferences, e.g., accuracy, confidence)
    • 3 - Resource: metrics for evaluating resource utilization, e.g., CPU, memory
  • To send the quality report to the observation service, developers can call report on the QoAClient, for example: qualityReport = qoaClient.report(). Besides sending it, the function also returns the report at the current stage, saved here in qualityReport.

  • To aggregate reports from previous stages (in a pipeline) for building the computation graph, the client can call importPReport, for example: qoaClient.importPReport(previousReport). A combined usage sketch follows this list.
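Putting these calls together, a minimal sketch of instrumenting one pipeline stage (previousReport stands for a report received from the upstream stage; metric names and values are illustrative):

# Minimal sketch of instrumenting a single pipeline stage.
qoaClient = QoaClient(config_dict=clientConf)

# Aggregate the report from the previous stage (if any).
qoaClient.importPReport(previousReport)

# Observe metrics in the categories listed above
# (0: performance, 1: data, 2: inference, 3: resource).
qoaClient.observeMetric(metric_name="response_time", value=0.12, category=0)
qoaClient.observeMetric(metric_name="image_width", value=200, category=1)

# Generate the quality report for this stage and send it to the
# observation service via the configured connectors.
qualityReport = qoaClient.report()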

Probes

  • QoA4ML Probes: libraries and lightweight modules capturing metrics. They are integrated into suitable ML serving frameworks and ML code.
  • Probe properties:
    • Can be written in different languages (Python, GoLang)
    • Can communicate with monitoring systems in different ways (depending on the probe and its ML support)
    • Capture metrics with a clear definition/scope
      • e.g., response time for an ML stage (training) or a service call (of ML APIs)
      • Thus, probe output must be correlated with the monitored objects and the tenant
    • Support high- or low-level metrics/attributes
      • depending on the probe implementation
    • Can be instrumented into source code or run standalone

Metric

We support metric classes for collecting different types of metrics: Counter, Gauge, Summary, and Histogram. A usage sketch follows the list below.

  • Metric: a base class providing common functions on a metric object.
    • Attribute:
      • metric_name
      • description
      • value
    • Function:
      • __init__: lets the user define the metric name, description, and default value.
      • set: set its value to a specific value
      • get_val: get current value
      • get_name: return metric name
      • get_des: return metric description
      • __str__: return information about the metric in form of string
      • to_dict: return information about the metric in form of dictionary
  • Counter
    • Attribute: same as Metric (further attributes under development)
    • Function:
      • inc: increase the value of the metric by a given number (1 by default).
      • reset: set the value back to zero.
  • Gauge
    • Attribute: same as Metric (further attributes under development)
    • Function:
      • inc: increase the value of the metric by a given number (1 by default).
      • dec: decrease the value of the metric by a given number (1 by default).
      • set: set the value to a given number.
  • Summary
    • Attribute: same as Metric (further attributes under development)
    • Function:
      • inc: increase the value of the metric by a given number (1 by default).
      • dec: decrease the value of the metric by a given number (1 by default).
      • set: set the value to a given number.
  • Histogram
    • Attribute: same as Metric (further attributes under development)
    • Function:
      • inc: increase the value of the metric by a given number (1 by default).
      • dec: decrease the value of the metric by a given number (1 by default).
      • set: set the value to a given number.
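A minimal usage sketch of these classes; the import path qoa4ml.metric is an assumption, while the constructor argument order (metric name, description, default value) follows the __init__ description above:

# Sketch only: the import path is assumed, not confirmed by this document.
from qoa4ml.metric import Counter, Gauge

requests = Counter("request_count", "number of handled requests", 0)
requests.inc()      # increase by 1 (the default)
requests.inc(4)     # increase by a given number

queue_len = Gauge("queue_length", "items waiting in the queue", 0)
queue_len.set(10)   # set the value to a given number
queue_len.dec(3)    # decrease by a given number

print(requests.get_val())   # current value: 5
print(queue_len.to_dict())  # metric information as a dictionary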

QoA4ML Reports

This module defines QoA_Report, an object that provides functions to export monitored metrics to the following schema:

{
    "computationGraph":{
        "instances":{
            "@instance_id":{
                "instance_name": "@name_of_instance",
                "method": "@method/task/function",
                "previous_instance":["@list_of_previous_instance"]
            },
            ...
        },
        "last_instance": "@name_of_last_instance_in_the_graph"
    },
    "quality":{
        "data":{
            "@stageID":{
                "@metric_name":{
                    "@instance_id": "@value"
                }
            }
        },
        "performance":{
            "@stageID":{
                "@metric_name":{
                    "@instance_id": "@value"
                }
            }
        },
        "inference":{
            "@inference_id":{
                "value": "@value",
                "confident": "@confidence",
                "accuracy": "@accuracy",
                "instance_id": "@instance_id",
                "source": ["@list_of_inferences_to_infer_this_inference"]
            }
        }
    }
}

The example is shown in example/reports/qoa_report/example.txt

  • Attribute:

    • previous_report_instance: list of previous services
    • report_list: list of reports from previous services
    • previous_inference: list of previous inferences
    • quality_report: reports all quality aspects (data, service, and inference quality) of the service
    • execution_graph: reports the execution graph
    • report: the final report to be sent
  • Function:

    • __init__: initialize an empty report.
    • import_report_from_file: initialize a QoA Report from a JSON file.
    • importPReport: import reports from previous services to build the execution and inference graphs
    • build_execution_graph: build the execution graph from the list of previous reports
    • build_quality_report: build the quality report from metrics collected at runtime
    • generateReport: return the final report.
    • observeMetric: observe metrics at runtime in 3 categories: service quality, data quality, and inference quality. This can be extended to observe resource metrics. (A usage sketch follows this list.)
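Based only on the functions listed above, a hedged sketch of driving QoA_Report directly; the import path and the exact observeMetric signature are assumptions:

# Sketch only: import path and exact signatures are assumed.
from qoa4ml.reports import QoA_Report

report = QoA_Report()                   # init as an empty report
report.importPReport(previousReports)   # build execution/inference graphs
report.observeMetric("response_time", 0.12, category=0)  # runtime metric
finalReport = report.generateReport()   # the final report to be sent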

Examples

https://github.com/rdsea/QoA4ML/tree/main/example

Overview

Class

Probes are integrated into the client program or a system service to collect metrics at the edge. Probes generate reports and send them to a message broker using different connectors. A corresponding collector should be used to acquire the metrics.

Collector

The manager/orchestrator has to integrate a collector to collect metrics over different protocols for further analysis.

  • Attribute:
  • Function:
    • __init__: takes a configuration as a dict containing information about the data source, e.g., broker, channel, queue, etc. It can take an object as the host attribute, to which messages are returned for further processing.

    • If the collector is instantiated by an object of an inheriting class, that class must implement a message_processing function to process the messages returned by the collector. Otherwise, the collector will print the messages to the console.

    • on_request: handle messages from the data source (message broker, ...)

    • start & stop: start and stop consuming messages

    • get_queue: return the queue name.
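A sketch of the host-object pattern described above; the class name AmqpCollector, the host parameter name, the in_routing_key configuration key, and the message_processing signature are all assumptions for illustration:

# Sketch only: class name, parameter names, and signatures are assumed.
from qoa4ml.collector import AmqpCollector

class ReportHandler:
    def message_processing(self, message):
        # Process the message returned by the collector.
        print("received report:", message)

collectorConf = {
    "end_point": "localhost",
    "exchange_name": "qoa4ml",
    "exchange_type": "topic",
    "in_routing_key": "qoa.report.#",
}
collector = AmqpCollector(collectorConf, host=ReportHandler())
collector.start()   # start consuming messages; call collector.stop() to end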

Connector

Connectors are implemented with different protocols for sending reports. Example: sending reports to a message broker via AMQP/MQTT.

  • Attribute:
  • Function:
    • __init__: takes a configuration as a dict containing information about the data sink, e.g., broker, channel, queue, etc. It can take a bool parameter log to log messages for further processing.

    • send_data: a function to send data to a specified routing_key/queue with a corresponding key corr_id to trace the message back.
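A sketch of sending a report through an AMQP connector; the class name Amqp_Connector and the exact send_data signature are assumptions, while the configuration keys mirror the amqp_connector example above:

# Sketch only: class name and signatures are assumed.
from qoa4ml.connector import Amqp_Connector

connectorConf = {
    "end_point": "localhost",
    "exchange_name": "qoa4ml",
    "exchange_type": "topic",
    "out_routing_key": "qoa.report.ml",
}
connector = Amqp_Connector(connectorConf, log=True)
connector.send_data('{"metric": "response_time", "value": 0.12}',
                    corr_id="req-42")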

Utilities

A module providing some frequently used functions and functions to directly collect system metrics.

Note

  • The eva_duplicate, eva_erronous, eva_missing, and detect_outlier probes use the ydata-quality library, which is only available for Python 3.8.
  • To use the ML quality probes, you may need to install a few more dependencies, e.g., tensorflow and Pillow.
  • QoaClient uses the AMQP protocol by default. To use MQTT, you may need to install paho-mqtt.
  • To monitor Docker stats, you need to install the Docker Python client (docker).
  • To connect with Prometheus, you need to install prometheus-client.

Change Log

0.0.13 (18/04/2022)

First Release

0.0.18 (10/05/2022)

Update system metric

0.0.19 (31/05/2022)

Update process metric

0.0.54 (20/09/2022)

Update monitoring system/process/docker

0.0.62 (20/09/2022)

Add metric and modify format of system/process reports

0.0.64 (15/03/2022)

Add metric and modify format of system/process reports

0.0.72 (22/05/2023)

Refactor source code
