Skip to main content

Flexible DataFrame Operations via UI

Project description


Ultima Logo


the ultimate data analytics tool
for no code visualisation and collaborative exploration.

Present easier.   Dig deeper.   Review together.  

The Ultimate BI tool

With Ultibi you can turn your DataFrame into a pivot table with a UI and share it across organisation. You can also define measures applicable to your DataFrame. This means your colleagues/consumers don't have to write any code to analyse the data.


Ultima Logo


Ultibi leverages on the giants: Actix, Polars and Rust which make this possible. We use TypeScript for the frontend.

Examples

Our userguide

Python

pip install ultibi

Quickstart

import ultibi as ul
import polars as pl
import os
os.environ["RUST_LOG"] = "info" # enable logs
os.environ["ADDRESS"] = "0.0.0.0:8000" # host on this address

# Read Data
# for more details: https://pola-rs.github.io/polars/py-polars/html/reference/api/polars.read_csv.html
df = pl.read_csv("titanic.csv")

# Convert it into an Ultibi DataSet
ds = ul.DataSet.from_frame(df)

# By default (might change in the future)
# Fields are Utf8 (non numerics) and integers
# Measures are numeric columns.
ds.ui() 

Then navigate to http://localhost:8000 or checkout http://localhost:8000/swagger-ui for the OpenAPI documentation.

More flexible - custom measures

import ultibi as ul
import polars as pl
import os
os.environ["RUST_LOG"] = "info" # enable logs
os.environ["ADDRESS"] = "0.0.0.0:8000" # host on this address

# Read Data
# for more details: https://pola-rs.github.io/polars/py-polars/html/reference/api/polars.read_csv.html
df = pl.read_csv("titanic.csv")

# Let's add some Custom/Bespoke Calculations to our UI

# Standard Calculator
def survival_mean_age(kwargs: dict[str, str]) -> pl.Expr:
    """Mean Age of Survivals
    pl.col("survived") is 0 or 1
    pl.col("age") * pl.col("survived") - age of survived person, otherwise 0
    pl.col("survived").sum() - number of survived
    """
    return pl.col("age") * pl.col("survived") / pl.col("survived").sum()

# Also a Standard Calculator
def example_dep_calc(kwargs: dict[str, str]) -> pl.Expr:
    return pl.col("SurvivalMeanAge_sum") + pl.col("SouthamptonFareDivAge_sum")

# When we need more involved calculations we go for a Custom Calculator
def custom_calculator(
            srs: list[pl.Series], kwargs: dict[str, str]
        ) -> pl.Series:
        """
        Southampton Fare/Age*multiplier
        """
        df = pl.DataFrame({"age": srs[0], 
                           "fare": srs[1], 
                           "e": srs[2]}) 
        # Add Indicator Column for Southampton
        df = df.with_columns(pl.when(pl.col("e")=="S").then(1).otherwise(0).alias("S")) 
        multiplier = float(kwargs.get("multiplier", 1))
        res = df["S"] * df["fare"] / df["age"] * multiplier
        return res

# inputs for the custom_calculator srs param
inputs = ["age", "fare", "embarked"]
# We return Floats
res_type = pl.Float64
# We return a Series, not a scalar (which otherwise would be auto exploded)
returns_scalar = False

measures = [
            ul.BaseMeasure(
                "SouthamptonFareDivAge",
                ul.CustomCalculator(
                    custom_calculator, res_type, inputs, returns_scalar
                ),
                # (Optional) - we are only interested in Southampton, so
                # unless other measures requested we might as well filter for Southampton only
                # However, if if multiple measures requested, their precompute_filters will be joined as OR.
                [[ul.EqFilter("embarked", "S")]],
                # PARAMS tab of the UI
                calc_params=[ul.CalcParam("mltplr", "1", "float")]
            ),
            ul.BaseMeasure(
                "SurvivalMeanAge",
                ul.StandardCalculator(survival_mean_age),
                aggregation_restriction="sum",
            ),
            ul.DependantMeasure(
                "A_Dependant_Measure",
                ul.StandardCalculator(example_dep_calc),
                [("SurvivalMeanAge", "sum"), ("SouthamptonFareDivAge", "sum")],
            ),
        ]

# Convert it into an Ultibi DataSet
ds = ul.DataSet.from_frame(df, bespoke_measures=measures)

# By default (might change in the future)
# Fields are Utf8 (non numerics) and integers
# Measures are numeric columns.
ds.ui() 

You can also wite Rust native Custom calculators measure which wouldn't be bounded by GIL! Checkout the userguide.

DataSources - In and OutOf memory

We provide aim to support different sources of the data.

scan = pl.read_csv("../frtb_engine/data/frtb/Delta.csv", 
                           dtypes={"SensitivitySpot": pl.Float64})
dsource = ul.DataSource.inmemory(scan)
ds = ul.DataSet.from_source(dsource)
ds.prepare() # .prepare() is only relevant to FRTB dataset currently
ds.ui()

If you don't want to/can't hold all your data in the process memory, you can sacrifise performance for memory with Scan/DataBase

import polars as pl
import ultibi as ul
# Note that the LazyFrame query must start with scan_
# and must've NOT been collected
scan = pl.scan_csv("../frtb_engine/data/frtb/Delta.csv", 
                    dtypes={"SensitivitySpot": pl.Float64})
dsource = ul.DataSource.scan(scan)
ds = ul.DataSet.from_source(dsource)
ds.ui()

Note: Naturally the later two options will be slower, because prior to computing your measures we will need to read the relevant bits of the data into the process memory.

FRTB SA

FRTB SA is a great usecase for ultibi. FRTB SA is a set of standardised, computationally intensive rules established by the regulator. High business impact of these rules manifests in need for analysis and visibility thoroughout an organisation. Note: Ultima is not a certified aggregator. Always benchmark the results against your own interpretation of the rules. See python frtb userguide.

Polars Compatibility

Ultibi Polars
0.5 18.7
0.6 19.18

License

Licensor: Ultibi Ltd. Licensed Work: Ultima The Licensed Work is (c) 2023 Ultibi Ltd.

ultibi python library is made available for the purpose of demonstrating the possibilities offered by the Software so users can evaluate the possibilities and potential of the Software. You can choose one of the following:

  1. GNU GPL v3

  2. Proprietary License. Allows you to use the software in whichever way you want. These licenses are extremely affordable, and you can check out the options by reaching out to us directly, via anatoly at ultimabi dot uk, or by visiting ultimabi dot uk

No Liability

As far as the law allows, the software comes as is, without any warranty or condition, and the licensor will not be liable to you for any damages arising out of these terms or the use or nature of the software, under any kind of legal claim.

Contributions

All code in this Repository is a property of the Licensor. If you make a contribution via PR or any other way, you give the Licensor a right to use your code in whatever way they deem necessary, including copying or refactoring it to a private repository, provided that a copy of your work will remain available under the current conditions.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

ultibi-0.6.0.tar.gz (181.4 kB view hashes)

Uploaded Source

Built Distributions

ultibi-0.6.0-cp37-abi3-win_amd64.whl (21.3 MB view hashes)

Uploaded CPython 3.7+ Windows x86-64

ultibi-0.6.0-cp37-abi3-win32.whl (19.0 MB view hashes)

Uploaded CPython 3.7+ Windows x86

ultibi-0.6.0-cp37-abi3-manylinux_2_28_aarch64.whl (30.0 MB view hashes)

Uploaded CPython 3.7+ manylinux: glibc 2.28+ ARM64

ultibi-0.6.0-cp37-abi3-manylinux_2_17_x86_64.manylinux2014_x86_64.whl (30.8 MB view hashes)

Uploaded CPython 3.7+ manylinux: glibc 2.17+ x86-64

ultibi-0.6.0-cp37-abi3-manylinux_2_17_ppc64le.manylinux2014_ppc64le.whl (36.9 MB view hashes)

Uploaded CPython 3.7+ manylinux: glibc 2.17+ ppc64le

ultibi-0.6.0-cp37-abi3-manylinux_2_17_i686.manylinux2014_i686.whl (31.9 MB view hashes)

Uploaded CPython 3.7+ manylinux: glibc 2.17+ i686

ultibi-0.6.0-cp37-abi3-manylinux_2_17_armv7l.manylinux2014_armv7l.whl (29.0 MB view hashes)

Uploaded CPython 3.7+ manylinux: glibc 2.17+ ARMv7l

ultibi-0.6.0-cp37-abi3-macosx_11_0_arm64.whl (21.9 MB view hashes)

Uploaded CPython 3.7+ macOS 11.0+ ARM64

ultibi-0.6.0-cp37-abi3-macosx_10_12_x86_64.whl (23.3 MB view hashes)

Uploaded CPython 3.7+ macOS 10.12+ x86-64

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page