Skip to main content

HDX Data Freshness Database Clean

Project description

Utility to clean Freshness Database

Build Status Coverage Status Code style: black Imports: isort

DEPRECATED - code moved to https://github.com/OCHA-DAP/hdx-data-freshness

This script cleans the freshness database.

Usage

python -m hdx.freshness.dbactions [-db/--db_uri=] [-dp/--db_params=] [action]

Either db_uri or db_params must be provided or the environment variable DB_URI must be set. db_uri or DB_URI are of form: postgresql+psycopg://user:password@host:port/database

db_params is of form: database=XXX,host=X.X.X.X,username=XXX,password=XXX,port=1234, ssh_host=X.X.X.X,ssh_port=1234,ssh_username=XXX, ssh_private_key=/home/XXX/.ssh/keyfile

action:

  • "clone" which creates a shallow clone of the database which only has all the runs and one dataset and its resources per run for testing purposes.

  • "clean" (the default) cleans the database by removing runs according to these rules:

    1. Keep a handful of runs around the end of each quarter all the way back to the first run in 2017
    2. Keep daily runs going back 2 years
    3. Keep weekly runs from 2 to 4 years back
    4. Keep monthly runs for 4 years back and earlier

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

hdx-data-freshness-dbclean-1.0.2.tar.gz (301.2 kB view hashes)

Uploaded Source

Built Distribution

hdx_data_freshness_dbclean-1.0.2-py2.py3-none-any.whl (7.8 kB view hashes)

Uploaded Python 2 Python 3

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page