Skip to main content

Biosynthetic Gene Cluster finder with Graph Neural Network

Project description

BGCfinder : Biosynthetic Gene Cluster detection with Graph Neural Network

BGCfinder detects biosynthetic gene clusters in bacterial genomes using deep learning. BGCfinder takes a fasta file containing protein sequences and convert each of them into a graph. Graph neural network takes the input graphs to detect biosynthetic gene cluster..

Installation requirement:

To construct the conda environment,

$ conda create --name BGCfinder  python=3.9
$ conda init bash   
$ conda activate BGCfinder   
$ conda install pytorch==1.11.0 torchvision==0.12.0 torchaudio==0.11.0 cudatoolkit=11.3 -c pytorch
$ conda install pyg -c pyg    
$ pip install BGCfinder     

To download the BGCfinder model and test files,

$ bgc-download

To find the protein-coding gene in bacterial genome (Installation of Prodigal is required for this step),

$ prodigal -f gff -i bacterial_genome_seq.fasta -a bacterial_protein_seq.fasta -o bacterial_genome_seq.gff 

To run BGCfinder with a fasta file containing amino acid sequence with CPU (recommended),

$ bgcfinder bacterial_protein_seq.fasta -o output_filename_prefix -l log_record.log -d False

To run BGCfinder with a fasta file containing amino acid sequence with GPU,

$ bgcfinder bacterial_protein_seq.fasta -o output_filename_prefix -l log_record.log -d True

The development environment of BGCfinder :

'torch==1.10.0',   
'torch-geometric==2.0.2',   
'torch-scatter==2.0.9',   
'torch-sparse==0.6.12'   

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distributions

No source distribution files available for this release.See tutorial on generating distribution archives.

Built Distribution

BGCfinder-0.0.30-py3-none-any.whl (10.8 kB view hashes)

Uploaded Python 3

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page