genealloy

GeneAlloy helps design overlapping sequences.


Keywords: biology
License: MIT
Install: pip install genealloy==0.1.1


GeneAlloy

GeneAlloy helps design overlapping sequences.

It takes two nucleotide sequences that each code for an amino acid sequence, together with a codon conversion table of allowed triplet -> triplet transitions, and determines whether one sequence can be inserted into the other. Note that the package is under development.
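The idea can be illustrated with a minimal, self-contained sketch. It is not GeneAlloy's implementation: it assumes the standard genetic code, allows only synonymous codon swaps (GeneAlloy's swap table may allow other transitions), handles only same-strand overlaps, and uses a brute-force search. The names CODON_TO_AA, AA_TO_CODONS, translate and find_same_strand_overlap are hypothetical and are not part of the genealloy API.

```python
from itertools import product

# Standard genetic code, written in the conventional T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TO_AA = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}

# Reverse lookup: amino acid -> list of codons that encode it.
AA_TO_CODONS = {}
for codon, aa in CODON_TO_AA.items():
    AA_TO_CODONS.setdefault(aa, []).append(codon)


def translate(seq):
    """Translate a nucleotide sequence in frame 0, ignoring trailing bases."""
    usable = len(seq) - len(seq) % 3
    return "".join(CODON_TO_AA[seq[i:i + 3]] for i in range(0, usable, 3))


def find_same_strand_overlap(host, guest, offset):
    """Return a host variant that contains a synonymous guest at `offset`
    while still encoding the original host protein, or None if none exists.

    Assumption: only synonymous swaps are allowed for both sequences.
    """
    if len(guest) % 3 != 0 or offset < 0 or offset + len(guest) > len(host):
        raise ValueError("guest must be whole codons and fit inside the host")

    host_protein = translate(host)
    # For every guest codon, list all codons encoding the same amino acid.
    guest_options = [
        AA_TO_CODONS[CODON_TO_AA[guest[i:i + 3]]]
        for i in range(0, len(guest), 3)
    ]
    # Brute force: overlay every synonymous guest variant onto the host and
    # keep the first one that leaves the host protein unchanged.
    for variant in product(*guest_options):
        guest_variant = "".join(variant)
        candidate = (
            host[:offset] + guest_variant + host[offset + len(guest_variant):]
        )
        if translate(candidate) == host_protein:
            return candidate
    return None


host = "GCTGCTGCT"   # encodes Ala-Ala-Ala
guest = "CTCCCA"     # encodes Leu-Pro
print(find_same_strand_overlap(host, guest, offset=1))
```

The brute-force search grows exponentially with the guest length, so it is only suitable for short sequences like the one in this illustration.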

Overlapping sequences are nucleotide sequences that encode different amino acid sequences in the same DNA or RNA region. The sequences lie either on complementary strands (in any frame) or on the same strand in shifted reading frames. This is possible because the genetic code is redundant (codon degeneracy): most amino acids are encoded by more than one codon.
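To make the notion of frames concrete, the following sketch translates one short DNA sequence in all six reading frames (three per strand), showing that a single region can encode several different peptides. It assumes the standard genetic code. The helper names reverse_complement, translate and six_frame_translations are hypothetical and are not taken from genealloy.

```python
# Standard genetic code, written in the conventional T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TO_AA = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}

COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq):
    """Return the complementary strand, read 5' to 3'."""
    return seq.translate(COMPLEMENT)[::-1]


def translate(seq, frame=0):
    """Translate `seq` starting at `frame` (0, 1 or 2); trailing bases are ignored."""
    seq = seq[frame:]
    usable = len(seq) - len(seq) % 3
    return "".join(CODON_TO_AA[seq[i:i + 3]] for i in range(0, usable, 3))


def six_frame_translations(seq):
    """Map (strand, frame) to the peptide encoded in that reading frame."""
    frames = {}
    for strand, strand_seq in (("+", seq), ("-", reverse_complement(seq))):
        for frame in range(3):
            frames[(strand, frame)] = translate(strand_seq, frame)
    return frames


for (strand, frame), peptide in six_frame_translations("GCTGCCGCT").items():
    print(strand, frame, peptide)
```

For the sequence GCTGCCGCT, the forward strand reads AAA in frame 0 and LP in frame 1, so the same nine nucleotides carry two different peptides.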

In the metallurgical terminology used at genome foundries, the host sequence (into which the shorter sequence is inserted) is called the matrix or solvent, the shorter guest (or parasite) is called the solute, and the combined sequence is called a genealloy.

Install

pip install genealloy

Usage

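import genealloy as ga

# Build the table of allowed codon -> codon swaps from the bundled codon tables
swaptable = ga.generate_swaptable(ga.codon_to_aa, ga.aa_to_codon_extended)
host = 'TCGTCGTACCAGCCGCAGAGGAGAGCTACTTTT'
parasite = 'GTACCCGCTGCG'  # frameshift 2
# Check whether the parasite can be inserted into the host
ga.make_genealloy(host, parasite, swaptable)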

Find partial overlaps:

ga.find_partial_overlaps(host, parasite, swaptable)

Version

The GeneAlloy project uses the semantic versioning scheme. The package is under development.

License = MIT

GeneAlloy is free software, which means that users are free to run, copy, distribute, study, change and improve it.

Genealloy was written at the Edinburgh Genome Foundry by Peter Vegh and is released under the MIT license.