AAComposition module


The module is used for computing the composition of amino acids, dipetide and

3-mers (tri-peptide) for a given protein sequence. You can get 8420 descriptors

for a given protein sequence. You can freely use and distribute it. If you hava

any problem, you could contact with us timely!

References:

[1]: Reczko, M. and Bohr, H. (1994) The DEF data base of sequence based protein

fold class predictions. Nucleic Acids Res, 22, 3616-3619.

[2]: Hua, S. and Sun, Z. (2001) Support vector machine approach for protein

subcellular localization prediction. Bioinformatics, 17, 721-728.

[3]:Grassmann, J., Reczko, M., Suhai, S. and Edler, L. (1999) Protein fold class

prediction: new methods of statistical classification. Proc Int Conf Intell Syst Mol

Biol, 106-112.

Authors: Dongsheng Cao and Zhijiang Yao.

Date: 2016.7.27

Email: oriental-cds@163.com and gadsby@163.com


AAComposition.CalculateAAComposition(ProteinSequence)[source]

Calculate the composition of Amino acids

for a given protein sequence.

Usage:

result=CalculateAAComposition(protein)

Input: protein is a pure protein sequence.

Output: result is a dict form containing the composition of

AAComposition.CalculateAADipeptideComposition(ProteinSequence)[source]

Calculate the composition of AADs, dipeptide and 3-mers for a

given protein sequence.

Usage:

result=CalculateAADipeptideComposition(protein)

Input: protein is a pure protein sequence.

Output: result is a dict form containing all composition values of

AAComposition.CalculateDipeptideComposition(ProteinSequence)[source]

Calculate the composition of dipeptidefor a given protein sequence.

Usage:

result=CalculateDipeptideComposition(protein)

Input: protein is a pure protein sequence.

Output: result is a dict form containing the composition of

AAComposition.GetSpectrumDict(proteinsequence)[source]

Calcualte the spectrum descriptors of 3-mers for a given protein.

Usage:

result=GetSpectrumDict(protein)

Input: protein is a pure protein sequence.

Output: result is a dict form containing the composition values of 8000

AAComposition.Getkmers()[source]

Get the amino acid list of 3-mers.

Usage:

result=Getkmers()

Output: result is a list form containing 8000 tri-peptides.