Determination of the nucleotide or amino acid composition of genome or protein sequences by using natural vector method and convex hull principle

2021 
Abstract Although with the continuous development of sequencing technology, the number of genome and protein sequences has grown rapidly, these sequences are only a small part of nature. Biologically, it is still a challenging and important problem to detect and predict some new genome or protein sequences based on real sequence data, which motivates us to solve the problem mathematically. The first step to predict the new sequences is determining the nucleotide or amino acid composition of them. In this paper, we apply natural vector method and convex hull principle to determine the nucleotide or amino acid composition of new genome or protein sequences. Our algorithm is based on optimization strategy. The SARS-CoV-2 genome and protein datasets are used to verify the feasibility of our algorithm. Numerical experiments show that our algorithm can detect and predict possible number of each nucleotide or amino acid of genome and protein sequence with respect to the second order natural vectors.
    • Correction
    • Source
    • Cite
    • Save
    • Machine Reading By IdeaReader
    11
    References
    0
    Citations
    NaN
    KQI
    []