Structural and Evolutionary Analysis of the copia-like Elements in the Arabidopsis thaliana Genome

2001 
: The analysis of 460 kb of genomic sequence of Arabidopsis thaliana chromosome III allowed us to identify two new transposable elements named AtC1 and AtC2. AtC1 shows identical long terminal repeats (LTRs) and all the structural features characteristic of the copia-like active elements. AtC2 is also a full copia-like element, but a putative stop codon in the open reading frame (ORF) would produce a truncated protein. In order to identify the copia-like fraction of the A. thaliana genome, a careful computer-based analysis of the available sequences (which correspond to 92% of the genome) was performed. Approximately 300 nonredundant copia-like sequences homologous to AtC1 and AtC2 were detected, which showed an extreme heterogeneity in size and degree of conservation. This number of copies would correspond to approximately 1% of the A. thaliana genome. Seventy-one sequences were selected for further analysis, with 23 of them being full complete elements. Five corresponded to previously described ones, and the remaining ones, named AtC3 to AtC18 are new elements described in this work. Most of these elements presented a putative functional ORF, nearly identical LTRs, and the other elements necessary for retrotransposon activity. Phylogenetic trees, supported by high bootstrap values, indicated that these 23 elements could be considered separate families. In turn, these 23 families could be clustered into six major lineages, named copia I-VI. Most of the 71 analyzed sequences clustered into these six main clades. The widespread presence of these copia-like superfamilies throughout plant genomes is discussed.
    • Correction
    • Source
    • Cite
    • Save
    • Machine Reading By IdeaReader
    84
    References
    25
    Citations
    NaN
    KQI
    []