An assessment of the complexity of 3' UTRs relative to that of protein-coding sequences: models selected using two procedures

Name: An assessment of the complexity of 3' UTRs relative to that of protein-coding sequences: models selected using two procedures
Creator: Distinguished Professor Kerrie Mengersen
License: http://creativecommons.org/licenses/by/4.0/
Keywords: Markov models, Biological sciences, Biostatistics, Genome analysis, Genetics, Computational biology, Genome evolution, Sequence analysis, Molecular biology techniques, Probability theory, MATHEMATICAL SCIENCES, Bayes theorem, Molecular biology, Genome complexity, Comparative genomics, Sequencing techniques, research data, data collections, research project

Algama, Manjula; Oldmeadow, Christopher; Tasker, Edward; Mengersen, Kerrie; Keith, Jonathan M.

doi:10.4225/09/585740a0f07ac

The dataset comes from a study which assessed the complexity of 3′ UTRs (three prime untranslated regions) relative to that of protein-coding sequences, by comparing the extent to which segmental substructures can be detected within these two genomic fractions based on sequence composition and conservation.

For the dataset, two different procedures were applied to select the number of classes for each alignment: examining Deviance Information Criterion V (DICV) values (Procedure 1) and examining the stability of the classes (Procedure 2). The numbers of classes selected for each sequence by each procedure are summarised in this dataset.
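As a rough illustration of criterion-based selection in the spirit of Procedure 1, the sketch below picks the candidate number of classes with the lowest criterion value. It assumes that lower DICV values are preferred and that the minimum is taken; the record itself says only that DICV values were investigated, so the study's actual rule may differ. The function name `select_by_lowest_criterion` and the values in `example_dicv` are hypothetical and are not taken from the dataset.

```python
from typing import Mapping


def select_by_lowest_criterion(values: Mapping[int, float]) -> int:
    """Return the number of classes with the lowest criterion value.

    Assumption: a lower value indicates a preferred model.
    Ties are broken in favour of the smaller number of classes.
    """
    if not values:
        raise ValueError("no candidate models supplied")
    # Sort key: criterion value first, then number of classes.
    return min(values, key=lambda k: (values[k], k))


# Hypothetical DICV values, invented for illustration only (not from the dataset).
example_dicv = {10: 1520.4, 11: 1498.7, 12: 1490.2, 13: 1491.0, 14: 1495.6}

selected = select_by_lowest_criterion(example_dicv)
print(selected)
```

A stability-based check such as Procedure 2 would need the fitted classes themselves rather than a single score per model, so it is not sketched here.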

The data indicate that twelve to fourteen segment classes with distinct character frequencies can be distinguished in each of the three coding sequence alignments, whether Procedure 1 or Procedure 2 is used.

Geographical area of data collection

Bounding polygon (KML coordinates, longitude,latitude):

159.255525,-9.219822 112.921454,-9.219822 112.921454,-54.777218 159.255525,-54.777218 159.255525,-9.219822

Publications

Algama, Manjula, Oldmeadow, Christopher, Tasker, Edward, Mengersen, Kerrie, & Keith, Jonathan M. (2014) Drosophila 3′ UTRs are more complex than protein-coding sequences. PLoS ONE, 9(5), e97336. http://dx.doi.org/10.1371/journal.pone.0097336

Research areas

Markov models

Biological sciences

Biostatistics

Genome analysis

Genetics

Computational biology

Genome evolution

Sequence analysis

Molecular biology techniques

Probability theory

MATHEMATICAL SCIENCES

Bayes theorem

Molecular biology

Genome complexity

Comparative genomics

Sequencing techniques

Cite this collection

Algama, Manjula; Oldmeadow, Christopher; Tasker, Edward; Mengersen, Kerrie; Keith, Jonathan M. (2014): Models selected using two procedures. Table_1.xls. PLOS ONE. 10.1371/journal.pone.0097336.t001.

Algama, Manjula; Oldmeadow, Christopher; Tasker, Edward; Mengersen, Kerrie; Keith, Jonathan M. (2014): An assessment of the complexity of 3' UTRs relative to that of protein-coding sequences: models selected using two procedures. [Queensland University of Technology]. https://doi.org/10.4225/09/585740a0f07ac