2019-08-01T12:06:29 n6358

An assessment of the complexity of 3' UTRs relative to that of protein-coding sequences: models selected using two procedures

The dataset comes from a study that assessed the complexity of 3′ UTRs (three prime untranslated regions) relative to that of protein-coding sequences. The study compared how readily segmental substructures can be detected within these two genomic fractions on the basis of sequence composition and conservation.

Two procedures were applied to select the number of classes for each alignment: examining Deviance Information Criterion V (DICV) values (Procedure 1) and examining the stability of the classes (Procedure 2). The dataset summarises the number of classes selected for each sequence by each procedure.
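
As a rough illustration of how a deviance-based criterion can guide the choice of class number, the sketch below scores candidate models from posterior deviance samples. It rests on two assumptions that this record does not state: that DICV is the variant of DIC that adds half the sample variance of the deviance to the mean deviance, and that the preferred model is the one with the smallest score. The function names and the deviance values are hypothetical and are not taken from the dataset.

```python
from statistics import mean, variance


def dic_v(deviance_samples):
    """Score a model as mean deviance plus half the deviance variance.

    Assumption: this is the DIC_V form (penalty p_V = Var(D) / 2);
    the record itself does not define the criterion.
    """
    d_bar = mean(deviance_samples)
    p_v = variance(deviance_samples) / 2.0  # sample variance (n - 1)
    return d_bar + p_v


def select_num_classes(deviances_by_k):
    """Return the class count with the smallest score, plus all scores.

    Assumption: "smallest score wins"; the study may have inspected
    the values in a different way.
    """
    scores = {k: dic_v(samples) for k, samples in deviances_by_k.items()}
    best_k = min(scores, key=scores.get)
    return best_k, scores


# Hypothetical deviance traces for models with 2, 3 and 4 classes.
example = {
    2: [110.0, 112.0, 108.0, 110.0],
    3: [100.0, 101.0, 99.0, 100.0],
    4: [98.0, 106.0, 90.0, 102.0],
}
best_k, scores = select_num_classes(example)
print(best_k, scores)
```

In this toy input the four-class model has the lowest mean deviance but a large deviance variance, so the penalty term leads the sketch to prefer three classes.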

The data indicate that twelve to fourteen segment classes with distinct character frequencies can be distinguished in each of the three coding sequence alignments, using either Procedure 1 or Procedure 2.

Geographical area of data collection

159.255525,-9.219822 112.921454,-9.219822 112.921454,-54.777218 159.255525,-54.777218 159.255525,-9.219822


Algama, Manjula, Oldmeadow, Christopher, Tasker, Edward, Mengersen, Kerrie, & Keith, Jonathan M. (2014) Drosophila 3′ UTRs are more complex than protein-coding sequences. PLoS ONE, 9(5), e97336. http://dx.doi.org/10.1371/journal.pone.0097336

Research areas

Markov models
Biological sciences
Genome analysis
Computational biology
Genome evolution
Sequence analysis
Molecular biology techniques
Probability theory
Bayes theorem
Molecular biology
Genome complexity
Comparative genomics
Sequencing techniques

Cite this collection

Algama, Manjula; Oldmeadow, Christopher; Tasker, Edward; Mengersen, Kerrie; Keith, Jonathan M. (2014): Models selected using two procedures. Table_1.xls. PLOS ONE. 10.1371/journal.pone.0097336.t001.
Algama, Manjula; Oldmeadow, Christopher; Tasker, Edward; Mengersen, Kerrie; Keith, Jonathan M. (2014): An assessment of the complexity of 3' UTRs relative to that of protein-coding sequences: models selected using two procedures. [Queensland University of Technology]. https://doi.org/10.4225/09/585740a0f07ac

Related information

Manjula Algama, Research collaborator http://goo.gl/UQK7fb
Edward Tasker, Research collaborator http://goo.gl/UQK7fb
Jonathan M. Keith, Research collaborator http://goo.gl/UQK7fb
Christopher Oldmeadow, Research collaborator http://goo.gl/UQK7fb

Access the data

Data file types



Creative Commons Attribution 4.0 (CC-BY)


© 2014 Algama et al.

Dates of data collection

From 2013-01-01 to 2013-12-31


Was collected by
Kerrie Mengersen (Researcher)


Name: Distinguished Professor Kerrie Mengersen
Phone: +61 7 3138 2063
Fax: +61 7 3138 2310


Date record created:
Date record modified:
Record status:
Published - Open Access