Semantics and ambiguity of stochastic RNA family models.

Robert Giegerich, Christian Höner zu Siederdissen

Bielefeld University, Bielefeld, Germany. robert@techfak.uni-bielefeld.de

IEEE/ACM transactions on computational biology and bioinformatics / IEEE, ACM 2011 Mar-Apr

filter terms:

Stochastic models, such as hidden Markov models or stochastic context-free grammars (SCFGs) can fail to return the correct, maximum likelihood solution in the case of semantic ambiguity. This problem arises when the algorithm implementing the model inspects the same solution in different guises. It is a difficult problem in the sense that proving semantic nonambiguity has been shown to be algorithmically undecidable, while compensating for it (by coalescing scores of equivalent solutions) has been shown to be NP-hard. For stochastic context-free grammars modeling RNA secondary structure, it has been shown that the distortion of results can be quite severe. Much less is known about the case when stochastic context-free grammars model the matching of a query sequence to an implicit consensus structure for an RNA family. We find that three different, meaningful semantics can be associated with the matching of a query against the model--a structural, an alignment, and a trace semantics. Rfam models correctly implement the alignment semantics, and are ambiguous with respect to the other two semantics, which are more abstract. We show how provably correct models can be generated for the trace semantics. For approaches, where such a proof is not possible, we present an automated pipeline to check post factum for ambiguity of the generated models. We propose that both the structure and the trace semantics are worth-while concepts for further study, possibly better suited to capture remotely related family members.

Citation

Robert Giegerich, Christian Höner zu Siederdissen. Semantics and ambiguity of stochastic RNA family models. IEEE/ACM transactions on computational biology and bioinformatics / IEEE, ACM. 2011 Mar-Apr;8(2):499-516

Mesh Tags

Substances

PMID: 21233528

View Full Text

FAQ

Semantics and ambiguity of stochastic RNA family models.

filter terms:

Citation

var meshTagsSectionCollapsed = true; Mesh Tags

var substancesSectionCollapsed = true; Substances

Mesh Tags

Substances