On the Search for Retrotransposons: Alternative Protocols to Obtain Sequences to Learn Profile Hidden Markov Models
dc.contributor.author | Fischer, Carlos N. [UNESP] | |
dc.contributor.author | Campos, Victor De A. [UNESP] | |
dc.contributor.author | Barella, Victor H. | |
dc.contributor.institution | Universidade Estadual Paulista (Unesp) | |
dc.contributor.institution | Universidade de São Paulo (USP) | |
dc.date.accessioned | 2018-12-11T17:20:08Z | |
dc.date.available | 2018-12-11T17:20:08Z | |
dc.date.issued | 2018-05-01 | |
dc.description.abstract | Profile hidden Markov models (pHMMs) have been used to search for transposable elements (TEs) in genomes. For the learning of pHMMs aimed to search for TEs of the retrotransposon class, the conventional protocol is to use the whole internal nucleotide portions of these elements as representative sequences. To further explore the potential of pHMMs in such a search, we propose five alternative ways to obtain the sets of representative sequences of TEs other than the conventional protocol. In this study, we are interested in Bel-PAO, Copia, Gypsy, and DIRS superfamilies from the retrotransposon class. We compared the pHMMs of all six protocols. The test results show that, for each TE superfamily, the pHMMs of at least two of the proposed protocols performed better than the conventional one and that the number of correct predictions provided by the latter can be improved by considering together the results of one or more of the alternative protocols. | en |
dc.description.affiliation | Department of Statistics Applied Maths and Computer Sciences UNESP-São Paulo State University, Avenida 24-A, 1515 Rio Claro | |
dc.description.affiliation | Department of Computer Sciences Institute of Biosciences UNESP-São Paulo State University | |
dc.description.affiliation | Department of Computer Sciences Institute of Mathematical and Computer Sciences USP - University of São Paulo | |
dc.description.affiliationUnesp | Department of Statistics Applied Maths and Computer Sciences UNESP-São Paulo State University, Avenida 24-A, 1515 Rio Claro | |
dc.description.affiliationUnesp | Department of Computer Sciences Institute of Biosciences UNESP-São Paulo State University | |
dc.format.extent | 517-527 | |
dc.identifier | http://dx.doi.org/10.1089/cmb.2017.0219 | |
dc.identifier.citation | Journal of Computational Biology, v. 25, n. 5, p. 517-527, 2018. | |
dc.identifier.doi | 10.1089/cmb.2017.0219 | |
dc.identifier.issn | 1066-5277 | |
dc.identifier.scopus | 2-s2.0-85046884932 | |
dc.identifier.uri | http://hdl.handle.net/11449/176325 | |
dc.language.iso | eng | |
dc.relation.ispartof | Journal of Computational Biology | |
dc.relation.ispartofsjr | 0,824 | |
dc.rights.accessRights | Acesso aberto | |
dc.source | Scopus | |
dc.subject | profile hidden Markov models | |
dc.subject | retrotransposons | |
dc.subject | transposable elements. | |
dc.title | On the Search for Retrotransposons: Alternative Protocols to Obtain Sequences to Learn Profile Hidden Markov Models | en |
dc.type | Artigo | |
dspace.entity.type | Publication | |
unesp.author.lattes | 1858554355077119[1] | |
unesp.author.orcid | 0000-0002-5598-6263[1] | |
unesp.campus | Universidade Estadual Paulista (UNESP), Instituto de Geociências e Ciências Exatas, Rio Claro | pt |
unesp.department | Estatística, Matemática Aplicada e Computação - IGCE | pt |