Logotipo do repositório
 

Publicação:
Pattern recognition analysis on long noncoding RNAs: a tool for prediction in plants

dc.contributor.authorNegri, Tatianne da Costa
dc.contributor.authorAlves, Wonder Alexandre Luz
dc.contributor.authorBugatti, Pedro Henrique
dc.contributor.authorSaito, Priscila Tiemi Maeda
dc.contributor.authorDomingues, Douglas Silva [UNESP]
dc.contributor.authorPaschoal, Alexandre Rossi
dc.contributor.institutionUniversidade Nove de Julho
dc.contributor.institutionUTFPR
dc.contributor.institutionUniversidade Estadual Paulista (Unesp)
dc.date.accessioned2019-10-06T15:46:52Z
dc.date.available2019-10-06T15:46:52Z
dc.date.issued2019-03-25
dc.description.abstractMOTIVATION: Long noncoding RNAs (lncRNAs) correspond to a eukaryotic noncoding RNA class that gained great attention in the past years as a higher layer of regulation for gene expression in cells. There is, however, a lack of specific computational approaches to reliably predict lncRNA in plants, which contrast the variety of prediction tools available for mammalian lncRNAs. This distinction is not that obvious, given that biological features and mechanisms generating lncRNAs in the cell are likely different between animals and plants. Considering this, we present a machine learning analysis and a classifier approach called RNAplonc (https://github.com/TatianneNegri/RNAplonc/) to identify lncRNAs in plants. RESULTS: Our feature selection analysis considered 5468 features, and it used only 16 features to robustly identify lncRNA with the REPTree algorithm. That was the base to create the model and train it with lncRNA and mRNA data from five plant species (thale cress, cucumber, soybean, poplar and Asian rice). After an extensive comparison with other tools largely used in plants (CPC, CPC2, CPAT and PLncPRO), we found that RNAplonc produced more reliable lncRNA predictions from plant transcripts with 87.5% of the best result in eight tests in eight species from the GreeNC database and four independent studies in monocotyledonous (Brachypodium) and eudicotyledonous (Populus and Gossypium) species.en
dc.description.affiliationDepartment of Computer Science Bioinformatics Graduate Program (PPGBIOINFO) Federal University of Technology - Paraná UTFPR Brazil and Informatics and Knowledge Management Graduate Program Universidade Nove de Julho, Campus Cornélio
dc.description.affiliationInformatics and Knowledge Management Graduate Program Universidade Nove de Julho
dc.description.affiliationDepartment of Computer Science Bioinformatics Graduate Program (PPGBIOINFO) Federal University of Technology - Paraná UTFPR, Campus Cornélio
dc.description.affiliationDepartment of Computer Science Bioinformatics Graduate Program (PPGBIOINFO) Federal University of Technology - Paraná UTFPR Brazil and Department of Botany Institute of Biosciences São Paulo State University UNESP, Campus Cornélio
dc.description.affiliationUnespDepartment of Computer Science Bioinformatics Graduate Program (PPGBIOINFO) Federal University of Technology - Paraná UTFPR Brazil and Department of Botany Institute of Biosciences São Paulo State University UNESP, Campus Cornélio
dc.format.extent682-689
dc.identifierhttp://dx.doi.org/10.1093/bib/bby034
dc.identifier.citationBriefings in bioinformatics, v. 20, n. 2, p. 682-689, 2019.
dc.identifier.doi10.1093/bib/bby034
dc.identifier.issn1477-4054
dc.identifier.scopus2-s2.0-85067536297
dc.identifier.urihttp://hdl.handle.net/11449/187774
dc.language.isoeng
dc.relation.ispartofBriefings in bioinformatics
dc.rights.accessRightsAcesso aberto
dc.sourceScopus
dc.subjectbioinformatics
dc.subjectfeatures
dc.subjectlong RNAs
dc.subjectmachine learning
dc.subjectpattern recognition
dc.subjecttool
dc.titlePattern recognition analysis on long noncoding RNAs: a tool for prediction in plantsen
dc.typeArtigo
dspace.entity.typePublication

Arquivos

Coleções