Repository logo
 

Publication:
Identification of novel protein-coding sequences in Eucalyptus grandis plants by high-resolution mass spectrometry

dc.contributor.authorJorge, Gabriel Lemes [UNESP]
dc.contributor.authorBalbuena, Tiago Santana [UNESP]
dc.contributor.institutionUniversidade Estadual Paulista (Unesp)
dc.date.accessioned2021-06-25T12:33:39Z
dc.date.available2021-06-25T12:33:39Z
dc.date.issued2021-03-01
dc.description.abstractEucalyptus species are widely used in the forestry industry, and a significant increase in the number of sequences available in database repositories has been observed for these species. In proteomics, a protein is identified by correlating the theoretical fragmentation spectrum derived from genomic/transcriptomic data against the experimental fragmentation mass spectrum acquired from large-scale analysis of protein mixtures. Proteogenomics is an alternative approach that can identify novel proteins encoded by regions previously considered as non-coding. This study aimed to confidently identify and confirm the existence of previously unknown protein-coding sequences in the Eucalyptus grandis genome. To this end, we used a modified spectral correlation strategy and a dedicated de novo peptide sequencing pipeline. Upon the strategy used here, we confidently identified 41 novel peptide forms and six peptides containing at least one single amino acid substitution. The most representative genomic class of novel peptides was identified as originating from alternative reading frames. In contrast, no clear single amino acid substitution pattern was identified. Validation of the identifications was carried out using a parallel reaction monitoring approach that provided further mass spectrometry support for the existence of the novel peptide sequences. Data are available via ProteomeXchange with identifier PXD022110.en
dc.description.affiliationSao Paulo State Univ, Dept Technol, Jaboticabal, SP, Brazil
dc.description.affiliationUnespSao Paulo State Univ, Dept Technol, Jaboticabal, SP, Brazil
dc.description.sponsorshipConselho Nacional de Desenvolvimento Científico e Tecnológico (CNPq)
dc.description.sponsorshipFundação de Amparo à Pesquisa do Estado de São Paulo (FAPESP)
dc.description.sponsorshipIdCNPq: 400459/2016-7
dc.description.sponsorshipIdFAPESP: 2018/15035-8
dc.format.extent9
dc.identifierhttp://dx.doi.org/10.1016/j.bbapap.2020.140594
dc.identifier.citationBiochimica Et Biophysica Acta-proteins And Proteomics. Amsterdam: Elsevier, v. 1869, n. 3, 9 p., 2021.
dc.identifier.doi10.1016/j.bbapap.2020.140594
dc.identifier.issn1570-9639
dc.identifier.urihttp://hdl.handle.net/11449/209919
dc.identifier.wosWOS:000608739500006
dc.language.isoeng
dc.publisherElsevier B.V.
dc.relation.ispartofBiochimica Et Biophysica Acta-proteins And Proteomics
dc.sourceWeb of Science
dc.subjectBottom-up proteomics
dc.subjectParallel reaction monitoring
dc.subjectProteogenomics
dc.subjectProteomics
dc.titleIdentification of novel protein-coding sequences in Eucalyptus grandis plants by high-resolution mass spectrometryen
dc.typeArtigo
dcterms.licensehttp://www.elsevier.com/about/open-access/open-access-policies/article-posting-policy
dcterms.rightsHolderElsevier B.V.
dspace.entity.typePublication
unesp.departmentTecnologia - FCAVpt

Files