Analysis and comparison of the STR genotypes called with HipSTR, STRait Razor and toaSTR by using next generation sequencing data in a Brazilian population sample

dc.contributor.authorValle-Silva, Guilherme
dc.contributor.authorFrontanilla, Tamara Soledad
dc.contributor.authorAyala, Jesús
dc.contributor.authorDonadi, Eduardo Antonio
dc.contributor.authorSimões, Aguinaldo Luiz
dc.contributor.authorCastelli, Erick C. [UNESP]
dc.contributor.authorMendes-Junior, Celso Teixeira
dc.contributor.institutionUniversidade de São Paulo (USP)
dc.contributor.institutionSoftec S.R.L
dc.contributor.institutionUniversidade Estadual Paulista (UNESP)
dc.date.accessioned2022-04-29T08:46:35Z
dc.date.available2022-04-29T08:46:35Z
dc.date.issued2022-05-01
dc.description.abstractShort tandem repeats (STRs) are particularly difficult to genotype with rapid evolving next-generation sequencing (NGS) technology. Long amplicons containing repetitive sequences result in alignment and genotyping errors. Stutters arising from polymerase slippage often result in reads with additional or missing repeat copies. Many tools are available for analysis of STR markers from NGS data. This study has evaluated the concordance of the HipSTR, STRait Razor, and toaSTR tools for STR genotype calling; NGS data obtained from a highly genetically diverse Brazilian population sample have been used. We found that toaSTR can retrieve a larger number of genotypes (93.8%), whereas HipSTR (84.9%) and STRait Razor present much lower genotype calling (75.3%). Accuracy levels for genotype calling are very similar (identical genotypes ~95% and correct alleles ~ 97.5%) across the three methods. All the markers presenting the same genotype through the methods are in Hardy–Weinberg equilibrium. We found that combined match probability and combined exclusion power are 2.90 × 10−28 and 0.99999999982, respectively. Although toaSTR has varying locus-specific differences and better overall performance of toaSTR, the three programs are reliable genotyping tools. Notwithstanding, additional effort is necessary to improve the genotype calling accuracy of next-generation sequencing datasets.en
dc.description.affiliationDepartamento de Química Laboratório de Pesquisas Forenses e Genômicas Faculdade de Filosofia Ciências e Letras de Ribeirão Preto Universidade de São Paulo, SP
dc.description.affiliationDepartamento de Genética Faculdade de Medicina de Ribeirão Preto Universidade de São Paulo, SP
dc.description.affiliationSoftec S.R.L
dc.description.affiliationDivisão de Imunologia Clínica Departamento de Clínica Médica Faculdade de Medicina de Ribeirão Preto Universidade de São Paulo, SP
dc.description.affiliationSão Paulo State University (UNESP) Department of Pathology School of Medicine, SP
dc.description.affiliationUnespSão Paulo State University (UNESP) Department of Pathology School of Medicine, SP
dc.identifierhttp://dx.doi.org/10.1016/j.fsigen.2022.102676
dc.identifier.citationForensic Science International: Genetics, v. 58.
dc.identifier.doi10.1016/j.fsigen.2022.102676
dc.identifier.issn1878-0326
dc.identifier.issn1872-4973
dc.identifier.scopus2-s2.0-85124188207
dc.identifier.urihttp://hdl.handle.net/11449/231614
dc.language.isoeng
dc.relation.ispartofForensic Science International: Genetics
dc.sourceScopus
dc.subjectBioinformatics
dc.subjectBrazil
dc.subjectCODIS
dc.subjectForensic genetics
dc.subjectMassively parallel sequencing
dc.subjectShort tandem repeats
dc.titleAnalysis and comparison of the STR genotypes called with HipSTR, STRait Razor and toaSTR by using next generation sequencing data in a Brazilian population sampleen
dc.typeArtigo

Arquivos