AgResearch
Browse

File(s) not publicly available

Linkage disequilibrium estimation in low coverage high-throughput sequencing data

High-throughput sequencing methods that multiplex a large number of individuals have provided a cost-effective approach for discovering genome-wide genetic variation in large populations. These sequencing methods are increasingly being utilized in population genetic studies across a diverse range of species. Two side-effects of these methods, however, are (1) sequencing errors and (2) heterozygous genotypes called as homozygous due to only one allele at a particular locus being sequenced, which occurs when the sequencing depth is insufficient. Both of these errors have a profound effect on the estimation of linkage disequilibrium (LD) and, if not taken into account, lead to inaccurate estimates. We developed a new likelihood method, GUS-LD, to estimate pairwise linkage disequilibrium using low coverage sequencing data that accounts for undercalled heterozygous genotypes and sequencing errors. Our findings show that accurate estimates were obtained using GUS-LD, whereas underestimation of LD results if no adjustment is made for the errors.

History

Rights statement

© 2018, Genetics

Language

  • English

Does this contain Māori information or data?

  • No

Publisher

Genetics Society of America

Journal title

Genetics

ISSN

0016-6731

Citation

Bilton, T. P., McEwan, J. C., Clarke, S. M., Brauning, R., Van Stijn, T. C., Rowe, S. J., & Dodds, K. G. (2018). Linkage disequilibrium estimation in low coverage high-throughput sequencing data. Genetics, 209(2), 389-400. doi:10.1534/genetics.118.300831

Funder

Ministry of Business Innovation & Employment

Contract number

A20201

Job code

49050

Usage metrics

    Exports

    RefWorks
    BibTeX
    Ref. manager
    Endnote
    DataCite
    NLM
    DC