Bpop: an efficient program for estimating base population allele frequencies in single and multiple group structured populations

Authors

  • Ismo Stranden Natural Resources Institute Finland (Luke)
  • Esa A. Mäntysaari Natural Resources Institute Finland (Luke)

Keywords:

allele frequency, computer software, genomic evaluation, relationship matrix

Abstract

Base population allele frequencies (AF) should be used in genomic evaluations. A program named Bpop was implemented to estimate base population AF using a generalized least squares (GLS) method when the base population individuals can be assigned to groups. The required dense matrix products involving (A22 )-1v were implemented efficiently using sparse submatrices of A-1, where A and A22 are pedigree relationship matrices for all and genotyped animals, respectively. Three approaches were implemented: iteration on pedigree (IOP), iteration in memory (IM), and direct inversion by sparsity preserving Cholesky decomposition (CHM). The test data had 1.5 million animals genotyped using 50240 markers. Total computing time (the product (A22)-11) was 53 min (1.2 min) by IOP, 51 min (0.3 min) by IM, and 56 min (4.6 min) by CHM. Peak computer core memory use was 0.67 GB by IOP, 0.80 GB by IM, and 7.53 GB by CHM. Thus, the IOP and IM approaches can be recommended for large data sets because of their low memory use and computing time.

Downloads

Download data is not yet available.
Section
Articles

Published

2020-06-18

How to Cite

Stranden, I., & Mäntysaari, E. A. (2020). Bpop: an efficient program for estimating base population allele frequencies in single and multiple group structured populations. Agricultural and Food Science, 29(3), 166–176. https://doi.org/10.23986/afsci.90955
Received 2020-03-17
Accepted 2020-05-15
Published 2020-06-18