Whole genome scan of selection markers in Holstein, Najdi and Holstein-Najdi hybrids

Document Type : Research Paper

Authors

1 Department Animal science of agriculture university of Ramin Khozestan

2 Department of Animal Science, Faculty of Animal Science and Food Industry, Ramin Khuzestan University

3 Department of Animal Science, Faculty of Animal Science, University of Zanjan

Abstract

Introduction: In order to protect native breeds in order to maintain genetic diversity, it is necessary to keep the animal in its place of residence because the cost of protecting genetic resources is high and it is not possible to maintain all genetic resources in laboratories and research institutions. The aim of this research was to identify selection traits in Holstein, Najdi and their hybrids in the herds of Khuzestan province.
Material and Methods: For this purpose, the genomic information of 209 cattle samples including 21, 128 and 60 samples from Holstein, Najdi breeds and their Holstein-Najdi hybrids were used, which were genotyped using Genotyping Array 30K arrays. First, animals with more than 5% missing genotype were excluded from the next steps, and monomorphic genotypes and genotypes with a minimum allelic frequency of less than 5%, as well as loci with more than 5% missing genotype, were excluded. Hardy-Weinberg equilibrium was checked in all loci and loci with P-value less than 6-10 were excluded from the data set. In this study, the signs of selection were investigated using two series of methods, two of which are multi-population (FST, hapFLK ) and the other one is single-population (iHS, nSL). To determine the signs of selection in the studied cattle breeds, FST values for each SNP marker were calculated using the unbiased theta estimator method by Plink and R software. Also, in order to identify the signs of selection in the studied breeds, the FLK Single Marker haplotype expansion method called the hapFLK test was also used. iHS and nSL statistics have been used as intra-population statistics in order to investigate the areas carrying selection signs. To calculate these statistics for each SNP, VCFtools and Selscan version 2.0 were used, respectively.
Results and Discussion: After data editing and imputation of 662,428 SNPs from 329 Najdi, Holstein and their hybrids, and after removing 3,129 SNPs with Hardy-Weinberg 6-10 test and 68,116 SNPs with MAF 0.05, 591,181 SNPs remained for further analysis. In order to identify selection signals at the genome level, the numerical value of fixation index (FST) was used. Only 0.1% of the regions of the genome where all markers had high values. Finally, three regions on the genome were selected for further analysis that passed the threshold. Three regions were located on autosomal chromosomes. These three regions, where the stabilization index had a higher numerical value of 0.38, were located on chromosomes 16, 21, and 8, respectively. A threshold of 0.1% of the high hapFLK value of the population was used to identify the regions under selection by the hapFLK method. Finally, the region of 22774764 to 23377643 base pairs of chromosome 14 and the region of 33981177 to 34039961 base pairs of chromosome 26 had a selection mark at the specified threshold, which was used to identify genes in the next step. The iHS statistic was used to identify signs of selection for alleles that are polymorphic in the population, but have not yet reached the fixation stage. iHS less than zero indicates that the respective haplotype carries the derived allele, and iHS greater than zero indicates that the haplotype in question carries the ancestral gene. In this research, the markers with the highest amount of iHS on chromosomes 3, 7 and 10 were identified as selection markers. By using the nSL statistic which was calculated by Selscan software in the Linux environment and drawing the Manhattan plot diagram of this statistic by R software, it was determined that in the Holstein and Holstein*Najdi hybrids population on chromosome 3 and in the Najdi population 4 and 15 regions have been selected. Some of the areas identified with these statistics are directly and indirectly related to growth traits, reproduction, immune system and other traits related to them.
Conclusion: In this research, four statistics, FST, hapFLK, iHS and nSL, were used to identify the signs of selection in Holstein, Najdi cattle populations and their hybrids. The results of FST and hapFLK statistics, which examine the populations in a multi-population way, showed that the signs of selection detected by these two statistics were on chromosomes 8, 14, 16, 21 and 26, and chromosome 16 was common in each of the two strains. Among the areas identified with these statistics, they are directly or indirectly related to the characteristics of milk production and reproduction in cows. The results of the iHS and nSL statistics, which examine the populations as a single population, showed that the signs of selection detected by these two statistics were on chromosomes 3, 4, 7, 10 and 15, which is the chromosome No. 3 was common in every two races. Some of the areas identified with these statistics are directly and indirectly related to growth traits, reproduction, immune system and other traits related to them. Considering that the statistics used are different in terms of the implementation method and the parameters used (the first two are multi-population based on allele frequency and the second two are single-population based on linkage disequilibrium), consequently their results did not match. But from the total of 59 genes identified by the four methods used, two genes, IQCA1, ACKR3, were jointly identified by iHS and nSL statistics. In addition, it can be concluded that the selection during the past years has been generally focused on traits that have been effective in increasing the production efficiency in addition to increasing the resistance of livestock to the environmental conditions of the place of residence.

Keywords

Main Subjects


Barendse W, Harrison   BE,Bunch RJ,Thomas MB and Turner LB, 2009. "Genome wide signatures of positive selection: the comparison of independent samples and the identification of regions associated to traits." BMC Genomics 10(1): 1-15.
Bastin B, Houser A, Bagley C, Ely K, Payton R, Saxton A, Schrick F, Waller J and Kojima C 2014. "A polymorphism in XKR 4 is significantly associated with serum prolactin concentrations in beef cows grazing tall fescue." Animal Genetics 45(3): 439-444.
Boitard S, Boussaha M, Capitan A, Rocha D and Servin B 2016. "Uncovering adaptation from sequence data: lessons from genome resequencing of four cattle breeds." Genetics 203(1): 433-450.
Bonhomme M, Chevalet C, Servin B, Boitard S, Abdallah J, Blott S and SanCristobal M 2010. "Detecting selection in population trees: the Lewontin and Krakauer test extended." Genetics 186(1): 241-262.
Brito L F, Kijas J W, Ventura R V, Sargolzaei M, Porto-Neto L R, Cánovas A, Feng Z, Jafarikia M and Schenkel F S 2017. "Genetic diversity and signatures of selection in various goat breeds revealed by genome-wide SNP markers." BMC Genomics 18: 1-20.
Browning B L, Tian X, Zhou Y and Browning S R 2021. "Fast two-stage phasing of large-scale sequence data." The American Journal of Human Genetics 108(10): 1880-1890.
Daetwyler H D, Pong-Wong R, Villanueva B and Woolliams J A 2010. "The impact of genetic architecture on genome-wide evaluation methods." Genetics 185(3): 1021-1031.
Esmailizadeh A, Bottema C, Sellick G, Verbyla A, Morris C, Cullen N and Pitchford W 2008. "Effects of the myostatin F94L substitution on beef traits." Journal Of Animal Science 86(5): 1038-1046.
Fariello M I, Boitard S, Naya H, SanCristobal M and Servin B 2013. "Detecting signatures of selection through haplotype differentiation among hierarchically structured populations." Genetics 193(3): 929-941.
Frogameri N, Asadifozi M and Esamelizade A 2015. " Scanning the whole genome of eight breeds of native Iranian cattle to identify selection markers" Jornal of Animal Production 18(2): 201-213.
Glyakina A V, Strizhov N I, Karpov M V, Dovidchenko N V, Matkarimov B T, Isaeva L V, Efimova V S, Rubtsov M A, Novikova L A and Donova M V 2019. "Ile351, Leu355 and Ile461 residues are essential for catalytic activity of bovine cytochrome P450scc (CYP11A1)." Steroids 143: 80-90.
Hou J, Qu K, Jia P, Hanif Q, Zhang J, Chen N, Dang R, Chen H, Huang B and Lei C 2020. "A SNP in PLAG1 is associated with body height trait in Chinese cattle." Animal Genetics 51(1): 87-90.
Hudson R R, Bailey K, Skarecky D, Kwiatowski J and Ayala F J 1994. "Evidence for positive selection in the superoxide dismutase (Sod) region of Drosophila melanogaster." Genetics 136(4): 1329-1340.
Kemper K E, Saxton S J, Bolormaa S, Hayes B J and Goddard M E 2014. "Selection for complex traits leaves little or no classic signatures of selection." BMC Genomics 15(1): 1-14.
Kijas J W, Porto‐Neto L, Dominik S, Reverter A, Bunch R, McCulloch R, Hayes B J, Brauning R, McEwan J and Consortium I S G 2014. "Linkage disequilibrium over short physical distances measured in sheep using a high‐density SNP chip." Animal Genetics 45(5): 754-757.
Kuehn L, Keele J, Bennett G, McDaneld T, Smith T, Snelling W, Sonstegard T and Thallman R 2011. "Predicting breed composition using breed frequencies of 50,000 markers from the US Meat Animal Research Center 2,000 Bull Project." Journal Of Animal Science 89(6): 1742-1750.
Liu H, Xu G, Yuan Z, Dong Y, Wang J and Lu W 2017. "Effect of kisspeptin on the proliferation and apoptosis of bovine granulosa cells." Animal Reproduction Science 185: 1-7.
MacEachern S, Hayes B, McEwan J and Goddard M 2009. "An examination of positive selection and changing effective population size in Angus and Holstein cattle populations (Bos taurus) using a high density SNP genotyping platform and the contribution of ancient polymorphism to genomic diversity in Domestic cattle." BMC Genomics 10(1): 1-19.
Manunza A, Cardoso T F, Noce A, Martínez A, Pons A, Bermejo L A, Landi V, Sànchez A, Jordana J, Delgado J V, Adán S, Capote J, Vidal O, Ugarte E, Arranz J J, Calvo J H, Casellas J and Amills M 2016. "Population structure of eleven Spanish ovine breeds and detection of selective sweeps with BayeScan and hapFLK." Scientific Reports 6(1): 1-10.
Mokhber M, Shahrbabak M M, Sadeghi M, Shahrbabak H M and Williams J 2015. "Genome-wide survey of signature of positive selection in Khuzestani and Mazandrani buffalo breeds." Iranian Journal Of Animal Science (IJAS) 46(2): 119-131.
Moradi M H, Nejati-Javaremi A, Moradi-Shahrbabak M, Dodds K G and McEwan J C 2012. "Genomic scan of selective sweeps in thin and fat tail sheep breeds for identifying of candidate regions associated with fat deposition." BMC Genetics 13(1): 1-15.
Purcell S, Neale B, Todd-Brown K, Thomas L, Ferreira M A, Bender D, Maller J, Sklar P, De Bakker P I and Daly M J 2007. "PLINK: a tool set for whole-genome association and population-based linkage analyses." The American Journal Of Human Genetics 81(3): 559-575.
Qanbari S, Gianola D, Hayes B, Schenkel F, Miller S, Moore S, Thaller G and Simianer H 2011. "Application of site and haplotype-frequency based approaches for detecting selection signatures in cattle." BMC Genomics 12(1): 1-12.
Qanbari S, Strom T M, Haberer G, Weigend S, Gheyas A A, Turner F, Burt D W, Preisinger R, Gianola D and Simianer H 2012. "A high resolution genome-wide scan for significant selective sweeps: an application to pooled sequence data in laying chickens." PloS One 7(11): 1-12.
Sabeti P C, Reich D E, Higgins J M, Levine H Z, Richter D J, Schaffner S F, Gabriel S B, Platko J V, Patterson N J and McDonald G J 2002. "Detecting recent positive selection in the human genome from haplotype structure." Nature 419(6909): 832-837.
Sabeti P C, Schaffner S F, Fry B, Lohmueller J, Varilly P, Shamovsky O, Palma A, Mikkelsen T, Altshuler D and Lander E 2006. "Positive natural selection in the human lineage." Science 312(5780): 1614-1620.
Sakamoto Y, Hasegawa K, Moriwaki S, Hara Y, Hamada Y and Sasaki S 2020. "A long‐term study of the effects of SLC12A1 homozygous mutation (g. 62382825G> A, p. Pro372Leu) in Japanese Black cattle." Animal Science Journal 91(1): 1-12.
Smith J M and Haigh J 1974. "The hitch-hiking effect of a favourable gene." Genetics Research 23(1): 23-35.
Smith T P, Grosse W M, Freking B A, Roberts A J, Stone R T, Casas E, Wray J E, White J, Cho J and Fahrenkrug S C 2001. "Sequence evaluation of four pooled-tissue normalized bovine cDNA libraries and construction of a gene index for cattle." Genome Research 11(4): 626-630.
Stella A, Ajmone-Marsan P, Lazzari B and Boettcher P 2010. "Identification of selection signatures in cattle breeds selected for dairy production." Genetics 185(4): 1451-1461.
Szpiech Z A and Hernandez R D 2014. "selscan: an efficient multithreaded program to perform EHH-based scans for positive selection." Molecular Biology And Evolution 31(10): 2824-2827.
Tang K, Thornton K R and Stoneking M 2007. "A new approach for using genome scans to detect recent positive selection in the human genome." PLoS Biology 5(7): 1587-1602.
Thue T and Buchanan F 2003. "Linkage mapping of FBN1 to bovine chromosome 11 " Animal Genetic 34(2): 149-160.
Tripathi A K, Koringa P G, Jakhesara S J, Ahir V B, Ramani U V, Bhatt V D, Sajnani M R, Patel D A, Joshi A J and Shanmuga S J 2012. "A preliminary sketch of horn cancer transcriptome in Indian zebu cattle." Gene 493(1): 124-131.
VanRaden P M (2008). "Efficient methods to compute genomic predictions." Journal Of Dairy Science 91(11): 4414-4423.
Voight B F, Kudaravalli S, Wen X and Pritchard J K 2006. "A map of recent positive selection in the human genome." PLoS Biology 4(3): 446-458.
Weir B S and Cockerham C C 1984. "Estimating F-statistics for the analysis of population structure." Evolution: 1358-1370.
Zimin A V, Delcher A L, Florea L, Kelley D R, Schatz M C, Puiu D, Hanrahan F, Pertea G, Van Tassell C P and Sonstegard T S 2009. "A whole-genome assembly of the domestic cow, Bos taurus." Genome Biology 10: 1-10.