Genomic scan for detection of copy number variations in Caspian and Turkman horses

Document Type : Research Paper

Authors

1 Sari agricuitural sciencea and natural Resource

2 Sari Agricultural Sciences and Natural Resources University

3 Sari Agriculture Sciences and Natural Resources University

4 Department of Animal Science, University of Zanjan

Abstract

Introduction: Horses have played an important role in the history of Iranians during different centuries. They kept horses for various aims such as agriculture, transportation, sport, food sources. Iran has a suitable climatic, social and economic potential for keeping and breeding horses, that is why it has been created different breeds using the selection and breeding. But due to problems such as mechanization, lack of government support, export ban, high costs of breeding and maintenance, import of foreign horses, lack of proper planning, interest in keeping horses has decreased. So, unfortunately, only a few native Iranian breeds remain. Additional investigation of the equine genomic architecture is critical for a better understanding of the equine genome, as well as for expanded comparisons across diverse mammalian species. Turkmen and Caspian horses are well-known breeds of Iranian horse breeds. These breeds were historically selected to perform distinct tasks and therefore may harbor a wealth of unique variation at the genome level. Copy number variation (CNV) along with single nonucleotide polymorphisms (SNPs) play a key role in genetic diversity in livestock species. CNVs, a term that refers to a change in the number of copies of a genomic segment, are responsible for more sequence differences between individuals than SNPs and are considered to be a major source of genetic variation contributing to differences in phenotypes (Beckmann et al, 2007). Several studies identified copy number variations in horses using different techniques (Doan et al, 2012). Part of these studies tried to establish associations between CNVs and a specific trait, a disease or even gene expression (Schurink et al, 2017). Most of these studies found either no association or inconclusive associations as the number of horses with phenotypic information or with specific CNVs were limited. For example, a 62 kb duplication on Equus caballus (ECA) chromosome 10 seemed to be related to recurrent laryngeal neuropathy (Dupuis et al, 2013). However, little is known about CNV in Iranian horses.
Materials and Methods: In this study, detection of CNVs and CNVRs were performed based on SNP data from Caspian and Turkman horse breeds were genotyped via Equine70k SNP beadchip. PennCNV software was only used to detect CNV on autosomes. The PennCNV algorithm was only applied to autosomes (command: -lastchr 26) to identify individual-based CNVs. To increase the confidence of the detected CNVs, quality control was performed by employing standard exclusions of the LRR (standard deviation of LRR) <0.3, a BAF drift <0.01 and a waviness factor <0.05. We classified the status of these CNV into two categories: “loss” (CNV containing a deletion) and “gain” (CNV containing a duplication). The CNVRs were determined by aggregating the overlapping CNVs with CNVRuler. BioMart in the Ensembl database and DAVID was employed to identify genes located in CNVRs and GO terms and KEGG pathway analyses respectively. Quantitative real-time PCR (qPCR) was applied to validate the CNVRs that were detected in this study.
Results and Discussion: A total of 202 and 105 CNVs and CNVRs were identified in the studied horses, respectively, which cover 1.08% of the horse genome. The number CNVs in Caspian breed were 1.6 times more than Turkmen. Also, the average size of CNVs in Caspian breed was longer than Turkmen. In both breeds, the genetic event of gain was higher than the genetic event of deletion. In Caspian breed, chromosomes 1, 3, 12, and in the Turkmen breed, chromosomes 1, 6 and 12 showed the most changes in CNVs, respectively. Functional analysis showed that the identified CNVRs overlapped with 434 genes and the most of these genes were common between the two horse breeds (more than 60%). Among these genes, PPARG and GALR have potential related with breed-specific traits. The KEGG pathway analysis also identified several pathways that are significantly enriched in olfactory sensory perception, chemical stimulus sensory perception, antigen processing, and G protein signaling pathway. Also, 60% of successfully detected CNVRs were confirmed by Real-time qPCR. The results of this study were compared with the results of eight other studies. For example, we concluded that the average size of the CNVRs detected by the 70k arrays and the 50K arrays were significantly larger than obtained by the CGH and NGS arrays. This may be due to the relatively low coverage and non-uniform distribution of SNP in the equine genome in SNP arrays. Possible reasons for the differences between our results and some CNV studies can be related to different parameters such as sample size and genetic background, different detection platforms and CNV retrieval algorithms, CNV definitions and CNVRs, as well as random error estimation methods (Pinto et al. 2011).
Conclusions: CNVs can describe part of the phenotypic diversity and adaptation evidence in Iranian horses. With regard to Genes identified in a number of cellular components, biological processes and molecular functions within CNVRs, the importance of such CNVRs and the possible effect needs to be studied and may interest insight into the functional and adaptive consequence of CNVs in horse. In total, the number of CNVs in the Caspian breed was greater than in the Turkmen breed, and also the CNV length in the number of copies in the Caspian breed was greater than in the Turkmen breed. In both breeds, there were overlapping genes with CNVRs that were significantly enriched in biological pathways, including sensory perception, immunity, and metabolism. This is the first CNV report on Turkmen and Caspian horses and the findings of this study could provide valuable information for better understanding of the horse genome and also the important performance traits with CNVRs and associated genes for the future studies in horse breeds.

Keywords

Main Subjects


Abdoli M, Zandi MB, Harkinezhad T and Kahlili M. 2021. Genetic structure survey of Iranian native horse breeds by microsatellite markers, Journal of Animal Productions 23(2):155-163.
Ala-Amjadi M, Yeganeh H.M and Sadeghi M. 2017. Study of genetic variation in Iranian Kurdish horse using microsatellite marker. Iranian Journal of Animal Science 48(3):335-342.
Beckmann JS, Estivill X  and Antonarakis SE. 2007. Copy number variants and genetic traits: closer to the resolution of phenotypic to genotypic variability. Nature Reviews Genetics 8:639–46
Chen CY, Qiao RM, Wei RX, Guo YM, Ai HS, Ma JW, Ren J and Huang LS. 2012. A comprehensive survey of copy number variation in 18 diverse pig populations and identification ofcandidate copy number variable genes associated with complex traits. BMC Genomics 13, 733:1-12
Clop A, Vidal O and Amills M, 2012. Copy number variation in the genomes ofdomestic animals. Animal Genetics. 43: 503–17.
Corbi-Botto CM, Morales-Durand H, Zappa ME, Sadaba SA, Peral-García P, Giovambattista G, and Díaz S. 2019. Genomic structural diversity in Criollo Argentino horses: Analysis of copy number variations. Gene 695: 26-31.
Depczynski B, Nichol K, Faithi Z, Iismaa T, Shine J and Cunningham A. 1998. Distribution and characterization of the cell type sex expressing GALR2 mRNA in brain and pituitary gland. Annals ofthe New York Academy of Sciences 863:120–128.
Doan R, Cohen N, Harrington J, Veazey K, Juras R, Cothran G, McCue ME, Skow L and Dindot SV.2012. Identification of copy number variants in horses. Genome Research. 22: 899–907.
Dupuis, MC, Zhang Z, Durkin K, Charlier C, Lekeux P and Georges M. 2013. Detection of copy number variants in the horse genome and examination of their association with recurrent laryngeal neuropathy. Animal Genetics 44 (2), 206–208.
Firouz L. 1998. Original ancestors of the Turkoman and Caspian horses. First International Conference on Turkoman horses; Ashgabat, Turkmenistan.
Fotovati A, 2000. Persian horse breeds from ancient time to present and their rules in development of world horse breeds. Asian-Australasian Journal of Animal Sciences, 13(Suppl): 401.
Freeman JL, Perry GH, Feuk L, Redon R, McCarroll SA, Altshuler DM, Aburatani H, Jones KW, Tyler-Smith C, Hurles ME and Carter NP. 2006. Copy number variation: new insights in genome diversity. Genome Research,16(8): 949-61.
Ghosh S, Qu Z, Das PJ., Fang E, Juras R, Cothran EG, McDonell S, Lear TL, Adelson DL, Chowdhary BP, Raudsepp T, 2014. Copy number variation in thehorse genome. PLoS Genetics, (10): e1004712.
Hevener AL, He W, Barak Y, Le J, Bandyopadhyay G, Olson P,Wilkes J, Evans RM. and Olefsky J, 2003. Muscle-specific Pparg deletion causes insulin resistance. Nature Medicine, 9: 1491–1497.
Hou Y, Liu GE, Bickhart DM, Cardone MF, Wang K, Kim ES, Matukumalli LK, Ventura M, Song J, VanRaden PM and Sonstegard TS. 2011. Genomic characteristics of cattle copy number variations. BMC Genomics,12(1): 127.
Hussain A, Saraiva LR and Korsching SI. 2009. Positive Darwinian selection and the birth of an olfactory receptor cladein teleosts. Proceedings of the National Academy of Sciences of the United States of America 106, 4313–8.
Jiang L, Jiang J, Wang J, Ding X, Liu J. and Zhang Q, 2012. Genome-wide identification of copy number variations in Chinese Holstein. Plos One, 7(11): e48732.
Jun J, Cho, YS, Hu H, Kim HM, Jho S, Gadhvi P, ParkKM, Lim J and Paek WK,Han K. 2014. Whole genome sequence and analysis of the Marwari horse breed and its genetic origin. BMC Genomics 15, S4.
Kader A, Liu X, Dong K, Song S, Pan J, Yang M, Chen X, He X, Jiang L and Ma Y. 2016. Identification of copy number variations in three Chinese horse breeds using 70K single nucleotide polymorphism BeadChip array. Animal Genetics 47 (5): 560–569.
Keel B, Nonneman D, Lindholm-Perry AK, Oliver W. and Rohrer G, 2019. A survey of copy number variation in the porcine genome detected from whole-genome sequence. Frontiers in Genetics 10: 737.
Kim JH, Hu HJ, Yim SH, Bae JS, Kim SY and Chung YJ. 2012. CNVRuler: a copy number variation-based case–control association analysis tool. Bioinformatics 28(13): 1790-1792.
Krueger K and Flauger B. 2011. Olfactory recognition of individual competitors by means of faeces in horse (Equus caballus). Animal Cognition 14, 245–57.
Liu J, Calhoun VD, Chen J, Claus E D and Hutchison KE, 2013. Effect of homozygous deletions at 22q13. 1 on alcohol dependence severity and cue‐elicited BOLD response in the precuneus. Addiction Biology 18(3): 548-558.
Liu J, Zhang L, Xu L, Ren H, Lu J, Zhang X, Zhang S, Zhou X, Wei C, Zhao F and Du L. 2013. Analysis of copy number variations in the sheep genome using 50K SNP BeadChip array. BMC Genomics 14(1): 229.
Metzger J, Philipp U, Lopes MS, Machado da Camara, A, Felicetti, M, Silvestrelli M and Distl O. 2013. Analysis of copy number variants by three detection algorithms and their association with body size in horses. BMC Genomics 14 (1), 487.
Paudel Y, Madsen O, Megens HJ, Frantz LA, Bosse M, Bastiaansen JW, Crooijmans RP and Groenen MA, 2013. Evolutionary dynamics of copy number variation in pig genomes in the context of adaptation and domestication. BMC Genomics, 14(1): 449.
Pawlina-Tyszko K, Gurgul A, Szmatoła T, Ropka-Molik K, Semik-GurgulEKlukowska-Rötzler J, Koch C, Mählmann K and Bugno-Poniewierska M, 2017. Genomic landscape of copy number variation and copy neutral loss of heterozygosity events in equine sarcoids reveals increased instability of the sarcoid genome. Biochimie 140: 122–32.
Pinto D, Darvishi K, Shi X, Rajan D, Rigler D, Fitzgerald T, Lionel AC, Thiruvahindrapuram B, MacDonald JR, Mills R and Prasad A. 2011. Comprehensive assessment of array-based platforms and calling algorithms for detection of copy number variants. Nature Biotechnology 29(6): 512.
Purcell S, Neale B, Todd-Brown K, Thomas L, Ferreira MA, Bender D, Maller J, Sklar P, De Bakker PI, Daly MJ. and Sham PC, 2007. PLINK: a tool set for whole-genome association and population-based linkage analyses. The American Journal of Human Genetics 81(3): 559-75.
Rafati N, Andersson LS, Mikko S, Feng C, Raudsepp T and Pettersson J, 2016. Large deletions at the SHOX locus in the pseudoautosomal region are associated with skeletal atavism in Shetland ponies. G3 (Bethesda), 6(7): 2213–23.
Redon R, Ishikawa S, Fitch KR, Feuk L, Perry GH and Andrews TD. 2006. Global variation in copy number in the human genome. Nature, 444:444–454.
Rosengren PG, Golovko A, Sundstrom E, Curik I, Lennartsson J and Druml T,2008. A cis-acting regulatory mutation causes premature hair graying and susceptibility to melanoma in the horse. Nature Genetics 40:1004–9.
Sadeghi R, Moradi-Shahrbabak M, MiraeiAshtiani SR, Schlamp F, Cosgrove EJ and Antczak DF. 2019. Genetic diversity of Persian Arabian horses and their relationship to other native Iranian horse breeds. Journal of Heredity, 110(2): 173-182.
Schurink A, da Silva VH, Velie BD, Dibbits BW, Crooijmans RPMA and Franҫois L.2018. Copy number variations in Friesian horses and genetic risk factors for insect bite hypersensitivity. BMC Genomic 19: 49.
Sharp AJ, Cheng Z and Eichler EE, 2006. Structural variation of the human genome. Annual Review of Genomics and Human Genetics 7: 407–442.
Stachowiak M, Szczerbal I, Nowacka-Woszuk J, Jackowiak H, Sledzinski P, Iskrzak P, Stanislaw D and Switonski M. 2017. Polymorphisms in the SOX9 region and testicular disorder of sex development (38, XX; SRY-negative) in pigs. Livestock Science 203:48–53.
Vila C, Leonard JA, Gotherstrom A, Marklund S, Sandberg K, Liden K, WayneRK and Ellegren H. 2001. Widespread origins of domestic horse lineages. Science 291: 474–477.
Wang K, Li M, Hadley D, Liu R, Glessner J, Grant SF. and Bucan M, 2007. PennCNV: an integrated hidden Markov model designed for high-resolution copy number variation detection in whole-genome SNP genotyping data. Genome Research 17(11):1665-1674.
Wang W, Wang S, Hou C, Xing Y, Cao J, Wu K, Liu C, Zhang D, Zhang L and Zhang Y. 2014. Genome-wide detection of copy number variations among diverse horse breeds by array CGH. Plos One, 9, e86860.
Young JM, Endicott RM, Parghi SS, Walker M, Kidd JM. and Trask BJ, 2008. Extensive copy-number variation of the human olfactory receptor gene family. The American Journal of Human Genetics 83(2): 228-42.
Young JM, Endicott RM, Parghi SS, Walker M, Kidd JM and Trask BJ. 2008. Extensive copy-number variation of the human olfactory receptor gene family. The American Journal of Human Genetics 83(2): 228-42.
Yousefi-Mashouf N, Mehrabani-Yeganeh H, Nejati-Javaremi A, Bailey E and Petersen JL. 2021. Genomic comparisons of Persian Kurdish, Persian Arabian and American Thoroughbred horse populations. Plos One, 16(2): e0247123.
Zandi Baghcheh Maryam MB, 2014. Genomic scan for detection of selective sweeps in Turkmen horse population. PhD. Campus of Agriculture and Natural Resources, University of Tehran, Tehran, Iran, 112 pages.