Copy number variations in the genome of the Qatari population

<p>The populations of the Arabian Peninsula remain the least represented in public genetic databases, both in terms of single nucleotide variants and of larger genomic mutations. We present the first high-resolution copy number variation (CNV) map for a Gulf Arab population, using a hybrid app...

Full description

Saved in:
Bibliographic Details
Main Author: Khalid A. Fakhro (3158862) (author)
Other Authors: Noha A. Yousri (1392577) (author), Juan L. Rodriguez-Flores (18806845) (author), Amal Robay (3158892) (author), Michelle R. Staudt (18806848) (author), Francisco Agosto-Perez (519205) (author), Jacqueline Salit (124697) (author), Joel A. Malek (10327973) (author), Karsten Suhre (67967) (author), Amin Jayyousi (3158868) (author), Mahmoud Zirie (124704) (author), Dora Stadler (3158865) (author), Jason G. Mezey (8813648) (author), Ronald G. Crystal (8813645) (author)
Published: 2015
Subjects:
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1864513511365804032
author Khalid A. Fakhro (3158862)
author2 Noha A. Yousri (1392577)
Juan L. Rodriguez-Flores (18806845)
Amal Robay (3158892)
Michelle R. Staudt (18806848)
Francisco Agosto-Perez (519205)
Jacqueline Salit (124697)
Joel A. Malek (10327973)
Karsten Suhre (67967)
Amin Jayyousi (3158868)
Mahmoud Zirie (124704)
Dora Stadler (3158865)
Jason G. Mezey (8813648)
Ronald G. Crystal (8813645)
author2_role author
author
author
author
author
author
author
author
author
author
author
author
author
author_facet Khalid A. Fakhro (3158862)
Noha A. Yousri (1392577)
Juan L. Rodriguez-Flores (18806845)
Amal Robay (3158892)
Michelle R. Staudt (18806848)
Francisco Agosto-Perez (519205)
Jacqueline Salit (124697)
Joel A. Malek (10327973)
Karsten Suhre (67967)
Amin Jayyousi (3158868)
Mahmoud Zirie (124704)
Dora Stadler (3158865)
Jason G. Mezey (8813648)
Ronald G. Crystal (8813645)
author_role author
dc.creator.none.fl_str_mv Khalid A. Fakhro (3158862)
Noha A. Yousri (1392577)
Juan L. Rodriguez-Flores (18806845)
Amal Robay (3158892)
Michelle R. Staudt (18806848)
Francisco Agosto-Perez (519205)
Jacqueline Salit (124697)
Joel A. Malek (10327973)
Karsten Suhre (67967)
Amin Jayyousi (3158868)
Mahmoud Zirie (124704)
Dora Stadler (3158865)
Jason G. Mezey (8813648)
Ronald G. Crystal (8813645)
dc.date.none.fl_str_mv 2015-10-22T03:00:00Z
dc.identifier.none.fl_str_mv 10.1186/s12864-015-1991-5
dc.relation.none.fl_str_mv https://figshare.com/articles/journal_contribution/Copy_number_variations_in_the_genome_of_the_Qatari_population/26018206
dc.rights.none.fl_str_mv CC BY 4.0
info:eu-repo/semantics/openAccess
dc.subject.none.fl_str_mv Biological sciences
Genetics
Biomedical and clinical sciences
Clinical sciences
Copy number variation
Next-generation sequencing
Genotyping
Genomics
Mendelian disease
Qatar
dc.title.none.fl_str_mv Copy number variations in the genome of the Qatari population
dc.type.none.fl_str_mv Text
Journal contribution
info:eu-repo/semantics/publishedVersion
text
contribution to journal
description <p>The populations of the Arabian Peninsula remain the least represented in public genetic databases, both in terms of single nucleotide variants and of larger genomic mutations. We present the first high-resolution copy number variation (CNV) map for a Gulf Arab population, using a hybrid approach that integrates array genotyping intensity data and next-generation sequencing reads to call CNVs in the Qatari population. CNVs were detected in 97 unrelated Qatari individuals by running two calling algorithms on each of two primary datasets: high-resolution genotyping (Illumina Omni 2.5M) and high depth whole-genome sequencing (Illumina PE 100bp). The four call-sets were integrated to identify high confidence CNV regions, which were subsequently annotated for putative functional effect and compared to public databases of CNVs in other populations. The availability of genome sequence was leveraged to identify tagging SNPs in high LD with common deletions in this population, enabling their imputation from genotyping experiments in the future. Genotyping intensities and genome sequencing data from 97 Qataris were analyzed with four different algorithms and integrated to discover 16,660 high confidence CNV regions (CNVRs) in the total population, affecting ~28 Mb in the median Qatari genome. Up to 40 % of all CNVs affected genes, including novel CNVs affecting Mendelian disease genes, segregating at different frequencies in the 3 major Qatari subpopulations, including those with Bedouin, Persian/South Asian, and African ancestry. Consistent with high consanguinity levels in the Bedouin subpopulation, we found an increased burden for homozygous deletions in this group. In comparison to known CNVs in the comprehensive Database of Genomic Variants, we found that 5 % of all CNVRs in Qataris were completely novel, with an enrichment of CNVs affecting several known chromosomal disorder loci and genes known to regulate sugar metabolism and type 2 diabetes in the Qatari cohort. Finally, we leveraged the availability of genome sequence to find suitable tagging SNPs for common deletions in this population. We combine four independently generated datasets from 97 individuals to study CNVs for the first time at high-resolution in a Gulf Arab population.</p><h2>Other Information</h2> <p> Published in: BMC Genomics<br> License: <a href="http://creativecommons.org/licenses/by/4.0/" target="_blank">http://creativecommons.org/licenses/by/4.0/</a><br>See article on publisher's website: <a href="https://dx.doi.org/10.1186/s12864-015-1991-5" target="_blank">https://dx.doi.org/10.1186/s12864-015-1991-5</a></p>
eu_rights_str_mv openAccess
id Manara2_cfaf9b43d977e0393c0013c2eea9e927
identifier_str_mv 10.1186/s12864-015-1991-5
network_acronym_str Manara2
network_name_str Manara2
oai_identifier_str oai:figshare.com:article/26018206
publishDate 2015
repository.mail.fl_str_mv
repository.name.fl_str_mv
repository_id_str
rights_invalid_str_mv CC BY 4.0
spelling Copy number variations in the genome of the Qatari populationKhalid A. Fakhro (3158862)Noha A. Yousri (1392577)Juan L. Rodriguez-Flores (18806845)Amal Robay (3158892)Michelle R. Staudt (18806848)Francisco Agosto-Perez (519205)Jacqueline Salit (124697)Joel A. Malek (10327973)Karsten Suhre (67967)Amin Jayyousi (3158868)Mahmoud Zirie (124704)Dora Stadler (3158865)Jason G. Mezey (8813648)Ronald G. Crystal (8813645)Biological sciencesGeneticsBiomedical and clinical sciencesClinical sciencesCopy number variationNext-generation sequencingGenotypingGenomicsMendelian diseaseQatar<p>The populations of the Arabian Peninsula remain the least represented in public genetic databases, both in terms of single nucleotide variants and of larger genomic mutations. We present the first high-resolution copy number variation (CNV) map for a Gulf Arab population, using a hybrid approach that integrates array genotyping intensity data and next-generation sequencing reads to call CNVs in the Qatari population. CNVs were detected in 97 unrelated Qatari individuals by running two calling algorithms on each of two primary datasets: high-resolution genotyping (Illumina Omni 2.5M) and high depth whole-genome sequencing (Illumina PE 100bp). The four call-sets were integrated to identify high confidence CNV regions, which were subsequently annotated for putative functional effect and compared to public databases of CNVs in other populations. The availability of genome sequence was leveraged to identify tagging SNPs in high LD with common deletions in this population, enabling their imputation from genotyping experiments in the future. Genotyping intensities and genome sequencing data from 97 Qataris were analyzed with four different algorithms and integrated to discover 16,660 high confidence CNV regions (CNVRs) in the total population, affecting ~28 Mb in the median Qatari genome. Up to 40 % of all CNVs affected genes, including novel CNVs affecting Mendelian disease genes, segregating at different frequencies in the 3 major Qatari subpopulations, including those with Bedouin, Persian/South Asian, and African ancestry. Consistent with high consanguinity levels in the Bedouin subpopulation, we found an increased burden for homozygous deletions in this group. In comparison to known CNVs in the comprehensive Database of Genomic Variants, we found that 5 % of all CNVRs in Qataris were completely novel, with an enrichment of CNVs affecting several known chromosomal disorder loci and genes known to regulate sugar metabolism and type 2 diabetes in the Qatari cohort. Finally, we leveraged the availability of genome sequence to find suitable tagging SNPs for common deletions in this population. We combine four independently generated datasets from 97 individuals to study CNVs for the first time at high-resolution in a Gulf Arab population.</p><h2>Other Information</h2> <p> Published in: BMC Genomics<br> License: <a href="http://creativecommons.org/licenses/by/4.0/" target="_blank">http://creativecommons.org/licenses/by/4.0/</a><br>See article on publisher's website: <a href="https://dx.doi.org/10.1186/s12864-015-1991-5" target="_blank">https://dx.doi.org/10.1186/s12864-015-1991-5</a></p>2015-10-22T03:00:00ZTextJournal contributioninfo:eu-repo/semantics/publishedVersiontextcontribution to journal10.1186/s12864-015-1991-5https://figshare.com/articles/journal_contribution/Copy_number_variations_in_the_genome_of_the_Qatari_population/26018206CC BY 4.0info:eu-repo/semantics/openAccessoai:figshare.com:article/260182062015-10-22T03:00:00Z
spellingShingle Copy number variations in the genome of the Qatari population
Khalid A. Fakhro (3158862)
Biological sciences
Genetics
Biomedical and clinical sciences
Clinical sciences
Copy number variation
Next-generation sequencing
Genotyping
Genomics
Mendelian disease
Qatar
status_str publishedVersion
title Copy number variations in the genome of the Qatari population
title_full Copy number variations in the genome of the Qatari population
title_fullStr Copy number variations in the genome of the Qatari population
title_full_unstemmed Copy number variations in the genome of the Qatari population
title_short Copy number variations in the genome of the Qatari population
title_sort Copy number variations in the genome of the Qatari population
topic Biological sciences
Genetics
Biomedical and clinical sciences
Clinical sciences
Copy number variation
Next-generation sequencing
Genotyping
Genomics
Mendelian disease
Qatar