Extensive Alignment Dataset of COVID-19 Gene Primers and Probes Across SARS-CoV-2 Variants

Bibliographic Details
Main Author: Nahla O. Mousa (10470069) (author)
Other Authors: Marwan Osama (19194868) (author)
Published: 2025
Description
Summary: This dataset contains a comprehensive analysis of COVID-19 genetic sequences focused on four key genes: Spike Glycoprotein, Envelope Protein, Nucleocapsid Protein, and 3' UTR (3' untranslated region). It comprises 20 Excel files, each holding 100,000 samples. A Python script was used to evaluate primer sets by local alignment against sequences from the NCBI database, with lineages determined by the Pangolin tool. Each Excel file contains four sheets, one per gene, with columns for accession ID, sample name, primer sequences, alignment metrics, and lineage. The dataset includes primer analysis for 2 million sequences across all genes and probe analysis for 1 million sequences in a separate set of 10 Excel files. An additional Excel file gives, for each lineage, the count of samples to which the primers and probes were aligned.
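
The record does not include the authors' script, so the following is only a minimal sketch of what scoring a primer against a genome by local alignment can look like. It uses a plain Smith-Waterman recurrence with a linear gap penalty; the function name local_align_score and the scoring values (match=2, mismatch=-1, gap=-2) are assumptions made for illustration and are not taken from the dataset.

```python
def local_align_score(primer, target, match=2, mismatch=-1, gap=-2):
    """Smith-Waterman local alignment score of `primer` against `target`.

    Returns (best_score, end_pos), where end_pos is the 1-based position in
    `target` at which the best-scoring local alignment ends.
    Scoring values are illustrative assumptions, not the dataset's settings.
    """
    prev = [0] * (len(target) + 1)  # previous row of the DP matrix
    best_score, best_end = 0, 0
    for i in range(1, len(primer) + 1):
        curr = [0] * (len(target) + 1)
        for j in range(1, len(target) + 1):
            same = primer[i - 1] == target[j - 1]
            diag = prev[j - 1] + (match if same else mismatch)
            # Local alignment: a cell never drops below zero.
            curr[j] = max(0, diag, prev[j] + gap, curr[j - 1] + gap)
            if curr[j] > best_score:
                best_score, best_end = curr[j], j
        prev = curr
    return best_score, best_end


if __name__ == "__main__":
    # Hypothetical toy sequences, not real primers or genomes.
    print(local_align_score("ACGT", "TTACGTTT"))
```

A perfect primer match scores match times the primer length, so dividing the returned score by that maximum gives one simple alignment metric. Which metrics the dataset's "alignment metrics" columns actually hold is not stated in this record.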