1. Collection of SARS-CoV-2 genome sequences and key contextual information (metadata)
To build our database, we retrieved from GISAID the full sequence and metadata files for 13,344,494 SARS-CoV-2 genomes that passed quality filtering. These genomes were collected between December 2019 and February 2023. They comprise 2,735 viral lineages, were submitted from 218 geographical regions, and originate from 35 different hosts.
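As a rough illustration of how summary figures of this kind (genome count, distinct lineages, regions, and hosts within a collection window) could be derived from a metadata table, here is a minimal sketch. The column names `collection_date`, `lineage`, `region`, and `host`, the function `summarize_metadata`, and the three sample rows are all hypothetical; they are not the GISAID schema or the COV2Var pipeline.

```python
import csv
import io
from datetime import date

# Hypothetical metadata excerpt; column names and values are illustrative only.
SAMPLE_METADATA = """collection_date,lineage,region,host
2020-03-14,B.1,Europe,Human
2021-12-02,BA.1,Africa,Human
2021-03,B.1.1.7,Europe,Neovison vison
"""

def summarize_metadata(rows, start=date(2019, 12, 1), end=date(2023, 2, 28)):
    """Count genomes and distinct lineages, regions, and hosts in a date window."""
    lineages, regions, hosts = set(), set(), set()
    genomes = 0
    for row in rows:
        try:
            collected = date.fromisoformat(row["collection_date"])
        except ValueError:
            continue  # skip records whose date is incomplete, e.g. "2021-03"
        if not (start <= collected <= end):
            continue  # outside the collection window
        genomes += 1
        lineages.add(row["lineage"])
        regions.add(row["region"])
        hosts.add(row["host"])
    return {
        "genomes": genomes,
        "lineages": len(lineages),
        "regions": len(regions),
        "hosts": len(hosts),
    }

summary = summarize_metadata(csv.DictReader(io.StringIO(SAMPLE_METADATA)))
print(summary)
```

In this sketch, records with an incomplete collection date are dropped, which is one possible form of quality filtering; the actual filtering criteria used for the database are not described here.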
2. Understanding COV2Var's annotation categories
Search page, example: N501Y (Protein level)
Search result table
Mutation annotation result page
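The protein-level search example N501Y uses the common substitution notation: reference amino acid, position in the protein, substituted amino acid. A minimal sketch of splitting such a query into its parts follows. The function name `parse_protein_mutation` is hypothetical, it handles simple substitutions only (not deletions, insertions, or stop codons), and it is not COV2Var's own search logic.

```python
import re

# One-letter codes of the 20 standard amino acids.
AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"
_SUBSTITUTION = re.compile(rf"^([{AMINO_ACIDS}])(\d+)([{AMINO_ACIDS}])$")

def parse_protein_mutation(text):
    """Split a substitution such as 'N501Y' into (reference, position, alternate)."""
    match = _SUBSTITUTION.match(text.strip().upper())
    if match is None:
        raise ValueError(f"not a simple amino acid substitution: {text!r}")
    reference, position, alternate = match.groups()
    return reference, int(position), alternate

parsed = parse_protein_mutation("N501Y")
print(parsed)  # ('N', 501, 'Y')
```

Here N501Y is read as asparagine (N) at position 501 replaced by tyrosine (Y).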
1) Summary information of mutation category
2) Analyzing the distribution of mutation across geographic regions, temporal trends, and lineages category
3) Examining mutation found in abundant sequences of non-human animal hosts category
4) Investigating the association between mutation and patients of different ages, genders, and infection status category
5) Investigating natural selection at mutation site for genetic adaptation and diversity category
6) Alterations in protein physicochemical properties induced by mutation category
7) Alterations in protein stability induced by mutation category
8) Impact on protein function induced by mutation category
9) Exploring mutation distribution within intrinsically disordered protein regions category
10) Alterations in enzyme cleavage sites induced by mutation category
11) Impact of spike protein mutation on antigenicity and immunogenicity category
12) Impact of mutation on viral transmissibility by the affinity between RBD and ACE2 receptor category
13) Impact of mutation on immune escape by the affinity between RBD and antibody/serum category
14) Investigating the co-mutation patterns of SARS-CoV-2 across 2,735 viral lineages category
15) Manual curation of mutation-related literature from PubMed category
Download data and contact us
All related files can be downloaded from the download page.
Users can contact us with suggestions and comments through the contact page.