Back to top

The current mutation

ID: V8255
DNA: 18204A>G
Protein: G6068G
Position: 18468








COV2Var annotation categories







Summary information of mutation (18204A>G)

Basic Information about Mutation.

  Gene Information   Gene ID   GU280_gp01_pp1ab
  Gene Name   ORF1ab_pp1ab
  Gene Type   protein_coding
  Genome position   18468
  Reference genome   GenBank ID: NC_045512.2
  Mutation type   synonymous_variant
  DNA Level   DNA Mutation: 18204A>G
  Ref Seq: A
  Mut Seq: G
  Protein Level   Protein 1-letter Mutation: G6068G
  Protein 3-letter Mutation: Gly6068Gly

Overview of the genomic positions of Mutation.
Note: The annotated 12 genes were retrieved from GeneBank (Accession: NC_045512.2). "MP" represents genomic position of mutation.





Analyzing the distribution of mutation (18204A>G) across geographic regions, temporal trends, and lineages

The count of genome sequences harboring this mutation and its distribution across global regions offer insights into regional variations.
Note: The distribution of mutation across 218 geographical regions. Color representation of genome sequence counts. The data is obtained from GISAID's metadata, specifically capturing the regional distribution of genomic sequences.



The dynamic count of genome sequences containing this mutation over time.
Note: Clicking the "Count" or "Cumulative Count" button toggles the view. Count represents the number of genome sequences per month. Cumulative count represents the accumulated total count up to the respective month. The data is obtained from GISAID's metadata, specifically capturing the collection date of genomic sequences.



For every time point represented in the graph above, identifying the top 3 lineages with the highest count of genome sequences carrying this mutation aids in pinpointing noteworthy lineages for further analysis.
Note: Users can filter the lineages by entering a "Year-Month" term in the search box. For example, entering 2020-01 will display lineages that appeared in January 2020. The data is obtained from GISAID's metadata, specifically capturing the collection date of genomic sequences.

Collection date Lineage Total lineage monthly counts Lineage-specific monthly counts Lineage-specific monthly frequency
2020-10 B.1.177 3 2 6.67e-1
2020-10 AY.16 3 1 3.33e-1
2020-11 B.1.349 2 2 1.00e+0
2020-12 AS.1 6 2 3.33e-1
2020-12 B.1.1.316 6 2 3.33e-1
2020-12 B.1.349 6 2 3.33e-1
2020-04 B.1.1 2 2 1.00e+0
2020-05 B.1 1 1 1.00e+0
2020-06 B.1 1 1 1.00e+0
2020-07 B.1.335 1 1 1.00e+0
2020-08 B.1 1 1 1.00e+0
2020-09 B.1.468 1 1 1.00e+0
2021-01 B.1.349 24 9 3.75e-1
2021-01 B.1.177 24 6 2.50e-1
2021-01 AY.16 24 4 1.67e-1
2021-10 AY.13 574 348 6.06e-1
2021-10 AY.16 574 76 1.32e-1
2021-10 B.1.617.2 574 44 7.67e-2
2021-11 AY.13 394 169 4.29e-1
2021-11 AY.16 394 116 2.94e-1
2021-11 AY.122 394 38 9.64e-2
2021-12 AY.13 284 120 4.23e-1
2021-12 AY.16 284 89 3.13e-1
2021-12 AY.122 284 39 1.37e-1
2021-02 B.1.349 32 8 2.50e-1
2021-02 AY.16 32 7 2.19e-1
2021-02 B.1.177 32 5 1.56e-1
2021-03 AY.16 65 36 5.54e-1
2021-03 B.1.1.7 65 13 2.00e-1
2021-03 B.1.617.2 65 4 6.15e-2
2021-04 AY.16 467 375 8.03e-1
2021-04 B.1.617.2 467 58 1.24e-1
2021-04 B.1.1.7 467 20 4.28e-2
2021-05 AY.16 829 412 4.97e-1
2021-05 B.1.617.2 829 170 2.05e-1
2021-05 AY.13 829 149 1.80e-1
2021-06 AY.13 1418 662 4.67e-1
2021-06 AY.16 1418 324 2.28e-1
2021-06 B.1.617.2 1418 231 1.63e-1
2021-07 AY.13 3346 2475 7.40e-1
2021-07 AY.16 3346 481 1.44e-1
2021-07 AY.16.1 3346 221 6.60e-2
2021-08 AY.13 3205 1978 6.17e-1
2021-08 AY.16 3205 790 2.46e-1
2021-08 AY.16.1 3205 185 5.77e-2
2021-09 AY.13 1224 622 5.08e-1
2021-09 AY.16 1224 372 3.04e-1
2021-09 B.1.617.2 1224 98 8.01e-2
2022-01 BA.1.1 46 15 3.26e-1
2022-01 AY.16 46 6 1.30e-1
2022-01 AY.13 46 5 1.09e-1
2022-10 BF.7 16 10 6.25e-1
2022-10 BA.5.2.39 16 3 1.88e-1
2022-10 BA.5.2.3 16 2 1.25e-1
2022-11 BQ.1.23 17 7 4.12e-1
2022-11 BF.7 17 3 1.76e-1
2022-11 BA.5.2.39 17 2 1.18e-1
2022-12 BQ.1.23 30 11 3.67e-1
2022-12 BQ.1.1 30 6 2.00e-1
2022-12 BF.5 30 3 1.00e-1
2022-02 BA.2 77 39 5.06e-1
2022-02 BA.1.1 77 29 3.77e-1
2022-02 BA.1.1.18 77 3 3.90e-2
2022-03 BA.2 57 53 9.30e-1
2022-03 BA.1.1 57 3 5.26e-2
2022-03 BA.2.85 57 1 1.75e-2
2022-04 BA.2 27 20 7.41e-1
2022-04 BA.2.16 27 2 7.41e-2
2022-04 BA.2.9 27 2 7.41e-2
2022-05 BA.2 46 38 8.26e-1
2022-05 BA.2.3.1 46 2 4.35e-2
2022-05 BA.2.9 46 2 4.35e-2
2022-06 BA.2 8 7 8.75e-1
2022-06 BA.2.44 8 1 1.25e-1
2022-07 BA.4 3 2 6.67e-1
2022-07 BA.5.1.2 3 1 3.33e-1
2022-08 BA.5.2.3 5 2 4.00e-1
2022-08 BA.5.8 5 2 4.00e-1
2022-08 CA.3 5 1 2.00e-1
2022-09 BF.7 10 7 7.00e-1
2022-09 BA.5.1.23 10 1 1.00e-1
2022-09 BA.5.2.3 10 1 1.00e-1
2023-01 BF.7.5 14 6 4.29e-1
2023-01 BF.7.4.1 14 2 1.43e-1
2023-01 XBB.1 14 2 1.43e-1
2023-02 BQ.1.1 3 1 3.33e-1
2023-02 XBB.1 3 1 3.33e-1
2023-02 XBB.1.5 3 1 3.33e-1

The count of genome sequences and the frequency of this mutation in each lineage.
Note: Displaying mutation frequencies (>0.01) among 2,735 lineages. Mutation Count represents the count of sequences carrying this mutation. Users can filter the lineages by entering a search term in the search box. For example, entering "A.1" will display A.1 lineages. The data is obtained from GISAID's metadata, specifically capturing the lineage of genomic sequences. Mutation count: Count of sequences carrying this mutation.

Mutation ID Lineage Mutation frequency Mutation count Earliest lineage emergence Latest lineage emergence
V8255 AY.16.1 9.92e-1 515 2021-5-21 2021-11-29
V8255 AY.16 9.73e-1 3089 2020-10-26 2022-1-17
V8255 B.1.349 1.63e-2 23 2020-6-5 2021-8-21
V8255 AY.13 9.81e-1 6540 2021-1-11 2022-1-21
V8255 AY.18 9.91e-1 106 2021-5-11 2021-7-20
V8255 BA.5.2.39 2.58e-2 5 2022-8-18 2023-2-10
V8255 XW 1.14e-2 2 2022-3-9 2022-6-30






Examining mutation (18204A>G) found in abundant sequences of non-human animal hosts

Exploring mutation presence across 35 non-human animal hosts for cross-species transmission.
Note: We retained the mutation that appear in at least three non-human animal hosts' sequences. The data is obtained from GISAID's metadata, specifically capturing the host of genomic sequences.

Animal host Lineage Source region Collection date Accession ID




Association between mutation (18204A>G) and patients of different ages, genders, and statuses

Note: The logistic regression model was employed to examine changes in patient data before and after the mutation. The logistic regression model was conducted using the glm function in R. The data is obtained from GISAID's metadata, specifically capturing the patient status, gender, and age of genomic sequences.

Analyzing the association between mutation and patient status.
Note: we categorized the data into different patient statuses (ambulatory, deceased, homebound, hospitalized, mild, and recovered) based on GISAID classifications. In the analysis exploring the association between mutation and patient status, the model included mutation, patient status, patient age, gender, sequence region of origin, and sequence collection time point. In the 'increase' direction of the mutation, it means that when this mutation occurs, it increases the corresponding effect proportion. In the 'decrease' direction of the mutation, it means that when this mutation occurs, it decreases the corresponding effect proportion. A p-value lower than 0.001 signifies a notable differentiation between the population with and without the mutation.

Attribute Effect Estimate SE Z-value P-value Direction
Patient status Ambulatory -1.06e+0 1.09e+0 -9.77e-1 3.28e-1 Decrease
Deceased 1.47e-1 1.73e+0 8.54e-2 9.32e-1 Increase
Homebound 1.12e-15 8.47e+4 1.32e-20 1.00e+0 Increase
Hospitalized 7.27e-1 6.62e-1 1.10e+0 2.72e-1 Increase
Mild 2.05e+1 1.20e+4 1.70e-3 9.99e-1 Increase
Recovered -6.55e-1 6.14e-1 -1.07e+0 2.86e-1 Decrease

Analyzing the association between mutation and patient status.
Note: we categorized the data into different patient age (0-17, 18-39, 40-64, 65-84, and 85+). In the analysis exploring the association between mutation and patient age, the model included mutation, patient age, gender, sequence region of origin, and sequence collection time point. In the 'increase' direction of the mutation, it means that when this mutation occurs, it increases the corresponding effect proportion. In the 'decrease' direction of the mutation, it means that when this mutation occurs, it decreases the corresponding effect proportion. A p-value lower than 0.001 signifies a notable differentiation between the population with and without the mutation.

Attribute Effect Estimate SE Z-value P-value Direction
Patient age, years 0-17 4.47e-1 1.41e-1 3.17e+0 1.55e-3 Increase
18-39 4.22e-1 7.57e-2 5.57e+0 2.51e-8 Increase
40-64 -2.93e-1 7.44e-2 -3.94e+0 8.16e-5 Decrease
65-84 -3.16e-1 1.08e-1 -2.92e+0 3.51e-3 Decrease
>=85 -9.90e-1 1.93e-1 -5.12e+0 3.03e-7 Decrease

Analyzing the association between mutation and patient status.
Note: we categorized the data into different patient gender (male and female). In the analysis exploring the association between mutation and patient gender, the model included mutation, patient gender, patient age, sequence region of origin, and sequence collection time point. In the 'increase' direction of the mutation, it means that when this mutation occurs, it increases the corresponding effect proportion. In the 'decrease' direction of the mutation, it means that when this mutation occurs, it decreases the corresponding effect proportion. A p-value lower than 0.001 signifies a notable differentiation between the population with and without the mutation.

Attribute Effect Estimate SE Z-value P-value Direction
Patient gender Male -4.70e-2 7.25e-2 -6.48e-1 5.17e-1 Decrease





Investigating natural selection at mutation (18204A>G) site for genetic adaptation and diversity

Note: Investigating the occurrence of positive selection or negative selection at this mutation site reveals implications for genetic adaptation and diversity.

The MEME method within the HyPhy software was employed to analyze positive selection. MEME: episodic selection.
Note: List of sites found to be under episodic selection by MEME (p < 0.05). "Protein Start" corresponds to the protein's starting genomic position. "Protein End" corresponds to the protein's ending genomic position. The term 'site' represents a selection site within the protein.

Protein name Protein start Protein end Protein length Site P-value Lineage Method

The FEL method within the HyPhy software was employed to analyze both positive and negative selection. FEL: pervasive selection on samll datasets.
Note: List of sites found to be under pervasive selection by FEL (p < 0.05). A beta value greater than alpha signifies positive selection, while a beta value smaller than alpha signifies negative selection. "Protein Start" corresponds to the protein's starting genomic position. "Protein End" corresponds to the protein's ending genomic position. The term 'site' represents a selection site within the protein.

Protein name Protein start Protein end Protein length Site Alpha Beta P-value Lineage Method

The FUBAR method within the HyPhy software was employed to analyze both positive and negative selection. FUBAR: pervasive selection on large datasets.
Note: List of sites found to be under pervasive selection by FUBAR (prob > 0.95). A prob[alpha < beta] value exceeding 0.95 indicates positive selection, while a prob[alpha > beta] value exceeding 0.95 indicates negative selection. "Protein Start" corresponds to the protein's starting genomic position. "Protein End" corresponds to the protein's ending genomic position. The term 'site' represents a selection site within the protein.

Protein name Protein start Protein end Protein length Site Prob[alpha>beta] Prob[alpha<beta] Lineage Method




Alterations in protein physicochemical properties induced by mutation (18204A>G)

Understanding the alterations in protein physicochemical properties can reveal the evolutionary processes and adaptive changes of viruses
Note: ProtParam software was used for the analysis of physicochemical properties. Significant change threshold: A change exceeding 10% compared to the reference is considered a significant change. "GRAVY" is an abbreviation for "grand average of hydropathicity".

Group Protein name Molecular weight Theoretical PI Extinction coefficients Aliphatic index GRAVY




Alterations in protein stability induced by mutation (18204A>G)

The impact of mutations on protein stability directly or indirectly affects the biological characteristics, adaptability, and transmission capacity of the virus
Note: iMutant 2.0 was utilized to analyze the effects of mutations on protein stability. pH 7 and a temperature of 25°C are employed to replicate the in vitro environment. pH 7.4 and a temperature of 37°C are utilized to simulate the in vivo environment.

Mutation Protein name Mutation type Position ΔDDG Stability pH Temperature Condition




Impact on protein function induced by mutation (18204A>G)

The impact of mutations on protein function
Note: The MutPred2 software was used to predict the pathogenicity of a mutation and gives the molecular mechanism of pathogenicity. A score above 0.5 indicates an increased likelihood of pathogenicity. "Pr" is the abbreviation for "proportion. P" is the abbreviation for "p-value.

Mutation Protein name Mutation type Score Molecular mechanisms




Exploring mutation (18204A>G) distribution within intrinsically disordered protein regions

Intrinsically Disordered Proteins (IDPs) which refers to protein regions that have no unique 3D structure. In viral proteins, mutations in the disordered regions s are critical for immune evasion and antibody escape, suggesting potential additional implications for vaccines and monoclonal therapeutic strategies.
Note: The iupred3 software was utilized for analyzing IDPs. A score greater than 0.5 is considered indicative of an IDP. In the plot, "POS" represents the position of the mutation.





Alterations in enzyme cleavage sites induced by mutation (18204A>G)

Exploring the impact of mutations on the cleavage sites of 28 enzymes.
Note: The PeptideCutter software was used for detecting enzymes cleavage sites. The increased enzymes cleavage sites refer to the cleavage sites in the mutated protein that are added compared to the reference protein. Conversely, the decreased enzymes cleavage sites indicate the cleavage sites in the mutated protein that are reduced compared to the reference protein.

Mutation Protein name Genome position Enzyme name Increased cleavage sites Decreased cleavage sites




Impact of spike protein mutation (18204A>G) on antigenicity and immunogenicity

Investigating the impact of mutations on antigenicity and immunogenicity carries important implications for vaccine design and our understanding of immune responses.
Note: An absolute change greater than 0.0102 (three times the median across sites) in antigenicity score is considered significant. An absolute changegreater than 0.2754 (three times the median across sites) in immunogenicity score is considered significant. The VaxiJen tool was utilized for antigenicity analysis. The IEDB tool was used for immunogenicity analysis. Antigens with a prediction score of more than 0.4 for this tool are considered candidate antigens. MHC I immunogenicity score >0, indicating a higher probability to stimulate an immune response.

Group Protein name Protein region Antigenicity score Immunogenicity score




Impact of mutation (18204A>G) on viral transmissibility by the affinity between RBD and ACE2 receptor

Unraveling the impact of mutations on the interaction between the receptor binding domain (RBD) and ACE2 receptor using deep mutational scanning (DMS) experimental data to gain insights into their effects on viral transmissibility.
Note: The ΔBinding affinity represents the disparity between the binding affinity of a mutation and the reference binding affinity. A positive Δbinding affinity value (Δlog10(KD,app) > 0) signifies an increased affinity between RBD and ACE2 receptor due to the mutation. Conversely, a negative value (Δlog10(KD,app) < 0) indicates a reduced affinity between RBD and ACE2 receptor caused by the mutation. A p-value smaller than 0.05 indicates significance. "Ave mut bind" represents the average binding affinity of this mutation. "Ave ref bind" refers to the average binding affinity at a site without any mutation (reference binding affinity).

;
Mutation Protein name Protein region Mutation Position Ave mut bind Ave ref bind ΔBinding affinity P-value Image


The interface between the receptor binding domain (RBD) and ACE2 receptor is depicted in the crystal structure 6JM0.
Note: The structure 6M0J encompasses the RBD range of 333 to 526. The binding sites (403-406, 408, 417, 439, 445-447, 449, 453, 455-456, 473-478, 484-498, and 500-506) on the RBD that interface with ACE2 are indicated in magenta. The binding sites on the RBD that have been identified through the interface footprints experiment. The ACE2 binding sites within the interface are shown in cyan, representing residues within 5Å proximity to the RBD binding sites. The mutation within the RBD range of 333 to 526 is depicted in red.

        Show as:

        Show interface residues:





Impact of mutation (18204A>G) on immune escape by the affinity between RBD and antibody/serum

By utilizing experimental data from deep mutational scanning (DMS), we can uncover how mutations affect the interaction between the receptor binding domain (RBD) and antibodies/serum. This approach provides valuable insights into strategies for evading the host immune response.
Note: We considered a mutation to mediate strong escape if the escape score exceeded 0.1 (10% of the maximum score of 1). A total of 1,504 antibodies/serum data were collected for this analysis. "Condition name" refers to the name of the antibodies/serum. "Mut escape score" represents the escape score of the mutation in that specific condition. "Avg mut escape score" indicates the average escape score of the mutation site in that condition, considering the occurrence of this mutation and other mutations. Class 1 antibodies bind to an epitope only in the RBD “up” conformation, and are the most abundant. Class 2 antibodies bind to the RBD both in “up” and “down” conformations. Class 3 and class 4 antibodies both bind outside the ACE2 binding site. Class 3 antibodies bind the RBD in both the open and closed conformation, while class 4 antibodies bind only in the open conformation.

Mutation Condition name Condition type Condition subtype Condition year Mut escape score Avg mut escape score




Investigating the co-mutation patterns of mutation (18204A>G) across 2,735 viral lineages

Investigating the co-mutation patterns of SARS-CoV-2 across 2,735 viral lineages to unravel the cooperative effects of different mutations. In biological research, correlation analysis of mutation sites helps us understand whether there is a close relationship or interaction between certain mutations.
Note: The Spearman correlation coefficient is used to calculate the correlation between two mutations within each Pango lineage. Holm–Bonferroni method was used for multiple test adjustment. We retained mutation pairs with correlation values greater than 0.6 or less than -0.6 and Holm–Bonferroni corrected p-values less than 0.05.

Associated mutation ID DNA mutation Mutation type Protein name Protein mutation correlation coefficient Lineage
V7059 8940T>C synonymous_variant ORF1ab_pp1a D2980D 6.00e-1 AY.25.1
V8835 1626C>T synonymous_variant S N542N 6.39e-1 AY.25.1
V1358 5659G>A missense_variant ORF1ab_pp1a V1887I 6.03e-1 P.1
V6391 3792T>C synonymous_variant ORF1ab_pp1a H1264H 1.00e+0 AY.98
V2180 11090C>T missense_variant ORF1ab_pp1a A3697V 6.91e-1 BF.7
V1337 5483C>T missense_variant ORF1ab_pp1a T1828I 7.07e-1 AY.107
V3006 17842G>T missense_variant ORF1ab_pp1ab A5948S 7.07e-1 AY.107
V3550 74C>T missense_variant S P25L 7.07e-1 AY.107
V4034 2063C>T missense_variant S A688V 7.07e-1 AY.107
V6425 4035G>T synonymous_variant ORF1ab_pp1a V1345V 7.07e-1 AY.107
V7313 11043C>T synonymous_variant ORF1ab_pp1a D3681D 7.07e-1 AY.107
V1750 7982C>T missense_variant ORF1ab_pp1a S2661F 7.07e-1 AY.111
V2563 13925C>T missense_variant ORF1ab_pp1ab A4642V 1.00e+0 AY.111
V6723 6198G>T synonymous_variant ORF1ab_pp1a V2066V 7.07e-1 AY.111
V7053 8904C>T synonymous_variant ORF1ab_pp1a Y2968Y 1.00e+0 AY.111
V8154 17469C>T synonymous_variant ORF1ab_pp1ab F5823F 7.07e-1 AY.111
V9624 333C>T synonymous_variant N Y111Y 1.00e+0 AY.111
V9633 384C>T synonymous_variant N D128D 1.00e+0 AY.111
V2860 16775A>G missense_variant ORF1ab_pp1ab N5592S 9.13e-1 AY.119
V1681 7585G>A missense_variant ORF1ab_pp1a A2529T 1.00e+0 AY.15
V4785 524C>T missense_variant M T175M 7.07e-1 AY.15
V1972 9626C>T missense_variant ORF1ab_pp1a A3209V 8.65e-1 AY.16.1
V1982 9764C>T missense_variant ORF1ab_pp1a T3255I -7.06e-1 AY.16.1
V2395 12646G>A missense_variant ORF1ab_pp1a V4216I 6.09e-1 AY.16.1
V2880 16880G>T missense_variant ORF1ab_pp1ab R5627L 1.00e+0 AY.1
V4454 280C>T missense_variant ORF3a L94F 1.00e+0 AY.1
V7528 12681T>C synonymous_variant ORF1ab_pp1a Y4227Y 7.07e-1 AY.1
V8108 17142A>G synonymous_variant ORF1ab_pp1ab R5714R 1.00e+0 AY.1
V3112 18449C>T missense_variant ORF1ab_pp1ab A6150V 1.00e+0 AY.27
V952 3227A>G missense_variant ORF1ab_pp1a N1076S 7.07e-1 AY.27
V9149 102G>A synonymous_variant ORF3a T34T 1.00e+0 AY.34.1
V9676 573C>T synonymous_variant N R191R 8.16e-1 AY.34.1
V1175 4498C>T missense_variant ORF1ab_pp1a H1500Y 6.86e-1 AY.36
V2119 10838C>A missense_variant ORF1ab_pp1a P3613H 6.86e-1 AY.36
V3576 199G>T missense_variant S A67S 6.71e-1 AY.36
V7564 12960C>T synonymous_variant ORF1ab_pp1a S4320S 1.00e+0 AY.4.2.1
V1763 8069C>T missense_variant ORF1ab_pp1a A2690V 9.16e-1 AY.46
V2857 16755G>T missense_variant ORF1ab_pp1ab E5585D 7.00e-1 AY.46
V3023 17912C>T missense_variant ORF1ab_pp1ab P5971L 9.02e-1 AY.46
V4994 346C>T missense_variant ORF7a L116F 8.57e-1 AY.46
V5780 *4360G>A downstream_gene_variant S None 9.64e-1 AY.46
V7503 12513C>T synonymous_variant ORF1ab_pp1a Y4171Y 9.47e-1 AY.46
V7528 12681T>C synonymous_variant ORF1ab_pp1a Y4227Y 9.02e-1 AY.46
V8473 19998A>G synonymous_variant ORF1ab_pp1ab L6666L 8.17e-1 AY.46
V3201 19079C>T missense_variant ORF1ab_pp1ab A6360V 1.00e+0 AY.4.7
V2571 13961C>T missense_variant ORF1ab_pp1ab T4654I 7.07e-1 AY.53
V6866 7467C>T synonymous_variant ORF1ab_pp1a Y2489Y 7.07e-1 AY.53
V8034 16605C>T synonymous_variant ORF1ab_pp1ab Y5535Y 7.07e-1 AY.53
V7356 11403C>T synonymous_variant ORF1ab_pp1a N3801N 7.77e-1 AY.54
V8108 17142A>G synonymous_variant ORF1ab_pp1ab R5714R 7.46e-1 AY.54
V9037 3195G>T synonymous_variant S V1065V 6.56e-1 AY.54
V9722 822C>T synonymous_variant N F274F 6.12e-1 AY.54
V2597 14161C>A missense_variant ORF1ab_pp1ab L4721I 1.00e+0 AY.78
V9198 336C>T synonymous_variant ORF3a V112V 7.07e-1 AY.78
V9708 750T>C synonymous_variant N S250S 1.00e+0 AY.78
V1750 7982C>T missense_variant ORF1ab_pp1a S2661F 9.35e-1 AY.92
V2563 13925C>T missense_variant ORF1ab_pp1ab A4642V 8.45e-1 AY.92
V4608 717G>T missense_variant ORF3a E239D 9.66e-1 AY.92
V5960 669C>T synonymous_variant ORF1ab_pp1a D223D 7.71e-1 AY.92
V6655 5766C>T synonymous_variant ORF1ab_pp1a N1922N 8.81e-1 AY.92
V6723 6198G>T synonymous_variant ORF1ab_pp1a V2066V 9.07e-1 AY.92
V7053 8904C>T synonymous_variant ORF1ab_pp1a Y2968Y 8.15e-1 AY.92
V8154 17469C>T synonymous_variant ORF1ab_pp1ab F5823F 9.28e-1 AY.92
V100 -22C>T upstream_gene_variant ORF1ab_pp1a None 1.00e+0 AY.98.1
V1706 7760C>T missense_variant ORF1ab_pp1a A2587V 7.07e-1 B.1.1.284
V1928 9214G>T missense_variant ORF1ab_pp1a G3072C 1.00e+0 B.1.1.316
V320 649G>A missense_variant ORF1ab_pp1a E217K 1.00e+0 B.1.1.316
V4252 3624G>T missense_variant S Q1208H 1.00e+0 B.1.1.316
V5415 518C>T missense_variant N A173V 1.00e+0 B.1.1.316
V7795 14661C>T synonymous_variant ORF1ab_pp1ab V4887V 1.00e+0 B.1.1.316
V932 3161C>T missense_variant ORF1ab_pp1a P1054L 1.00e+0 B.1.1.317
V3499 21167C>T missense_variant ORF1ab_pp1ab A7056V 6.56e-1 B.1.349
V2611 14320G>A missense_variant ORF1ab_pp1ab A4774T 7.07e-1 B.1.351
V4197 3347C>A missense_variant S T1116N 7.07e-1 B.1.351
V2113 10810T>C missense_variant ORF1ab_pp1a F3604L 8.02e-1 B.1.526
V9170 186C>T synonymous_variant ORF3a I62I 6.07e-1 B.1.526
V9727 837A>T synonymous_variant N P279P 7.64e-1 B.1.526
V5122 134G>T missense_variant ORF8 W45L 7.07e-1 B.1.617.1
V4016 2033C>T missense_variant S T678I 1.00e+0 BA.2.16
V7496 12468C>T synonymous_variant ORF1ab_pp1a A4156A 8.16e-1 BA.2.16
V7203 10014C>T synonymous_variant ORF1ab_pp1a L3338L 6.54e-1 BA.5.2.3
V1490 6274C>T missense_variant ORF1ab_pp1a H2092Y 7.07e-1 BA.5.8
V5114 112C>T missense_variant ORF8 P38S 7.07e-1 BA.5.8
V7612 13287C>T synonymous_variant ORF1ab_pp1ab I4429I 8.16e-1 BA.5.8
V2261 11675T>C missense_variant ORF1ab_pp1a V3892A 1.00e+0 BF.1
V305 601A>G missense_variant ORF1ab_pp1a I201V 1.00e+0 BF.1
V3634 425_433delGTGTTTATT disruptive_inframe_deletion S G142_Y145delinsD 1.00e+0 BF.1
V7915 15666T>C synonymous_variant ORF1ab_pp1ab P5222P 1.00e+0 BF.1
V8663 250C>T synonymous_variant S L84L 1.00e+0 BF.1
V9614 249A>G synonymous_variant N Q83Q 1.00e+0 BF.1
V7196 9960A>G synonymous_variant ORF1ab_pp1a L3320L 1.00e+0 BN.1.2
V1334 5465C>T missense_variant ORF1ab_pp1a T1822I 7.07e-1 BQ.1.1.23
V2942 17351A>G missense_variant ORF1ab_pp1ab K5784R 7.07e-1 BQ.1.1.23
V5596 1085C>T missense_variant N T362I 8.84e-1 BQ.1.23
V8039 16651C>T synonymous_variant ORF1ab_pp1ab L5551L 9.49e-1 BQ.1.23
V1807 8380C>T missense_variant ORF1ab_pp1a H2794Y 1.00e+0 BQ.1.3
V9127 3771T>C synonymous_variant S D1257D 1.00e+0 BQ.1.3
V5731 *4317G>A downstream_gene_variant S None 7.07e-1 CA.3
V1765 8087C>T missense_variant ORF1ab_pp1a A2696V 6.67e-1 XBB.1
V5914 411C>T synonymous_variant ORF1ab_pp1a G137G 6.32e-1 XBB.1
V2819 16486C>T missense_variant ORF1ab_pp1ab P5496S 1.00e+0 B.1.468
V3812 925G>C missense_variant S E309Q 1.00e+0 B.1.468
V5937 558C>T synonymous_variant ORF1ab_pp1a V186V 7.05e-1 B.1.468
V7353 11388C>T synonymous_variant ORF1ab_pp1a L3796L 1.00e+0 B.1.468
V7397 11676C>T synonymous_variant ORF1ab_pp1a V3892V 7.05e-1 B.1.468
V7553 12873C>T synonymous_variant ORF1ab_pp1a I4291I 7.05e-1 B.1.468
V3693 541G>A missense_variant S G181R 1.00e+0 B.1.538
V3987 1919C>T missense_variant S S640F 1.00e+0 B.1.538
V4134 2816C>T missense_variant S S939F 1.00e+0 B.1.538
V4331 67G>T missense_variant ORF3a A23S 1.00e+0 B.1.538
V5462 608G>T missense_variant N R203M 1.00e+0 B.1.538
V6010 999G>T synonymous_variant ORF1ab_pp1a T333T 1.00e+0 B.1.538
V6102 1708C>T synonymous_variant ORF1ab_pp1a L570L 1.00e+0 B.1.538
V6162 2130C>A synonymous_variant ORF1ab_pp1a V710V 1.00e+0 B.1.538
V6452 4269C>T synonymous_variant ORF1ab_pp1a Y1423Y 7.03e-1 B.1.538
V6890 7671G>T synonymous_variant ORF1ab_pp1a A2557A 7.03e-1 B.1.538
V7911 15642G>A synonymous_variant ORF1ab_pp1ab Q5214Q 7.03e-1 B.1.538
V8132 17286C>T synonymous_variant ORF1ab_pp1ab L5762L 1.00e+0 B.1.538
V932 3161C>T missense_variant ORF1ab_pp1a P1054L 1.00e+0 B.1.538
V9779 1059G>T synonymous_variant N L353L 1.00e+0 B.1.538
V6526 4770T>C synonymous_variant ORF1ab_pp1a Y1590Y 7.71e-1 BA.5.2.39
V3939 1640C>A missense_variant S T547K 1.00e+0 XW





Manual curation of mutation (18204A>G)-related literature from PubMed

The pubmed.mineR and pubmed-mapper were utilized for extracting literature from PubMed, followed by manual filtering.
Note: PubMed: (COVID-19 [Title/Abstract] OR SARS-COV-2 [Title/Abstract]) AND (DNA mutation [Title/Abstract] OR Protein mutation-1 letter [Title/Abstract] OR Protein mutation-3 letter [Title/Abstract]).

DNA level Protein level Paper title Journal name Publication year Pubmed ID