Description. Schematic. Table of features
11889
Description
Messenger RNA encoding the full-length SARS-CoV-2 spike glycoprotein.
Schematic
UTR = Untranslated region; sig = extended signal sequence of the S glycoprotein; S protein_mut = S glycoprotein sequence containing mutations K986P and V987P; poly(A) = polyadenylate signal tail.
5‘- capping structure
cap G1A2 = m7G+m3'-5'-ppp-5'-Am2'-3'-p- [m7 = 7-CH3; m3' = 3'-O-CH3; m2' = 2'-O-CH3; -ppp- = -PO2H-O-PO2H-O-PO2H)-; -p- = -PO2H-]
m1Ψ = 1-methyl-3'-pseudouridylyl
Table of features
Element
| Description
| Position
| cap
| A modified 5’-cap1 structure (m7G+m3'-5'-ppp-5'-Am)
| 1-2
| 5’-UTR
| 5´ -untranslated region derived from human alpha-globin RNA with an optimized Kozak sequence
| 3-54
| sig
| S glycoprotein signal peptide (extended leader sequence), which guides translocation of the nascent polypeptide chain into the endoplasmic reticulum.
| 55-102
| S protein_mut
| Codon-optimized sequence encoding full-length SARS-CoV-2 spike (S) glycoprotein containing mutations K986P and V987P to ensure the S glycoprotein remains in an antigenically optimal pre-fusion conformation; stop codons: 3874-3879 (underlined)
| 103-3879
| 3’-UTR
| The 3´ untranslated region comprises two sequence elements derived from the amino-terminal enhancer of split (AES) mRNA and the mitochondrial encoded 12S ribosomal RNA to confer RNA stability and high total protein expression.
| 3880-4174
| poly(A)
| A 110-nucleotide poly(A)-tail consisting of a stretch of 30 adenosine residues, followed by a 10-nucleotide linker sequence and another 70 adenosine residues.
| 4175-4284
|
|