Algorithm for Optimised mRNA Design Improves Stability and Immunogenicity


Messenger RNA (mRNA) vaccines are being used to contain COVID-19 (1, 2, 3), but still suffer from the critical limitation of mRNA instability and degradation, which is a major obstacle in the storage, distribution, and efficacy of the vaccine products (4). Previous work showed that increasing secondary structure lengthens mRNA half-life, which, together with optimal codons, improves protein expression (5). Therefore, a principled mRNA design algorithm must optimize both structural stability and codon usage. However, due to synonymous codons, the mRNA design space is prohibitively large (e.g., ~10632 candidates for the SARS-CoV-2 Spike protein), which poses insurmountable computational challenges. Here we provide a simple and unexpected solution using a classical concept in computational linguistics, where finding the optimal mRNA sequence is akin to identifying the most likely sentence among similar sounding alternatives (6). Our algorithm LinearDesign takes only 11 minutes for the Spike protein, and can jointly optimize stability and codon usage. On both COVID-19 and varicella-zoster virus mRNA vaccines, LinearDesign substantially improves mRNA half-life and protein expression, and dramatically increases antibody titer by up to 128× in vivo, compared to the codon-optimization benchmark. This surprising result reveals the great potential of principled mRNA design, and enables the exploration of previously unreachable but highly stable and efficient designs. Our work is a timely tool not only for vaccines but also for mRNA medicine encoding all therapeutic proteins (e.g., monoclonal antibodies and anti-cancer drugs (7, 8)).