Enhancing mRNA Translation Efficiency Through Deep Learning

Khalaj, Siavash

dc.contributor.author	Khalaj, Siavash
dc.contributor.supervisor	Turcotte, Marcel
dc.date.accessioned	2026-03-30T16:24:10Z
dc.date.available	2026-03-30T16:24:10Z
dc.date.issued	2026-03-30
dc.description.abstract	The 5' untranslated region (5' UTR) of mRNA plays a key role in regulating translation efficiency. Consequently, optimizing this region is important in synthetic biology and therapeutic mRNA design for increasing protein yield and functional potency. Despite advances in computational methods for modeling 5' UTR translation efficiency and sequence design, existing approaches do not account for mRNA secondary structure, provide limited control over sequence modification, and remain inefficient for bulk optimization. This thesis proposes a secondary structure-informed framework that combines accurate translation efficiency modeling with controllable, large-scale optimization of 5' UTR sequences. A graph attention network (GAT) encoder integrating nucleotide identity, positional information, and predicted mRNA secondary structure was first trained to accurately predict 5' UTR translation efficiency. The encoder was then extended into a multitask autoencoder by adding an autoregressive long short-term memory (LSTM) decoder. The autoencoder achieved near-perfect sequence reconstruction accuracy while maintaining the encoder's performance on translation efficiency prediction. The decoder was then fine-tuned using reinforcement learning to generate 5' UTR variants with higher predicted translation efficiency. Fine-tuning the decoder using the REINFORCE algorithm substantially increased the number of generated sequences with improved predicted translation efficiency, while DAP-regularized fine-tuning delivered improvements through smaller, more controlled edits that maintained greater similarity to the original sequences. Incorporating curriculum learning in DAP-regularized fine-tuning substantially increased the proportion of improved sequences with limited disruption to composition and entropy. 
Interpretability analyses confirmed that the framework captures biologically meaningful determinants of translation initiation and applies optimization strategies consistent with known regulatory mechanisms. Overall, this framework presents a novel approach to RNA sequence optimization and extends to regulatory elements beyond the 5' UTR.
dc.identifier.uri	http://hdl.handle.net/10393/51480
dc.identifier.uri	https://doi.org/10.20381/ruor-31818
dc.language.iso	en
dc.publisher	Université d'Ottawa / University of Ottawa
dc.rights	Attribution 4.0 International	en
dc.rights.uri	http://creativecommons.org/licenses/by/4.0/
dc.subject	5' UTR
dc.subject	mRNA translation efficiency
dc.subject	Mean Ribosome Load (MRL)
dc.subject	Graph Neural Network (GNN)
dc.subject	Graph Attention Network (GAT)
dc.subject	Reinforcement Learning
dc.subject	RNA sequence optimization
dc.subject	RNA secondary structure
dc.subject	Deep learning
dc.subject	Sequence-to-sequence model
dc.subject	RNA design
dc.subject	Multitask autoencoder
dc.subject	REINFORCE algorithm
dc.title	Enhancing mRNA Translation Efficiency Through Deep Learning
dc.type	Thesis	en
thesis.degree.discipline	Génie / Engineering
thesis.degree.level	Masters
thesis.degree.name	MCS
uottawa.department	Science informatique et génie électrique / Electrical Engineering and Computer Science

Files

Original bundle

Name: Khalaj_Siavash_2026_thesis.pdf
Size: 4.25 MB
Format: Adobe Portable Document Format

Collections

- Thèses, 2011 - // Theses, 2011 -