Repository logo

Enhancing mRNA Translation Efficiency Through Deep Learning

dc.contributor.authorKhalaj, Siavash
dc.contributor.supervisorTurcotte, Marcel
dc.date.accessioned2026-03-30T16:24:10Z
dc.date.available2026-03-30T16:24:10Z
dc.date.issued2026-03-30
dc.description.abstractThe 5' untranslated region (5' UTR) of mRNA plays a key role in regulating translation efficiency. Consequently, optimizing this region is important in synthetic biology and therapeutic mRNA design for increasing protein yield and functional potency. Despite advances in computational methods for modeling 5' UTR translation efficiency and sequence design, existing approaches do not account for mRNA secondary structure, provide limited control over sequence modification, and remain inefficient for bulk optimization. This thesis proposes a secondary structure-informed framework that combines accurate translation efficiency modeling with controllable, large-scale optimization of 5' UTR sequences. A graph attention network (GAT) encoder integrating nucleotide identity, positional information, and predicted mRNA secondary structure was first trained to accurately predict 5' UTR translation efficiency. The encoder was then extended into a multitask autoencoder by adding an autoregressive long short-term memory (LSTM) decoder. The autoencoder achieved near-perfect sequence reconstruction accuracy while maintaining the encoder's performance on translation efficiency prediction. The decoder was then fine-tuned using reinforcement learning to generate 5' UTR variants with higher predicted translation efficiency. Fine-tuning the decoder using the REINFORCE algorithm substantially increased the number of generated sequences with improved predicted translation efficiency, while DAP-regularized fine-tuning delivered improvements through smaller, more controlled edits that maintained greater similarity to the original sequences. Incorporating curriculum learning in DAP-regularized fine-tuning substantially increased the proportion of improved sequences with limited disruption to composition and entropy. Interpretability analyses confirmed that the framework captures biologically meaningful determinants of translation initiation and applies optimization strategies consistent with known regulatory mechanisms. Overall, this framework presents a novel approach to RNA sequence optimization and extends to regulatory elements beyond the 5' UTR.
dc.identifier.urihttp://hdl.handle.net/10393/51480
dc.identifier.urihttps://doi.org/10.20381/ruor-31818
dc.language.isoen
dc.publisherUniversité d'Ottawa / University of Ottawa
dc.rightsAttribution 4.0 Internationalen
dc.rights.urihttp://creativecommons.org/licenses/by/4.0/
dc.subject5' UTR
dc.subjectmRNA translation efficiency
dc.subjectMean Ribosome Load (MRL)
dc.subjectGraph Neural Network (GNN)
dc.subjectGraph Attention Network (GAT)
dc.subjectReinforcement Learning
dc.subjectRNA sequence optimization
dc.subjectRNA secondary structure
dc.subjectDeep learning
dc.subjectSequence-to-sequence model
dc.subjectRNA design
dc.subjectMultitask autoencoder
dc.subjectREINFORCE algorithm
dc.titleEnhancing mRNA Translation Efficiency Through Deep Learning
dc.typeThesisen
thesis.degree.disciplineGénie / Engineering
thesis.degree.levelMasters
thesis.degree.nameMCS
uottawa.departmentScience informatique et génie électrique / Electrical Engineering and Computer Science

Files

Original bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail ImageThumbnail Image
Name:
Khalaj_Siavash_2026_thesis.pdf
Size:
4.25 MB
Format:
Adobe Portable Document Format

License bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail ImageThumbnail Image
Name:
license.txt
Size:
6.65 KB
Format:
Item-specific license agreed upon to submission
Description: