Repository logo

Temporal Pyramid Structure for Video Frame Interpolation

dc.contributor.authorYang, Jiaqi
dc.contributor.supervisorZhao, Jiying
dc.date.accessioned2025-01-09T14:05:45Z
dc.date.available2025-01-09T14:05:45Z
dc.date.issued2025-01-09
dc.description.abstractThe most prevalent structure in video frame interpolation involves using optical flow to guide frame warping, which typically considers only the two adjacent frames. However, these methods often fail to capture long-range temporal dependencies and often result in significant deformation in complex motion scenarios. We propose a novel Temporal Pyramid Attention (TPA) block, which employs a temporal pyramid structure to connect four frames within a sliding window for the generation of intermediate frames. The temporal pyramid structure consists of three layers to leverage multi-level features, estimate the frame window, and connect with a GRU to generate a bi-directional feature flow. Furthermore, the dual pyramid structure incorporates channel attention mechanisms, enabling the interpolation of three frames in a single process. The TPA block employs a multi-scale approach to effectively capture temporal dependencies and spatial correlations, enhancing the quality of interpolated frames. Our model achieves a state-of-the-art performance on the Vimeo90K septuplet dataset compared to existing methods using pre-trained parameters.
dc.identifier.urihttp://hdl.handle.net/10393/50062
dc.identifier.urihttps://doi.org/10.20381/ruor-30831
dc.language.isoen
dc.publisherUniversité d'Ottawa | University of Ottawa
dc.rightsAttribution 4.0 Internationalen
dc.rights.urihttp://creativecommons.org/licenses/by/4.0/
dc.subjectDeep learning
dc.subjectVideo frame interpolation
dc.subjectGated recurrent unit
dc.subjectKnowledge distillation
dc.subjectTemporal feature extraction
dc.titleTemporal Pyramid Structure for Video Frame Interpolation
dc.typeThesisen
thesis.degree.disciplineGénie / Engineering
thesis.degree.levelMasters
thesis.degree.nameMASc
uottawa.departmentScience informatique et génie électrique / Electrical Engineering and Computer Science

Files

Original bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail ImageThumbnail Image
Name:
Yang_Jiaqi_2024_thesis.pdf
Size:
23.35 MB
Format:
Adobe Portable Document Format

License bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail ImageThumbnail Image
Name:
license.txt
Size:
6.65 KB
Format:
Item-specific license agreed upon to submission
Description: