This assignment is due on Tue, Nov 26, 2024 at 11:59pm PST.
Starter code containing Colab notebooks can be downloaded here.
Note. Ensure you are periodically saving your notebook (File -> Save) so that you don’t lose your progress if you step away from the assignment and the Colab VM disconnects.
Once you have completed all Colab notebooks except collect_submission.ipynb, proceed to the submission instructions.
In this assignment, you will implement language networks and apply them to image captioning on the COCO dataset. Then, you will be introduced to self-supervised learning to automatically learn the visual representations of an unlabeled dataset.
The goals of this assignment are as follows:

- Understand and implement the Transformer architecture, including multi-head self-attention.
- Apply a Transformer model to image captioning on the COCO dataset.
- Understand how self-supervised pretraining can be used to learn visual representations from unlabeled data.
You will use PyTorch for the majority of this homework.
The notebook Transformer_Captioning.ipynb will walk you through the implementation of a Transformer model and apply it to image captioning on COCO.
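One standard building block of a Transformer implementation is the sinusoidal positional encoding. The notebook itself uses PyTorch; the sketch below is a minimal NumPy version for illustration only, and the function name and shapes are assumptions, not the notebook's actual API.

```python
import numpy as np

def positional_encoding(max_len, d_model):
    """Sinusoidal positional encoding from "Attention Is All You Need".

    Returns an array of shape (max_len, d_model) where even columns hold
    sin and odd columns hold cos at geometrically spaced frequencies.
    """
    pos = np.arange(max_len)[:, None]              # (max_len, 1)
    i = np.arange(0, d_model, 2)[None, :]          # (1, d_model / 2)
    angles = pos / np.power(10000.0, i / d_model)  # (max_len, d_model / 2)
    pe = np.zeros((max_len, d_model))
    pe[:, 0::2] = np.sin(angles)                   # even dimensions
    pe[:, 1::2] = np.cos(angles)                   # odd dimensions
    return pe
```

Because each dimension oscillates at a different frequency, every position gets a distinct encoding that the model can learn to use for relative ordering.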
For the MultiHeadAttention class in the Transformer_Captioning.ipynb notebook, you are expected to apply dropout to the attention weights.
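Concretely, the dropout goes on the softmaxed attention weights, before they multiply the values, not on the inputs or the output. A hedged single-head NumPy sketch (the notebook's MultiHeadAttention is in PyTorch, where `torch.nn.Dropout` would play this role; names here are illustrative):

```python
import numpy as np

def softmax(x, axis=-1):
    x = x - x.max(axis=axis, keepdims=True)  # subtract max for stability
    e = np.exp(x)
    return e / e.sum(axis=axis, keepdims=True)

def attention(q, k, v, p_drop=0.1, rng=None, train=True):
    """Scaled dot-product attention with dropout on the attention weights.

    q, k, v: (seq_len, d) arrays. Inverted dropout is applied to the
    softmax output, matching the requirement stated above.
    """
    d = q.shape[-1]
    weights = softmax(q @ k.T / np.sqrt(d))        # (seq_len, seq_len)
    if train and p_drop > 0:
        rng = rng or np.random.default_rng(0)
        mask = rng.random(weights.shape) >= p_drop
        weights = weights * mask / (1.0 - p_drop)  # inverted dropout scaling
    return weights @ v
```

With `train=False` (or `p_drop=0`), this reduces to plain scaled dot-product attention.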
In the notebook Self_Supervised_Learning.ipynb, you will learn how to leverage self-supervised pretraining to obtain better performance on image classification tasks. When first opening the notebook, go to Runtime > Change runtime type and set Hardware accelerator to GPU.
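Self-supervised pretraining of this kind commonly learns by pulling together embeddings of two augmented views of the same image while pushing apart views of different images. As a sketch only, assuming a SimCLR-style contrastive objective (the notebook's actual method and API may differ), here is the NT-Xent loss in NumPy:

```python
import numpy as np

def nt_xent_loss(z1, z2, tau=0.5):
    """SimCLR-style normalized temperature-scaled cross-entropy loss.

    z1, z2: (N, d) embeddings of two augmented views of the same N images.
    Each embedding's positive is the other view of the same image; the
    remaining 2N - 2 embeddings in the batch serve as negatives.
    """
    z = np.concatenate([z1, z2], axis=0)
    z = z / np.linalg.norm(z, axis=1, keepdims=True)  # cosine similarity
    sim = z @ z.T / tau                               # (2N, 2N) logits
    np.fill_diagonal(sim, -np.inf)                    # exclude self-pairs
    n = z1.shape[0]
    # index of each embedding's positive: row i pairs with row i +/- n
    pos = np.concatenate([np.arange(n, 2 * n), np.arange(n)])
    logsumexp = np.log(np.exp(sim).sum(axis=1))
    loss = -(sim[np.arange(2 * n), pos] - logsumexp)  # cross-entropy per row
    return loss.mean()
```

Minimizing this loss over many unlabeled images yields features that transfer to downstream classification, which is the effect the notebook measures.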
Important. Please make sure that the submitted notebooks have been run and the cell outputs are visible.
1. Open collect_submission.ipynb in Colab and execute the notebook cells.
This notebook/script will:

- Generate a zip file of your code (.py and .ipynb) called a5_code_submission.zip.
- Convert the notebooks into a single PDF called a5_inline_submission.pdf.

If your submission for this step was successful, you should see the following display message:
### Done! Please submit a5_code_submission.zip and the a5_inline_submission.pdf to Gradescope. ###
2. Submit the PDF and the zip file to Gradescope.
Remember to download a5_code_submission.zip and a5_inline_submission.pdf locally before submitting to Gradescope.