CS291K - Schedule


 

Date

Topic

Comments

Mar 29 Course Introduction  
Mar 31 Transformers / Quiz Basic Concepts
Apr 5 Topic: Time Series Forecasting (Shiyang)
Paper: DeepAR: Probabilistic Forecasting with Autoregressive Recurrent Networks
Paper: ConvTran: Enhancing the Locality and Breaking the Memory Bottleneck of Transformer on Time Series Forecasting
Readings: N-BEATS: Neural basis expansion analysis for interpretable time series forecasting
Readings: Temporal Fusion Transformers for Interpretable Multi-horizon Time Series Forecasting
 
Apr 7 Topic: Speech Recognition and Image Recognition Application (Shiyang)
Paper: Wav2vec: Unsupervised Pre-training for Speech Recognition 
Paper: ViT: An Image is Worth 16x16 Words: Transformers for Image Recognition at Scale
Apr 12

REALM, RAG, DPR, FiD (Jing)

Topic: Retrieval-Augmented Pre-training and Fine-tuning for Knowledge-Intensive NLP Tasks

Paper: REALM: Retrieval-Augmented Language Model Pre-Training 

Paper: Retrieval-Augmented Generation for Knowledge-Intensive NLP Tasks

 
Apr 14

Topic: Denoising Sequence-to-Sequence Pre-training for Language Understanding and Generation (Jing)

Paper: Exploring the Limits of Transfer Learning with a Unified Text-to-Text Transformer

Paper: BART: Denoising Sequence-to-Sequence Pre-training for Natural Language Generation, Translation, and Comprehension

 
Apr 19

Topic: Contrastive Learning of Sentence Embeddings (Kha-Dinh)

Paper: DeCLUTR: Deep Contrastive Learning for Unsupervised Textual Representations

Paper: SimCSE: Simple Contrastive Learning of Sentence Embeddings 
Project Proposal Due
Apr 21

Topic: Parameter-Efficient Fine-tuning for NLP

Paper: Parameter-Efficient Transfer Learning for NLP (Liu)

Paper: Prefix-Tuning: Optimizing Continuous Prompts for Generation (Zheng)

 
Apr 26

Topic: Conditional Natural Language Generation with Conditional Training (Xianjun)

Paper: CTRL: A Conditional Transformer Language Model for Controllable Generation

Paper: Neural Text Generation with Unlikelihood Training
 
Apr 28

Topic: Conditional Natural Language Generation with Guided Decoding

Paper: Plug and Play Language Models (Xuan)

Paper: GeDi: Generative Discriminator Guided Sequence Generation (Hezi)

Optional: Conditional Natural Language Generation with Prompting 

Paper: AutoPrompt: Eliciting Knowledge from Language Models with Automatically Generated Prompts

Prompt-Tuning

 
May 3

Object Tracking Application

Paper: TransTrack: Multiple Object Tracking with Transformer (Chengyuan)

Paper: LayoutLM: Pre-training of Text and Layout for Document Image Understanding (Pranjali)

Paper: LayoutLMv2: Multi-modal Pre-training for Visually-rich Document Understanding (Ari)
Paper review due
May 5

Topic: Knowledge Extraction from Pretrained Language Models

Paper: Language Models as Knowledge Bases? (Gyuwan)

Paper: Do As I Can, Not As I Say: Grounding Language in Robotic Affordances (Yuejie)
 
May 10

Topic: Make it smaller

Paper: DistilBERT, a distilled version of BERT: smaller, faster, cheaper and lighter (Navya)

Paper: TinyBERT: Distilling BERT for Natural Language Understanding (Dan)

May 12

Topic: Pretrained Models for Long Documents

Paper: Transformer-XL: Attentive Language Models Beyond a Fixed-Length Context (Jiachen)

Paper: Longformer: The Long-Document Transformer (Reeva)
May 17

Topic: Architecture Idea (Hong)

Paper: Outrageously Large Neural Networks: The Sparsely-Gated Mixture-of-Experts Layer 

Paper: Lifelong Learning with Dynamically Expandable Networks
Paper: DEMix Layers: Disentangling Domains for Modular Language Modeling
 
May 19 Topic: Transformer-based Reinforcement Learning
Paper: Decision Transformer: Reinforcement Learning via Sequence Modeling (Eddie)
Midterm Quiz (covers April 5 - May 10)
May 24 Dialogue Application: Pre-trained Models for End-to-End Task-oriented Dialogue Modeling
Paper: A Simple Language Model for Task-Oriented Dialogue (Apoorva)
Paper: Zero-Shot Text-to-Image Generation (Weixi)
Paper: Multi-Task Pre-Training for Plug-and-Play Task-Oriented Dialogue System (optional)
 
May 26 Project Presentation  
May 31 Project Presentation  
June 2 Project Presentation  
June 8 Project Final Report Due