Date
Topic
Comments
Topic: GPT, ChatGPT (Weizhi)
Paper: GPT: Improving Language Understanding by Generative Pre-Training (Weizhi)
Paper: GPT-3: Language Models are Few-Shot Learners (Weizhi)
Paper: InstructGPT: Training Language Models to Follow Instructions with Human Feedback (Weizhi)
Topic: Retrieval-Augmented Generation
Paper: Retrieval-Augmented Generation for Knowledge-Intensive NLP Tasks (Xuan)
Paper: From Local to Global: A Graph RAG Approach to Query-Focused Summarization (Xuan)
Paper: SimCSE: Simple Contrastive Learning of Sentence Embeddings (Xuan)
Practice: Try LangChain to build a question answering chatbot
Topic: Multimodality
Paper: CLIP: Learning Transferable Visual Models From Natural Language Supervision (Lixing Guo)
Paper: BLIP-2: Bootstrapping Language-Image Pre-training with Frozen Image Encoders and Large Language Models (Yuexi Shen)
Paper: Visual Instruction Tuning (Sathvika Anand)
Topic: Long Context
Topic: Prefix and Adapter in Context Learning Paper: Adapters: Parameter-Efficient Transfer Learning for NLP (Jinghan Zhang)
Paper: Prefix-Tuning: Optimizing Continuous Prompts for Generation (Ron)
Paper: LoRA: Low-Rank Adaptation of Large Language Models (Wesley)
Topic: Make it smaller
Paper: DeepSeek-V3 Technical Report (knowledge distillation part) (Guannan Wang)
Paper: Fast Inference from Transformers via Speculative Decoding (Chieh-Ying Lai)
Paper: Adaptive Layer-skipping in Pre-trained LLMs (brand new, Xuan)
Holiday