Date
Topic
Comments
Topic: GPT, ChatGPT (Weizhi)
Paper: GPT: Improving Language Understanding by Generative Pre-Training (Weizhi)
Paper: GPT-3: Language Models are Few-Shot Learners (Weizhi)
Paper: InstructGPT: Training Language Models to Follow Instructions with Human Feedback (Weizhi)
Topic: Retrieval-Augmented Generation
Paper: Retrieval-Augmented Generation for Knowledge-Intensive NLP Tasks (Xuan)
Paper: From Local to Global: A Graph RAG Approach to Query-Focused Summarization (Xuan)
Paper: SimCSE: Simple Contrastive Learning of Sentence Embeddings (Xuan)
Practice: Try LangChain to build a question answering chatbot
Topic: Multimodality
Paper: CLIP: Learning Transferable Visual Models From Natural Language Supervision
Paper: BLIP-2: Bootstrapping Language-Image Pre-training with Frozen Image Encoders and Large Language Models
Paper: Visual Instruction Tuning
Topic: Long Context
Topic: Prefix and Adapter in Context Learning Paper: Adapters: Parameter-Efficient Transfer Learning for NLP
Paper: Prefix-Tuning: Optimizing Continuous Prompts for Generation
Paper: LoRA: Low-Rank Adaptation of Large Language Models
Topic: Make it smaller
Paper: DeepSeek-V3 Technical Report (knowledge distillation part)
Paper: Fast Inference from Transformers via Speculative Decoding
Paper: FlexiDepth: Dynamic Layer-skipping in Pre-trained LLMs (brand new, Xuan)
Holiday