Trinkle23897 / CS294-112
CS 294-112 @ UCB Deep RL
☆23 · Updated 2 years ago
Alternatives and similar repositories for CS294-112
Users who are interested in CS294-112 are comparing it to the repositories listed below.
- [ACL 2023 Findings] What In-Context Learning “Learns” In-Context: Disentangling Task Recognition and Task Learning ☆21 · Updated last year
- Machine Learning repo ☆37 · Updated 2 years ago
- Minimal RLHF implementation built on top of minGPT. ☆29 · Updated 11 months ago
- Feeling confused about super alignment? Here is a reading list ☆42 · Updated last year
- Notes of my introduction about NLP in Fudan University ☆37 · Updated 3 years ago
- ☆18 · Updated 4 years ago
- ☆46 · Updated this week
- EMNLP'2022: BERTScore is Unfair: On Social Bias in Language Model-Based Metrics for Text Generation ☆41 · Updated 2 years ago
- Crawl & visualize ICLR papers and reviews. ☆18 · Updated 2 years ago
- Domain-specific preference (DSP) data and customized RM fine-tuning. ☆25 · Updated last year
- ☆39 · Updated 2 years ago
- The information of NLP PhD application in the world. ☆37 · Updated 9 months ago
- Course Materials for ML Course at Tsinghua ☆25 · Updated 5 years ago
- ☆16 · Updated 4 years ago
- ☆35 · Updated 5 years ago
- A Kernel-Based View of Language Model Fine-Tuning https://arxiv.org/abs/2210.05643 ☆75 · Updated last year
- Momentum Decoding: Open-ended Text Generation as Graph Exploration ☆19 · Updated 2 years ago
- the public repo for stats205 scribe notes at Stanford University ☆13 · Updated 4 years ago
- ☆16 · Updated 3 years ago
- SCoRe: Training Language Models to Self-Correct via Reinforcement Learning ☆10 · Updated 5 months ago
- Code for the paper "Query-Key Normalization for Transformers" ☆41 · Updated 4 years ago
- Conic10K: A large-scale dataset for closed-vocabulary math problem understanding. Accepted to EMNLP2023 Findings. ☆26 · Updated last year
- [EMNLP 2021] MuVER: Improving First-Stage Entity Retrieval with Multi-View Entity Representations ☆31 · Updated 3 years ago
- ☆100 · Updated 3 years ago
- ☆15 · Updated 4 years ago
- Complexity Based Prompting for Multi-Step Reasoning ☆17 · Updated 2 years ago
- Projects for the Peking University course "Techniques and Applications of Deep Learning" ☆13 · Updated 8 years ago
- ☆17 · Updated 4 years ago
- ☆11 · Updated 2 months ago
- domain adaptation in NLP ☆53 · Updated 3 years ago