Trinkle23897 / CS294-112
CS 294-112 @ UCB Deep RL
☆23Updated 2 years ago
Alternatives and similar repositories for CS294-112
Users that are interested in CS294-112 are comparing it to the libraries listed below
Sorting:
- [ACL 2023 Findings] What In-Context Learning “Learns” In-Context: Disentangling Task Recognition and Task Learning☆21Updated last year
- Crawl & visualize ICLR papers and reviews.☆18Updated 2 years ago
- ☆18Updated 4 years ago
- ☆16Updated 4 years ago
- the public repo for stats205 scribe notes at Stanford University☆13Updated 3 years ago
- ☆100Updated 3 years ago
- Minimal RLHF implementation built on top of minGPT.☆28Updated 10 months ago
- ☆11Updated last month
- Complexity Based Prompting for Multi-Step Reasoning☆17Updated 2 years ago
- EMNLP'2022: BERTScore is Unfair: On Social Bias in Language Model-Based Metrics for Text Generation☆41Updated 2 years ago
- ☆31Updated 8 months ago
- ☆39Updated 2 years ago
- Machine Learning Course Materials, Tsinghua IIIS☆17Updated 6 years ago
- A Kernel-Based View of Language Model Fine-Tuning https://arxiv.org/abs/2210.05643☆76Updated last year
- Feeling confused about super alignment? Here is a reading list☆42Updated last year
- Machine Learning repo☆37Updated 2 years ago
- This repo is to demo the concept of lossless compression with Transformers as encoder and decoder.☆14Updated last year
- Code for the paper "Query-Key Normalization for Transformers"☆41Updated 4 years ago
- Momentum Decoding: Open-ended Text Generation as Graph Exploration☆19Updated 2 years ago
- ☆33Updated 3 years ago
- Neural Logic Inductive Learning☆42Updated 2 years ago
- The code and data for the paper JiuZhang3.0☆44Updated 11 months ago
- Official implementation of AAAI 2025 paper "Augmenting Math Word Problems via Iterative Question Composing"(https://arxiv.org/abs/2401.09…☆20Updated 5 months ago
- Source code for our AAAI'22 paper 《From Dense to Sparse: Contrastive Pruning for Better Pre-trained Language Model Compression》☆24Updated 3 years ago
- Course Materials for ML Course at Tsinghua☆24Updated 5 years ago
- Notes of my introduction about NLP in Fudan University☆37Updated 3 years ago
- ☆26Updated 3 years ago
- Interpolation between Residual and Non-Residual Networks, ICML 2020. https://arxiv.org/abs/2006.05749☆26Updated 4 years ago
- Domain-specific preference (DSP) data and customized RM fine-tuning.☆25Updated last year
- ☆14Updated last year