Trinkle23897 / CS294-112
CS 294-112 @ UCB Deep RL
☆22Updated last year
Alternatives and similar repositories for CS294-112:
Users that are interested in CS294-112 are comparing it to the libraries listed below
- [ACL 2023 Findings] What In-Context Learning “Learns ” In-Context: Disentangling Task Recognition and Task Learning☆22Updated last year
- Crawl & visualize ICLR papers and reviews.☆18Updated 2 years ago
- Domain-specific preference (DSP) data and customized RM fine-tuning.☆24Updated 10 months ago
- Source code for paper "On the Pareto Front of Multilingual Neural Machine Translation" @ NeurIPS 2023☆16Updated last year
- Complexity Based Prompting for Multi-Step Reasoning☆16Updated last year
- ☆30Updated 4 months ago
- Group-conditional DRO to alleviate spurious correlations☆15Updated 3 years ago
- The code for creating the iGSM datasets in papers "Physics of Language Models Part 2.1, Grade-School Math and the Hidden Reasoning Proces…☆19Updated this week
- ☆30Updated 9 months ago
- Neural Logic Inductive Learning☆41Updated 2 years ago
- EMNLP'2022: BERTScore is Unfair: On Social Bias in Language Model-Based Metrics for Text Generation☆40Updated 2 years ago
- A Kernel-Based View of Language Model Fine-Tuning https://arxiv.org/abs/2210.05643☆74Updated last year
- The official repository for the paper "From Zero to Hero: Examining the Power of Symbolic Tasks in Instruction Tuning".☆63Updated last year
- ☆16Updated 3 years ago
- Machine Learning repo☆37Updated 2 years ago
- Source code for COLING 2022 paper "Automatic Label Sequence Generation for Prompting Sequence-to-sequence Models"☆24Updated 2 years ago
- ☆35Updated 5 years ago
- Code for Paper (Policy Optimization in RLHF: The Impact of Out-of-preference Data)☆24Updated last year
- Code for the ACL-2022 paper "StableMoE: Stable Routing Strategy for Mixture of Experts"☆43Updated 2 years ago
- ☆16Updated 3 years ago
- code for RIM☆22Updated 2 years ago
- Offical code of the paper Large Language Models Are Implicitly Topic Models: Explaining and Finding Good Demonstrations for In-Context Le…☆70Updated 9 months ago
- Resources for our AAAI 2022 paper: "Unsupervised Editing for Counterfactual Stories".☆11Updated 2 years ago
- A probabilitic model for contextual word representation. Accepted to ACL2023 Findings.☆21Updated last year
- The information of NLP PhD application in the world.☆35Updated 4 months ago
- Feeling confused about super alignment? Here is a reading list☆42Updated last year
- Code for Residual Energy-Based Models for Text Generation in PyTorch.☆23Updated 3 years ago
- ☆39Updated 2 years ago
- ☆27Updated 11 months ago