This repository contains some of the code used in the paper "Training Language Models with Language Feedback at Scale"
☆27Mar 30, 2023Updated 2 years ago
Alternatives and similar repositories for imitation_learning_from_language_feedback
Users who are interested in imitation_learning_from_language_feedback are comparing it to the libraries listed below
- [ICLR 2022] "Bayesian Modeling and Uncertainty Quantification for Learning to Optimize: What, Why, and How" by Yuning You, Yue Cao, Tianl…☆14Aug 19, 2022Updated 3 years ago
- Code for NAACL 2025 paper "AdaCAD: Adaptively Decoding to Balance Conflicts between Contextual and Parametric Knowledge"☆16Updated this week
- ☆11Oct 3, 2021Updated 4 years ago
- ☆16Mar 25, 2022Updated 3 years ago
- Code for EMNLP 2022 Paper: On the Calibration of Massively Multilingual Language Models☆15Jun 12, 2023Updated 2 years ago
- Code for☆16Oct 16, 2020Updated 5 years ago
- ☆17Nov 30, 2022Updated 3 years ago
- A publishing website of a table collecting meta-learning-related papers in the area of human language processing.☆17Aug 2, 2021Updated 4 years ago
- [NAACL 2022] This is the code repo for our paper `ACTUNE: Uncertainty-based Active Self-Training for Active Fine-Tuning of Pretrained Lan…☆15Nov 16, 2022Updated 3 years ago
- GCN and BERT for relation extraction☆18Jun 29, 2020Updated 5 years ago
- ☆15Feb 21, 2024Updated 2 years ago
- Official implementation of the paper "IteraTeR: Understanding Iterative Revision from Human-Written Text" (ACL 2022)☆80Nov 15, 2023Updated 2 years ago
- ☆26Nov 21, 2022Updated 3 years ago
- The "GPT-API-Accelerate" project provides a set of Python classes for accelerating the process of generating responses to prompts using t…☆23Oct 12, 2024Updated last year
- ☆25Jun 10, 2025Updated 8 months ago
- In-Context Learning User Simulators for Task-Oriented Dialog Systems☆30Jun 2, 2023Updated 2 years ago
- RENT (Reinforcement Learning via Entropy Minimization) is an unsupervised method for training reasoning LLMs.☆41Oct 31, 2025Updated 4 months ago
- ☆20Jan 16, 2024Updated 2 years ago
- [ACL 2024 Findings] CriticBench: Benchmarking LLMs for Critique-Correct Reasoning☆30Mar 5, 2024Updated 2 years ago
- Merging Generated and Retrieved Knowledge for Open-Domain QA (EMNLP 2023)☆22Oct 8, 2023Updated 2 years ago
- [ACL'24] Code and data of paper "When is Tree Search Useful for LLM Planning? It Depends on the Discriminator"☆54Feb 23, 2024Updated 2 years ago
- Align your LM to express calibrated verbal statements of confidence in its long-form generations.☆29Jun 4, 2024Updated last year
- Some python scripts for drawing figures in scientific papers☆28Jun 26, 2019Updated 6 years ago
- Active and Sample-Efficient Model Evaluation☆27May 22, 2025Updated 9 months ago
- This is the official repo for Towards Uncertainty-Aware Language Agent.☆31Aug 15, 2024Updated last year
- Neural topic modeling☆29Aug 19, 2020Updated 5 years ago
- No Parameters Left Behind: Sensitivity Guided Adaptive Learning Rate for Training Large Transformer Models (ICLR 2022)☆29Feb 9, 2022Updated 4 years ago
- Source code for the paper "Exploiting Excessive Invariance caused by Norm-Bounded Adversarial Robustness"☆25Feb 12, 2020Updated 6 years ago
- ☆35May 30, 2022Updated 3 years ago
- ☆30Dec 27, 2024Updated last year
- ☆10Nov 8, 2022Updated 3 years ago
- Python code to automatically produce a summary of a piece of text.☆12Sep 8, 2016Updated 9 years ago
- ☆10Nov 1, 2022Updated 3 years ago
- ☆38May 20, 2021Updated 4 years ago
- Code for "Democratizing Reasoning Ability: Tailored Learning from Large Language Model", EMNLP 2023☆36Dec 12, 2023Updated 2 years ago
- Implementation of latent-GLAT (ACL-2022)☆34Apr 30, 2022Updated 3 years ago
- Teaching Models to Express Their Uncertainty in Words☆39May 26, 2022Updated 3 years ago
- Scrape the entire Autohome (汽车之家) website☆10Dec 26, 2019Updated 6 years ago
- Supporting material for Princeton ORF307☆12Jan 14, 2026Updated last month