JeremyAlain / imitation_learning_from_language_feedbackView external linksLinks
This repository contains some of the code used in the paper "Training Language Models with Langauge Feedback at Scale"
☆27Mar 30, 2023Updated 2 years ago
Alternatives and similar repositories for imitation_learning_from_language_feedback
Users that are interested in imitation_learning_from_language_feedback are comparing it to the libraries listed below
Sorting:
- [ICLR 2022] "Bayesian Modeling and Uncertainty Quantification for Learning to Optimize: What, Why, and How" by Yuning You, Yue Cao, Tianl…☆14Aug 19, 2022Updated 3 years ago
- Code for NAACL 2025 paper "AdaCAD: Adaptively Decoding to Balance Conflicts between Contextual and Parametric Knowledge"☆16Oct 14, 2024Updated last year
- ☆11Oct 3, 2021Updated 4 years ago
- ☆16Mar 25, 2022Updated 3 years ago
- Code for EMNLP 2022 Paper: On the Calibration of Massively Multilingual Language Models☆15Jun 12, 2023Updated 2 years ago
- ☆17Nov 30, 2022Updated 3 years ago
- [NAACL 2022] This is the code repo for our paper `ACTUNE: Uncertainty-based Active Self-Training for Active Fine-Tuning of Pretrained Lan…☆15Nov 16, 2022Updated 3 years ago
- ☆15Feb 21, 2024Updated last year
- GCN and BERT for relation extraction☆18Jun 29, 2020Updated 5 years ago
- Official implementation of the paper "IteraTeR: Understanding Iterative Revision from Human-Written Text" (ACL 2022)☆80Nov 15, 2023Updated 2 years ago
- ☆25Jun 10, 2025Updated 8 months ago
- ☆20Jan 16, 2024Updated 2 years ago
- ☆26Nov 21, 2022Updated 3 years ago
- RENT (Reinforcement Learning via Entropy Minimization) is an unsupervised method for training reasoning LLMs.☆41Oct 31, 2025Updated 3 months ago
- [ACL 2024 Findings] CriticBench: Benchmarking LLMs for Critique-Correct Reasoning☆30Mar 5, 2024Updated last year
- Merging Generated and Retrieved Knowledge for Open-Domain QA (EMNLP 2023)☆22Oct 8, 2023Updated 2 years ago
- [ACL'24] Code and data of paper "When is Tree Search Useful for LLM Planning? It Depends on the Discriminator"☆54Feb 23, 2024Updated last year
- Some python scripts for drawing figures in scientific papers☆27Jun 26, 2019Updated 6 years ago
- Active and Sample-Efficient Model Evaluation☆27May 22, 2025Updated 8 months ago
- No Parameters Left Behind: Sensitivity Guided Adaptive Learning Rate for Training Large Transformer Models (ICLR 2022)☆29Feb 9, 2022Updated 4 years ago
- ☆39Jul 25, 2024Updated last year
- Multimodal Transformers for biomedical text and Knowledge Graph data☆34Mar 3, 2023Updated 2 years ago
- Python code to automatically produce a summary of a piece of text.☆12Sep 8, 2016Updated 9 years ago
- ☆38May 20, 2021Updated 4 years ago
- Code for "Democratizing Reasoning Ability: Tailored Learning from Large Language Model", EMNLP 2023☆36Dec 12, 2023Updated 2 years ago
- ☆10Nov 8, 2022Updated 3 years ago
- ☆10Nov 1, 2022Updated 3 years ago
- Implementation of latent-GLAT (ACL-2022)☆34Apr 30, 2022Updated 3 years ago
- ☆49Apr 4, 2025Updated 10 months ago
- Teaching Models to Express Their Uncertainty in Words☆39May 26, 2022Updated 3 years ago
- ☆40Aug 11, 2023Updated 2 years ago
- 用Paddle复现论文ChineseBERT: Chinese Pretraining Enhanced by Glyph and Pinyin Information(ACL2021)☆10Nov 15, 2021Updated 4 years ago
- ☆10May 5, 2017Updated 8 years ago
- Machine learning for molecules workshop 2022☆13Nov 30, 2022Updated 3 years ago
- ☆11Jun 18, 2023Updated 2 years ago
- ANnotation-based ANalysis of Specific Interactions☆10Oct 10, 2025Updated 4 months ago
- With the rapid adoption of smartphones, tablets, and mobile apps, they are increasingly becoming part of children’s daily life for amusem…☆12Apr 7, 2017Updated 8 years ago
- ☆12Aug 15, 2023Updated 2 years ago
- Code for the paper "Semi-Conditional Normalizing Flows for Semi-Supervised Learning"☆11Mar 30, 2020Updated 5 years ago