This repository contains some of the code used in the paper "Training Language Models with Langauge Feedback at Scale"
☆27Mar 30, 2023Updated 2 years ago
Alternatives and similar repositories for imitation_learning_from_language_feedback
Users that are interested in imitation_learning_from_language_feedback are comparing it to the libraries listed below
Sorting:
- [ICLR 2022] "Bayesian Modeling and Uncertainty Quantification for Learning to Optimize: What, Why, and How" by Yuning You, Yue Cao, Tianl…☆14Aug 19, 2022Updated 3 years ago
- ☆11Oct 3, 2021Updated 4 years ago
- Code for NAACL 2025 paper "AdaCAD: Adaptively Decoding to Balance Conflicts between Contextual and Parametric Knowledge"☆16Updated this week
- ☆16Mar 25, 2022Updated 3 years ago
- Code for EMNLP 2022 Paper: On the Calibration of Massively Multilingual Language Models☆15Jun 12, 2023Updated 2 years ago
- Code for☆16Oct 16, 2020Updated 5 years ago
- ☆17Nov 30, 2022Updated 3 years ago
- A publishing website of a table collecting meta-learning-related papers in the area of human language processing.☆17Aug 2, 2021Updated 4 years ago
- [NAACL 2022] This is the code repo for our paper `ACTUNE: Uncertainty-based Active Self-Training for Active Fine-Tuning of Pretrained Lan…☆15Nov 16, 2022Updated 3 years ago
- ☆15Feb 21, 2024Updated 2 years ago
- Official implementation of the paper "IteraTeR: Understanding Iterative Revision from Human-Written Text" (ACL 2022)☆80Nov 15, 2023Updated 2 years ago
- ☆20Jan 16, 2024Updated 2 years ago
- Privacy-Preserving Bandits (MLSys'20)☆22Dec 8, 2022Updated 3 years ago
- ☆26Nov 21, 2022Updated 3 years ago
- RENT (Reinforcement Learning via Entropy Minimization) is an unsupervised method for training reasoning LLMs.☆41Oct 31, 2025Updated 4 months ago
- Merging Generated and Retrieved Knowledge for Open-Domain QA (EMNLP 2023)☆22Oct 8, 2023Updated 2 years ago
- [ACL 2024 Findings] CriticBench: Benchmarking LLMs for Critique-Correct Reasoning☆30Mar 5, 2024Updated 2 years ago
- Align your LM to express calibrated verbal statements of confidence in its long-form generations.☆29Jun 4, 2024Updated last year
- [EMNLP 2023] The CoT Collection: Improving Zero-shot and Few-shot Learning of Language Models via Chain-of-Thought Fine-Tuning☆254Oct 31, 2023Updated 2 years ago
- Active and Sample-Efficient Model Evaluation☆27May 22, 2025Updated 9 months ago
- This is the official repo for Towards Uncertainty-Aware Language Agent.☆31Aug 15, 2024Updated last year
- Some python scripts for drawing figures in scientific papers☆28Jun 26, 2019Updated 6 years ago
- ☆35May 30, 2022Updated 3 years ago
- ☆30Dec 27, 2024Updated last year
- ☆38May 20, 2021Updated 4 years ago
- Streamlit apps on Cloud Run with Identity-Aware Proxy (IAP).☆10Mar 5, 2022Updated 4 years ago
- ☆10Nov 1, 2022Updated 3 years ago
- Implementation of latent-GLAT (ACL-2022)☆34Apr 30, 2022Updated 3 years ago
- MACTA: A Multi-agent Reinforcement Learning Approach for Cache Timing Attacks and Detection☆46Apr 25, 2023Updated 2 years ago
- Teaching Models to Express Their Uncertainty in Words☆39May 26, 2022Updated 3 years ago
- ☆40Aug 11, 2023Updated 2 years ago
- ANnotation-based ANalysis of Specific Interactions☆10Oct 10, 2025Updated 4 months ago
- Supporting material for Princeton ORF307☆12Jan 14, 2026Updated last month
- ☆11Feb 28, 2022Updated 4 years ago
- Machine learning for molecules workshop 2022☆13Nov 30, 2022Updated 3 years ago
- The official repo for "CodeScaler: Scaling Code LLM Training and Test-Time Inference via Execution-Free Reward Models"☆29Feb 23, 2026Updated last week
- Introduction to Machine Learning using scikit-learn and PyTorch☆10Sep 26, 2019Updated 6 years ago
- 抓取汽车之家全站☆10Dec 26, 2019Updated 6 years ago
- Prototype of Winium.StoreApps driver using CodedUI. Implements JsonWireProtocol for automation of Windows Phone applications☆10Mar 28, 2017Updated 8 years ago