This repository contains some of the code used in the paper "Training Language Models with Language Feedback at Scale".
☆26Mar 30, 2023Updated 3 years ago
Alternatives and similar repositories for imitation_learning_from_language_feedback
Users that are interested in imitation_learning_from_language_feedback are comparing it to the libraries listed below.
- Code for EMNLP 2022 Paper: On the Calibration of Massively Multilingual Language Models☆15Jun 12, 2023Updated 3 years ago
- [ICLR 2022] "Bayesian Modeling and Uncertainty Quantification for Learning to Optimize: What, Why, and How" by Yuning You, Yue Cao, Tianl…☆14Aug 19, 2022Updated 3 years ago
- Code for☆15Oct 16, 2020Updated 5 years ago
- ☆11Oct 3, 2021Updated 4 years ago
- A publishing website of a table collecting meta-learning-related papers in the area of human language processing.☆17Aug 2, 2021Updated 4 years ago
- ☆16Mar 25, 2022Updated 4 years ago
- Code for NAACL 2025 paper "AdaCAD: Adaptively Decoding to Balance Conflicts between Contextual and Parametric Knowledge"☆16Mar 2, 2026Updated 3 months ago
- GCN and BERT for relation extraction☆18Jun 29, 2020Updated 5 years ago
- [NAACL 2022] This is the code repo for our paper `ACTUNE: Uncertainty-based Active Self-Training for Active Fine-Tuning of Pretrained Lan…☆15Nov 16, 2022Updated 3 years ago
- ☆11Jun 7, 2023Updated 3 years ago
- ☆16Nov 30, 2022Updated 3 years ago
- Official implementation of the paper "IteraTeR: Understanding Iterative Revision from Human-Written Text" (ACL 2022)☆82Nov 15, 2023Updated 2 years ago
- Python code to automatically produce a summary of a piece of text.☆11Sep 8, 2016Updated 9 years ago
- ☆15Feb 21, 2024Updated 2 years ago
- Merging Generated and Retrieved Knowledge for Open-Domain QA (EMNLP 2023)☆21Oct 8, 2023Updated 2 years ago
- ☆11Aug 15, 2023Updated 2 years ago
- Example of an Android app written in Qt/QML that uses MXNet for plant image recognition.☆10Nov 4, 2017Updated 8 years ago
- Some python scripts for drawing figures in scientific papers☆28Jun 26, 2019Updated 6 years ago
- Winner of NeurIPS 2021 student leaderboard. Self-bootstrapping Bayesian optimization for SCIP configuration using GNNs.☆14Oct 28, 2022Updated 3 years ago
- The Intermediate Goal of the project is to train a GPT like architecture to learn to summarise reddit posts from human preferences, as th…☆12Jul 14, 2021Updated 4 years ago
- machine translation data process tools☆10Apr 29, 2024Updated 2 years ago
- Active and Sample-Efficient Model Evaluation☆27May 22, 2025Updated last year
- A basic analyser for github repos that allows you to question the whole repo.☆17Mar 31, 2026Updated 2 months ago
- Privacy-Preserving Bandits (MLSys'20)☆22Dec 8, 2022Updated 3 years ago
- ☆20Jan 16, 2024Updated 2 years ago
- RENT (Reinforcement Learning via Entropy Minimization) is an unsupervised method for training reasoning LLMs.☆43Oct 31, 2025Updated 7 months ago
- [ACL'24] Code and data of paper "When is Tree Search Useful for LLM Planning? It Depends on the Discriminator"☆54Feb 23, 2024Updated 2 years ago
- An implementation of the paper "Solving the Rubik's Cube without Human Knowledge"☆14Dec 9, 2018Updated 7 years ago
- Source code for the paper "Exploiting Excessive Invariance caused by Norm-Bounded Adversarial Robustness"☆25Feb 12, 2020Updated 6 years ago
- {DeepL, Google, WMT-Best, davinci-003, turbo, gpt-4} × {En-De, En-Cs, En-Ru, En-Zh, De-Fr, En-Ja, Uk-En, Uk-Cs, En-Hr, En-Ha, En-Is}☆14Jun 18, 2023Updated 2 years ago
- This repository hosts my experiments for the project I did with OffNote Labs.☆10Apr 12, 2021Updated 5 years ago
- ☆15Sep 10, 2023Updated 2 years ago
- Reproduce Pindograma's articles!☆12Oct 5, 2022Updated 3 years ago
- Multimodal Transformers for biomedical text and Knowledge Graph data☆33Mar 3, 2023Updated 3 years ago
- The main headlines from Brazilian newspapers, every day!☆17Apr 24, 2024Updated 2 years ago
- An implementation of effective policy ensemble.☆16Jul 5, 2023Updated 2 years ago
- Pytorch implementation of BEAR in "Stabilizing Off-Policy Q-Learning via Bootstrapping Error Reduction"☆11Oct 29, 2019Updated 6 years ago
- Hierarchical Story Generation based on (https://arxiv.org/abs/1805.04833)☆11May 6, 2020Updated 6 years ago
- A Framework to Automatically Extract Indicators of Compromise (IoCs) from Twitter☆15Dec 9, 2019Updated 6 years ago