This repository contains some of the code used in the paper "Training Language Models with Langauge Feedback at Scale"
☆27Mar 30, 2023Updated 2 years ago
Alternatives and similar repositories for imitation_learning_from_language_feedback
Users that are interested in imitation_learning_from_language_feedback are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Code for EMNLP 2022 Paper: On the Calibration of Massively Multilingual Language Models☆15Jun 12, 2023Updated 2 years ago
- [ICLR 2022] "Bayesian Modeling and Uncertainty Quantification for Learning to Optimize: What, Why, and How" by Yuning You, Yue Cao, Tianl…☆14Aug 19, 2022Updated 3 years ago
- ☆11Oct 3, 2021Updated 4 years ago
- A publishing website of a table collecting meta-learning-related papers in the area of human language processing.☆17Aug 2, 2021Updated 4 years ago
- ☆16Mar 25, 2022Updated 4 years ago
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- Code for NAACL 2025 paper "AdaCAD: Adaptively Decoding to Balance Conflicts between Contextual and Parametric Knowledge"☆17Mar 2, 2026Updated 3 weeks ago
- GCN and BERT for relation extraction☆18Jun 29, 2020Updated 5 years ago
- [NAACL 2022] This is the code repo for our paper `ACTUNE: Uncertainty-based Active Self-Training for Active Fine-Tuning of Pretrained Lan…☆15Nov 16, 2022Updated 3 years ago
- ☆11Jun 7, 2023Updated 2 years ago
- Official implementation of the paper "IteraTeR: Understanding Iterative Revision from Human-Written Text" (ACL 2022)☆80Nov 15, 2023Updated 2 years ago
- ☆26Nov 21, 2022Updated 3 years ago
- In-Context Learning User Simulators for Task-Oriented Dialog Systems☆30Jun 2, 2023Updated 2 years ago
- ☆15Feb 21, 2024Updated 2 years ago
- Merging Generated and Retrieved Knowledge for Open-Domain QA (EMNLP 2023)☆22Oct 8, 2023Updated 2 years ago
- End-to-end encrypted email - Proton Mail • AdSpecial offer: 40% Off Yearly / 80% Off First Month. All Proton services are open source and independently audited for security.
- ☆12Aug 15, 2023Updated 2 years ago
- Example of android app written in Qt/Qml which uses MXNet for plant image recognition.☆10Nov 4, 2017Updated 8 years ago
- Some python scripts for drawing figures in scientific papers☆28Jun 26, 2019Updated 6 years ago
- TPLinker: Single-stage Joint Extraction of Entities and Relations Through Token Pair Linking☆18Apr 15, 2021Updated 4 years ago
- Winner of NeurIPS 2021 student leaderboard. Self-bootstrapping bayesian optimization for SCIP configuration using GNNs.☆13Oct 28, 2022Updated 3 years ago
- The Intermediate Goal of the project is to train a GPT like architecture to learn to summarise reddit posts from human preferences, as th…☆12Jul 14, 2021Updated 4 years ago
- machine translation data process tools☆10Apr 29, 2024Updated last year
- Active and Sample-Efficient Model Evaluation☆27May 22, 2025Updated 10 months ago
- Privacy-Preserving Bandits (MLSys'20)☆22Dec 8, 2022Updated 3 years ago
- End-to-end encrypted cloud storage - Proton Drive • AdSpecial offer: 40% Off Yearly / 80% Off First Month. Protect your most important files, photos, and documents from prying eyes.
- ☆20Jan 16, 2024Updated 2 years ago
- RENT (Reinforcement Learning via Entropy Minimization) is an unsupervised method for training reasoning LLMs.☆43Oct 31, 2025Updated 4 months ago
- [ACL'24] Code and data of paper "When is Tree Search Useful for LLM Planning? It Depends on the Discriminator"☆54Feb 23, 2024Updated 2 years ago
- KnowMAN: Weakly Supervised Multinomial Adversarial Networks☆12Nov 9, 2021Updated 4 years ago
- An implementation of the paper "Solving the Rubik's Cube without Human Knowledge"☆14Dec 9, 2018Updated 7 years ago
- {DeepL, Google, WMT-Best, davinci-003, turbo, gpt-4} × {En-De, En-Cs, En-Ru, En-Zh, De-Fr, En-Ja, Uk-En, Uk-Cs, En-Hr, En-Ha, En-Is}☆14Jun 18, 2023Updated 2 years ago
- This repositary hosts my experiments for the project, I did with OffNote Labs.☆10Apr 12, 2021Updated 4 years ago
- ☆15Sep 10, 2023Updated 2 years ago
- Code of "Instruction Multi-Constraint Molecular Generation Using a Teacher-Student Large Language Model"☆14Jul 8, 2025Updated 8 months ago
- DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Multimodal Transformers for biomedical text and Knowledge Graph data☆34Mar 3, 2023Updated 3 years ago
- EMNLP 2024 | Style-Specific Neurons for Steering LLMs in Text Style Transfer☆13Mar 23, 2025Updated last year
- An implementation of effective policy ensemble.☆16Jul 5, 2023Updated 2 years ago
- A Framework to Automatically Extract Indicators of Compromise (IoCs) from Twitter☆16Dec 9, 2019Updated 6 years ago
- [ACL 2022] CLUES: A Benchmark for Learning Classifiers using Natural Language Explanations☆10Jun 5, 2022Updated 3 years ago
- ☆25Jun 10, 2025Updated 9 months ago
- [COLING 2022]: CommunityLM: Probing Partisan Worldviews from Language Models☆15Jan 31, 2023Updated 3 years ago