The official repo for the paper "Teacher Forcing Recovers Reward Functions for Text Generation"
☆31May 27, 2023Updated 2 years ago
Alternatives and similar repositories for LMReward
Users that are interested in LMReward are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- An official implementation for the EMNLP 2023 Findings paper "Prompt-Based Editing for Text Style Transfer"☆13Dec 9, 2023Updated 2 years ago
- Official implementation of the ACL 2022 paper "Learning Non-Autoregressive Models from Search for Unsupervised Sentence Summarization"☆14Dec 26, 2022Updated 3 years ago
- ☆22Feb 4, 2026Updated last month
- Virtual Adversarial Training (VAT) techniques in PyTorch☆17Jul 19, 2022Updated 3 years ago
- ☆11Nov 13, 2024Updated last year
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- Implementation of our paper "Self-training Sampling with Monolingual Data Uncertainty for Neural Machine Translation" to appear in ACL-20…☆31Jul 16, 2021Updated 4 years ago
- Loads OpenSubtitles v2018 dataset without having to load everything into memory at once. Works well with pytorch.☆13Aug 26, 2020Updated 5 years ago
- Structural Adapters in Pretrained Language Models for AMR-to-Text Generation (EMNLP 2021)☆29Mar 30, 2023Updated 3 years ago
- [ACL 2023] Contextual Distortion Reveals Constituency: Mask Language Models are Implicit Parsers.☆14Jun 3, 2023Updated 2 years ago
- ☆23Aug 14, 2023Updated 2 years ago
- A set of pre-trained word vectors for Persian language☆15Jul 19, 2023Updated 2 years ago
- Optimal forecast reconciliation with time series selection☆11Oct 23, 2024Updated last year
- Implementation of our paper "Exploiting Unsupervised Data for Emotion Recognition in Conversations" in the Findings of EMNLP-2020.☆13Nov 17, 2020Updated 5 years ago
- ☆15Jul 16, 2021Updated 4 years ago
- Simple, predictable pricing with DigitalOcean hosting • AdAlways know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
- Code for the TMLR 2023 paper "PPOCoder: Execution-based Code Generation using Deep Reinforcement Learning"☆117Jan 9, 2024Updated 2 years ago
- Guidelines for our secondary layer of annotation adding multi-sentence AMR links☆12Sep 6, 2017Updated 8 years ago
- source code of NAACL2021 "PCFGs Can Do Better: Inducing Probabilistic Context-Free Grammars with Many Symbols“ and ACL2021 main conferenc…☆52Mar 28, 2025Updated last year
- This is the official repository for the paper "Flora: Low-Rank Adapters Are Secretly Gradient Compressors" in ICML 2024.☆106Jul 1, 2024Updated last year
- This repository hosts the source code for the paper "ROCODE: Integrating Backtracking Mechanism and Program Analysis in Large Language Mo…☆16Dec 16, 2025Updated 3 months ago
- A general framework for univariate time series forecasting.☆10Apr 18, 2024Updated last year
- [ACL 2024] Can Watermarks Survive Translation? On the Cross-lingual Consistency of Text Watermark for Large Language Models☆42Jun 4, 2024Updated last year
- ☆19Sep 29, 2019Updated 6 years ago
- ☆13Oct 7, 2022Updated 3 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click and start building anything your business needs.
- Code publication to the paper "Normalized Attention Without Probability Cage"☆17Nov 9, 2021Updated 4 years ago
- ☆25Jun 10, 2025Updated 9 months ago
- Text Classification model deployment using FastAPI, Streamlit and Docker Compose☆14Feb 12, 2021Updated 5 years ago
- Implementation of Proximal Policy Optimization in Jax+Flax☆21May 18, 2023Updated 2 years ago
- ☆53Apr 9, 2025Updated 11 months ago
- ☆10Aug 3, 2023Updated 2 years ago
- Interpreting Sarcasm with Sentiment Based Monolingual Machine Translation☆11May 7, 2017Updated 8 years ago
- A novel approach to evaluating AI agents on diagnostic accuracy in symptom checking tasks.☆25Jul 10, 2025Updated 8 months ago
- Understanding and Improving Encoder Layer Fusion in Sequence-to-Sequence Learning (ICLR 2021)☆24Mar 18, 2021Updated 5 years ago
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- source code for NAACL2022 main conference "Dynamic Programming in Rank Space: Scaling Structured Inference with Low-Rank HMMs and PCFGs"☆10Sep 26, 2022Updated 3 years ago
- Learning Descriptive Image Captioning via Semipermeable Maximum Likelihood Estimation (NeurIPS 2023)☆22Oct 1, 2023Updated 2 years ago
- Hierarchical Context Tagger for utterance rewriting☆13Mar 27, 2022Updated 4 years ago
- ☆12Jun 18, 2024Updated last year
- Replicating O1 inference-time scaling laws☆93Dec 1, 2024Updated last year
- public repo for ESTER dataset and modeling (EMNLP'21)☆20Feb 2, 2022Updated 4 years ago
- Text paraphrasing tool☆12Aug 29, 2023Updated 2 years ago