The official repo for the paper "Teacher Forcing Recovers Reward Functions for Text Generation"
☆31May 27, 2023Updated 2 years ago
Alternatives and similar repositories for LMReward
Users who are interested in LMReward are comparing it to the libraries listed below.
- An official implementation for the EMNLP 2023 Findings paper "Prompt-Based Editing for Text Style Transfer"☆13Dec 9, 2023Updated 2 years ago
- ☆22Feb 4, 2026Updated 3 months ago
- ☆12Jan 29, 2021Updated 5 years ago
- ☆16Aug 20, 2020Updated 5 years ago
- Normalized and modified version of Bijankhan corpus☆13Feb 21, 2023Updated 3 years ago
- ☆11Nov 13, 2024Updated last year
- ☆10Sep 18, 2021Updated 4 years ago
- The repo for paper: Exploiting the Index Gradients for Optimization-Based Jailbreaking on Large Language Models.☆14Dec 16, 2024Updated last year
- Implementation of our paper "Self-training Sampling with Monolingual Data Uncertainty for Neural Machine Translation" to appear in ACL-20…☆31Jul 16, 2021Updated 4 years ago
- Loads OpenSubtitles v2018 dataset without having to load everything into memory at once. Works well with pytorch.☆13Aug 26, 2020Updated 5 years ago
- Structural Adapters in Pretrained Language Models for AMR-to-Text Generation (EMNLP 2021)☆29Mar 30, 2023Updated 3 years ago
- [ACL 2023] Contextual Distortion Reveals Constituency: Mask Language Models are Implicit Parsers.☆14Jun 3, 2023Updated 2 years ago
- ☆23Aug 14, 2023Updated 2 years ago
- ☆16Nov 5, 2018Updated 7 years ago
- Implementation of our paper "Exploiting Unsupervised Data for Emotion Recognition in Conversations" in the Findings of EMNLP-2020.☆13Nov 17, 2020Updated 5 years ago
- Code for the TMLR 2023 paper "PPOCoder: Execution-based Code Generation using Deep Reinforcement Learning"☆116Jan 9, 2024Updated 2 years ago
- Guidelines for our secondary layer of annotation adding multi-sentence AMR links☆12Sep 6, 2017Updated 8 years ago
- ☆13Oct 4, 2022Updated 3 years ago
- Automatic code generator for training Reinforcement Learning policies☆11Jan 3, 2021Updated 5 years ago
- This repository hosts the source code for the paper "ROCODE: Integrating Backtracking Mechanism and Program Analysis in Large Language Mo…☆16Dec 16, 2025Updated 4 months ago
- ☆19Sep 29, 2019Updated 6 years ago
- Natural Universal Trigger Search (NUTS)☆21Apr 17, 2021Updated 5 years ago
- Text Classification model deployment using FastAPI, Streamlit and Docker Compose☆14Feb 12, 2021Updated 5 years ago
- Implementation of Proximal Policy Optimization in Jax+Flax☆21May 18, 2023Updated 2 years ago
- ☆10Aug 3, 2023Updated 2 years ago
- A novel approach to evaluating AI agents on diagnostic accuracy in symptom checking tasks.☆25Jul 10, 2025Updated 10 months ago
- [NeurIPS 2024] Code and Data Repo for Paper "Embedding Trajectory for Out-of-Distribution Detection in Mathematical Reasoning"☆28May 28, 2024Updated last year
- Understanding and Improving Encoder Layer Fusion in Sequence-to-Sequence Learning (ICLR 2021)☆24Mar 18, 2021Updated 5 years ago
- Source code for the NAACL 2022 main conference paper "Dynamic Programming in Rank Space: Scaling Structured Inference with Low-Rank HMMs and PCFGs"☆10Sep 26, 2022Updated 3 years ago
- ☆10May 26, 2022Updated 3 years ago
- Hierarchical Context Tagger for utterance rewriting☆13Mar 27, 2022Updated 4 years ago
- Unsupervised Cross-lingual Sentiment Analysis (CoNLL 2019)☆10Nov 4, 2019Updated 6 years ago
- ☆12Jun 18, 2024Updated last year
- Extracting Concise Bug-Fixing Patches from Human-Written Patches in Version Control Systems☆16Feb 21, 2023Updated 3 years ago
- Replicating O1 inference-time scaling laws☆93Dec 1, 2024Updated last year
- public repo for ESTER dataset and modeling (EMNLP'21)☆20Feb 2, 2022Updated 4 years ago
- Implementation of our paper "Data Rejuvenation: Exploiting Inactive Training Examples for Neural Machine Translation" in EMNLP-2020.☆23Aug 20, 2021Updated 4 years ago
- It is an open-source, educational, and Persian website on teaching "engineering probability and statistics" provided by volunteers withou…☆41Apr 3, 2024Updated 2 years ago
- ☆10May 20, 2019Updated 6 years ago