🚀 Text2Grad: Converting natural language feedback into gradient signals for precise model optimization. Revolutionizing RLHF with span-level rewards and targeted improvements across code generation, summarization, and Q&A tasks.
☆35Feb 6, 2026Updated 3 months ago
Alternatives and similar repositories for Text2Grad
Users that are interested in Text2Grad are comparing it to the libraries listed below.
- (NeurIPS 2025) SoMi-ToM: Evaluating Multi-Perspective Theory of Mind in Embodied Social Interactions☆34Nov 16, 2025Updated 6 months ago
- The official implementation of the paper "Self-Updatable Large Language Models by Integrating Context into Model Parameters"☆15May 18, 2025Updated last year
- PyTorch utilities for Optimal Power Flow☆30Mar 17, 2025Updated last year
- Improving scalability of RL algorithms using GNNs: A case study in optimal EV charging.☆30Oct 16, 2025Updated 7 months ago
- ☆16Jun 25, 2025Updated 11 months ago
- Source code of our paper "Focus on the Target’s Vocabulary: Masked Label Smoothing for Machine Translation" @ ACL 2022☆13Apr 13, 2022Updated 4 years ago
- Unofficial implementation of Chain of Hindsight (https://arxiv.org/abs/2302.02676) using pytorch and huggingface Trainers.☆11Apr 5, 2023Updated 3 years ago
- ☆15Nov 4, 2021Updated 4 years ago
- Vision Large Language Models trained on M3IT instruction tuning dataset☆17Aug 16, 2023Updated 2 years ago
- Implementation for "Surrogate Losses for Online Learning of Stepsizes in Stochastic Non-Convex Optimization"☆10Aug 3, 2022Updated 3 years ago
- This repo explores how to use AMR to address tasks that are difficult for LLMs☆13Jan 15, 2024Updated 2 years ago
- PyTorch implementation of Chinese multi-turn dialogue (CDial) based on GPT + NEZHA☆11Oct 22, 2022Updated 3 years ago
- Code for COLING 2022 paper "FactMix: Using a Few Labeled In-domain Examples to Generalize to Cross-domain Named Entity Recognition"☆15Jan 15, 2023Updated 3 years ago
- Official resources of "The First Few Tokens Are All You Need: An Efficient and Effective Unsupervised Prefix Fine-Tuning Method for Reaso…☆21Jun 13, 2025Updated 11 months ago
- Embodied-Planner-R1: Unleashing Embodied Task Planning Ability in LLMs via Reinforcement Learning☆27Mar 30, 2026Updated last month
- A zero-shot faithfulness evaluation metric for text summarization☆11Oct 17, 2023Updated 2 years ago
- ☆40Dec 26, 2025Updated 5 months ago
- [ACL 2024] Predicting the Unpredictable: Uncertainty-Aware Reasoning over Temporal Knowledge Graphs via Diffusion Process☆21Oct 7, 2024Updated last year
- ☆15Oct 20, 2023Updated 2 years ago
- A Weighted GCN with Logical Adjacency Matrix for Relation Extraction (ECAI2020)☆14Jan 24, 2021Updated 5 years ago
- The official code implementation of the Autodiff algorithm.☆16Nov 10, 2023Updated 2 years ago
- Code for Invariant Policy Optimization☆15Jul 22, 2020Updated 5 years ago
- Notes made while preparing for the 2020 Tsinghua University computer science graduate re-examination programming test☆11Apr 17, 2023Updated 3 years ago
- ☆14Oct 28, 2023Updated 2 years ago
- ☆11Mar 19, 2026Updated 2 months ago
- Generative models and other stuff too, maybe, perhaps even probably☆16Dec 12, 2015Updated 10 years ago
- Gated Pretrained Transformer model for robust denoised sequence-to-sequence modelling☆10May 29, 2021Updated 4 years ago
- VS Code Clinical Quality Language Extension☆12May 19, 2026Updated last week
- Code for NAACL 2025 paper "AdaCAD: Adaptively Decoding to Balance Conflicts between Contextual and Parametric Knowledge"☆17Mar 2, 2026Updated 2 months ago
- ☆13Feb 15, 2023Updated 3 years ago
- APRIL: Active Partial Rollouts in Reinforcement Learning to Tame Long-tail Generation. A system-level optimization for scalable LLM tra…☆57Oct 11, 2025Updated 7 months ago
- Nested Named Entity Recognition for Chinese Biomedical Text☆12Jan 25, 2024Updated 2 years ago
- GRPO to train long-form QA and instructions with a long-form reward model☆17Jul 17, 2025Updated 10 months ago
- Zero-Shot Learning in Named Entity Recognition with Common Sense Knowledge☆17Nov 16, 2021Updated 4 years ago
- ROUTE: Robust Multitask Tuning and Collaboration for Text-to-SQL (ICLR 2025 Pytorch Code)☆17May 15, 2025Updated last year
- Templates and examples for ACL and EMNLP conference posters.☆14Oct 5, 2024Updated last year
- Source code for paper "ATP: AMRize Than Parse! Enhancing AMR Parsing with PseudoAMRs" @NAACL-2022☆15Mar 31, 2023Updated 3 years ago
- zlib with the build system replaced by zig☆15Apr 17, 2024Updated 2 years ago
- In-BoXBART: Get Instructions into Biomedical Multi-task Learning☆15Aug 23, 2022Updated 3 years ago