🚀 Text2Grad: Converting natural language feedback into gradient signals for precise model optimization. Revolutionizing RLHF with span-level rewards and targeted improvements across code generation, summarization, and Q&A tasks.
☆31Feb 6, 2026Updated last month
Alternatives and similar repositories for Text2Grad
Users that are interested in Text2Grad are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- LLMAD code☆24Oct 31, 2024Updated last year
- (NeurIPS 2025) SoMi-ToM: Evaluating Multi-Perspective Theory of Mind in Embodied Social Interactions☆34Nov 16, 2025Updated 4 months ago
- ☆17Oct 21, 2019Updated 6 years ago
- The official implementation of the paper "Self-Updatable Large Language Models by Integrating Context into Model Parameters"☆15May 18, 2025Updated 10 months ago
- Contains implementation of the DoubIL and ResiduIL algorithms from the ICML '22 paper Causal Imitation Learning under Temporally Correlat…☆11Dec 9, 2022Updated 3 years ago
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- ☆19Mar 25, 2025Updated last year
- Safe Model-Based RL HVAC Control Using Epistemic Uncertainty Estimation.☆11Feb 25, 2025Updated last year
- ☆10Apr 5, 2024Updated last year
- Improving scalability of RL algorithms using GNNs: A case study in optimal EV charging.☆26Oct 16, 2025Updated 5 months ago
- ☆16Jun 25, 2025Updated 9 months ago
- Unofficial implementation of Chain of Hindsight (https://arxiv.org/abs/2302.02676) using pytorch and huggingface Trainers.☆11Apr 5, 2023Updated 2 years ago
- This repository implements the model in MobiHoc'18: "Long-term mobile traffic forecasting using deep spatio-temporal neural networks"☆29Apr 4, 2019Updated 6 years ago
- ☆16Nov 4, 2021Updated 4 years ago
- Vision Large Language Models trained on M3IT instruction tuning dataset☆17Aug 16, 2023Updated 2 years ago
- Simple, predictable pricing with DigitalOcean hosting • AdAlways know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
- This repository contains code for the paper "Learning Decision Trees as Amortized Structure Inference"☆16Mar 25, 2025Updated last year
- Implementation for "Surrogate Losses for Online Learning of Stepsizes in Stochastic Non-Convex Optimization"☆10Aug 3, 2022Updated 3 years ago
- Source code for paper "On the Pareto Front of Multilingual Neural Machine Translation" @ NeurIPS 2023☆17Sep 27, 2023Updated 2 years ago
- Official implementation of our paper at ACL 2023: Pre-training Multi-party Dialogue Models with Latent Discourse Inference☆10Jul 10, 2023Updated 2 years ago
- Code for COLING 2022 paper "FactMix: Using a Few Labeled In-domain Examples to Generalize to Cross-domain Named Entity Recognition"☆15Jan 15, 2023Updated 3 years ago
- Code for paper "ProgGen: Generating Named Entity Recognition Datasets Step-by-step with Self-Reflexive Large Language Models"☆17Mar 29, 2024Updated last year
- Official resources of "The First Few Tokens Are All You Need: An Efficient and Effective Unsupervised Prefix Fine-Tuning Method for Reaso…☆20Jun 13, 2025Updated 9 months ago
- Convert CVXPY expressions to PyTorch expressions☆18Jul 8, 2025Updated 8 months ago
- Embodied-Planner-R1: Unleashing Embodied Task Planning Ability in LLMs via Reinforcement Learning☆25Jan 5, 2026Updated 2 months ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click and start building anything your business needs.
- A zero-shot faithfulness evaluation metric for text summarization☆11Oct 17, 2023Updated 2 years ago
- Fast and reliable solver for the Optimal Power Flow Problem☆14Dec 12, 2024Updated last year
- A Pytorch implementation for "Hierarchical Attention Network with Pairwise Loss for Chinese Zero Pronoun Resolution“ (AAAI 2020).☆10Dec 10, 2020Updated 5 years ago
- ☆28Jan 4, 2026Updated 2 months ago
- [ACL 2024] Predicting the Unpredictable: Uncertainty-Aware Reasoning over Temporal Knowledge Graphs via Diffusion Process☆18Oct 7, 2024Updated last year
- ☆12Feb 15, 2023Updated 3 years ago
- The official code implementation of the Autodiff algorithm.☆15Nov 10, 2023Updated 2 years ago
- Code for Invariant Policy Optimization☆15Jul 22, 2020Updated 5 years ago
- Aligning Agentic World Models via Knowledgeable Experience Learning☆32Jan 25, 2026Updated 2 months ago
- NordVPN Special Discount Offer • AdSave on top-rated NordVPN 1 or 2-year plans with secure browsing, privacy protection, and support for for all major platforms.
- 为准备2020年清华机计算机复试机试题而做的笔记☆11Apr 17, 2023Updated 2 years ago
- ☆14Oct 28, 2023Updated 2 years ago
- Gated Pretrained Transformer model for robust denoised sequence-to-sequence modelling☆10May 29, 2021Updated 4 years ago
- This is the repository for the resources in TACL 2022 Paper "Ultra-fine Entity Typing with Indirect Supervision from Natural Language Inf…☆14Aug 17, 2022Updated 3 years ago
- Code for NAACL 2025 paper "AdaCAD: Adaptively Decoding to Balance Conflicts between Contextual and Parametric Knowledge"☆17Mar 2, 2026Updated 3 weeks ago
- Diverse Demonstrations Improve In-context Compositional Generalization☆12Jul 7, 2023Updated 2 years ago
- APRIL: Active Partial Rollouts in Reinforcement Learning to Tame Long-tail Generation. A system-level optimization for scalable LLM tra…☆55Oct 11, 2025Updated 5 months ago