microsoft/Text2Grad

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/microsoft/Text2Grad)

microsoft / Text2Grad

🚀 Text2Grad: Converting natural language feedback into gradient signals for precise model optimization. Revolutionizing RLHF with span-level rewards and targeted improvements across code generation, summarization, and Q&A tasks.

☆37

Alternatives and similar repositories for Text2Grad

Users that are interested in Text2Grad are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

microsoft / GUI-Agent-RL
View on GitHub
☆43Jul 2, 2026Updated 3 weeks ago
ablghtianyi / ICL_Modular_Arithmetic
View on GitHub
☆19Mar 25, 2025Updated last year
D2I-ai / Route
View on GitHub
ROUTE: Robust Multitask Tuning and Collaboration for Text-to-SQL (ICLR 2025 Pytorch Code)
☆16May 15, 2025Updated last year
TeaPearce / Expressive_Priors_in_BNNs
View on GitHub
UAI paper 'Expressive Priors in Bayesian Neural Networks: Kernel Combinations and Periodic Functions'
☆11Jun 26, 2019Updated 7 years ago
harvardnlp / lie-access-memory
View on GitHub
☆18Mar 5, 2017Updated 9 years ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
NJUNLP / R-PRM
View on GitHub
☆34Apr 1, 2025Updated last year
kxfan2002 / Reagent
View on GitHub
Agent-RRM: Exploring Reasoning Reward Model for Agents
☆70Mar 17, 2026Updated 4 months ago
jiacheng-ye / UANet
View on GitHub
Code for our EMNLP 2020 paper "Uncertainty-Aware Label Refinement for Sequence Labeling"
☆22Oct 4, 2020Updated 5 years ago
gkswamy98 / causal_il
View on GitHub
Contains implementation of the DoubIL and ResiduIL algorithms from the ICML '22 paper Causal Imitation Learning under Temporally Correlat…
☆11Dec 9, 2022Updated 3 years ago
ZJU-REAL / cooper
View on GitHub
☆29Aug 19, 2025Updated 11 months ago
da03 / WildVisualizer
View on GitHub
☆28Nov 19, 2025Updated 8 months ago
falcondai / pyrouge
View on GitHub
A Python wrapper for the ROUGE summarization evaluation package
☆14Aug 9, 2017Updated 8 years ago
zhangxy-2019 / critique-GRPO
View on GitHub
[ICML 2026 Spotlight] Critique-GRPO: Advancing LLM Reasoning with Natural Language and Numerical Feedback
☆70Jun 3, 2026Updated last month
pengwei-iie / GLHG
View on GitHub
☆12Mar 12, 2024Updated 2 years ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
LuckyyySTA / GOLF
View on GitHub
☆18Mar 16, 2026Updated 4 months ago
LinesHogan / tLLM
View on GitHub
tLLM is an test-time training extension of vLLM
☆45Apr 26, 2026Updated 3 months ago
cambridgeltl / multi3woz
View on GitHub
The official repository for Multi3WOZ: A Multilingual, Multi-Domain, Multi-Parallel Dataset for Training and Evaluating Culturally Adapte…
☆17Jan 15, 2024Updated 2 years ago
KFM135 / chiplet-optimizer
View on GitHub
This repository contains the code for this paper: Chiplet-Gym: An RL-based Optimization Framework for Chiplet-based AI Accelerator
☆22Sep 28, 2024Updated last year
xsunsim / AgentStockBenchmarkResults
View on GitHub
The daily arena where AI agents clash to rank tomorrow’s S&P 500 winners and losers. Their only judge is the future — the one truth that …
☆16Jun 16, 2026Updated last month
Edward-Sun / structured-nart
View on GitHub
☆15Dec 5, 2019Updated 6 years ago
elsa66666 / MentraSuite
View on GitHub
psychology reasoning llm
☆17Dec 16, 2025Updated 7 months ago
da03 / lightlda
View on GitHub
Distributed LDA, takes raw text as input and outputs topic word table.
☆17Apr 16, 2016Updated 10 years ago
THU-KEG / LRM-FactEval
View on GitHub
☆17Jun 25, 2025Updated last year
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
psunlpgroup / FoVer
View on GitHub
This repository includes code and materials for the paper "Efficient PRM Training Data Synthesis via Formal Verification" (ACL 2026 Findi…
☆19Apr 7, 2026Updated 3 months ago
WeijiaZhang24 / DCSurvival
View on GitHub
☆11Apr 5, 2024Updated 2 years ago
navidmdn / ESConv-SRA
View on GitHub
Code and data for the paper "Steering Conversational Large Language Models for Long Emotional Support Conversations" along with a UI to v…
☆15Apr 14, 2025Updated last year
HypherX / Evolution-Analysis
View on GitHub
☆25Dec 13, 2024Updated last year
thu-coai / VPO
View on GitHub
☆25Jul 20, 2025Updated last year
FSoft-AI4Code / SRank-CodeRanker
View on GitHub
[ACL 2024] Novel reranking method to select the best solutions for code generation
☆16Jun 9, 2024Updated 2 years ago
yuntian-group / interactive-training
View on GitHub
https://interactivetraining.ai/
☆18Jul 11, 2026Updated 2 weeks ago
byronBBL / Context-DPO
View on GitHub
Official repository of paper "Context-DPO: Aligning Language Models for Context-Faithfulness"
☆23Feb 17, 2025Updated last year
akashmondal1810 / UncertaintyEstimation
View on GitHub
Uncertainty Estimation Using Deep Neural Network and Gradient Boosting Methods
☆22Jun 1, 2021Updated 5 years ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
GFNOrg / dt-gfn
View on GitHub
This repository contains code for the paper "Learning Decision Trees as Amortized Structure Inference"
☆16Mar 25, 2025Updated last year
PKU-TANGENT / ConFiguRe
View on GitHub
Dataset and baseline for Coling 2022 long paper (oral): "ConFiguRe: Exploring Discourse-level Chinese Figures of Speech"
☆12Jul 27, 2023Updated 2 years ago
Sunmmyy / OTPR
View on GitHub
Score-Based Diffusion Policy Compatible with Reinforcement Learning via Optimal Transport
☆15Feb 26, 2025Updated last year
guaguakai / decision-focused-RL
View on GitHub
☆16Nov 4, 2021Updated 4 years ago
XinshuangL / SELF-PARAM
View on GitHub
The official implementation of the paper "Self-Updatable Large Language Models by Integrating Context into Model Parameters"
☆15May 18, 2025Updated last year
fdlm / Spaghetti
View on GitHub
Conditional Random Fields implemented as Lasagne layer
☆10Jul 22, 2016Updated 10 years ago
pkunlp-icler / MLS
View on GitHub
Source code of our paper "Focus on the Target’s Vocabulary: Masked Label Smoothing for Machine Translation" @ ACL 2022
☆13Apr 13, 2022Updated 4 years ago