๐ Text2Grad: Converting natural language feedback into gradient signals for precise model optimization. Revolutionizing RLHF with span-level rewards and targeted improvements across code generation, summarization, and Q&A tasks.
โ28Feb 6, 2026Updated last month
Alternatives and similar repositories for Text2Grad
Users that are interested in Text2Grad are comparing it to the libraries listed below
Sorting:
- R for Data Science (2e) in Simplified Chineseโ21Dec 23, 2025Updated 2 months ago
- โ39Aug 6, 2025Updated 7 months ago
- Improving scalability of RL algorithms using GNNs: A case study in optimal EV charging.โ25Oct 16, 2025Updated 4 months ago
- Embodied-Planner-R1: Unleashing Embodied Task Planning Ability in LLMs via Reinforcement Learningโ25Jan 5, 2026Updated 2 months ago
- PDF Diff Viewer, a side-by-side, visual highlight, sync-scroll, PDF comparer, written in Python. Open source, mostly powered by PyMuPDF aโฆโ41Jan 31, 2026Updated last month
- โ12Dec 19, 2023Updated 2 years ago
- โ16Jun 25, 2025Updated 8 months ago
- Tool to convert '.com' Gaussian files into files supported by 3D rendering programs, such as Blender, Maya, and others.โ13Jan 15, 2026Updated last month
- grpo to train long form QA and instructions with long-form reward modelโ17Jul 17, 2025Updated 7 months ago
- Safe Model-Based RL HVAC Control Using Epistemic Uncertainty Estimation.โ11Feb 25, 2025Updated last year
- Contains implementation of the DoubIL and ResiduIL algorithms from the ICML '22 paper Causal Imitation Learning under Temporally Correlatโฆโ11Dec 9, 2022Updated 3 years ago
- ํ๊ตญ์ด ์์ค ํ ์คํธ๋ฅผ ์ํ ์์ฐ์ด์ฒ๋ฆฌ ๋ผ์ด๋ธ๋ฌ๋ฆฌ์ ๋๋ค. Natural Language Processing Library for Korean Literary Text. (Will be open in February, 2024)โ11Jan 16, 2024Updated 2 years ago
- Nested Named Entity Recognition for Chinese Biomedical Textโ11Jan 25, 2024Updated 2 years ago
- VS Code Clinical Quality Language Extensionโ11Feb 27, 2026Updated last week
- Uses LSTM-based autoencoders to detect abnormal resting heart rate during the coronavirus (SARS-CoV-2) infectious period using the wearabโฆโ11Jan 22, 2021Updated 5 years ago
- โ10Apr 5, 2024Updated last year
- Zig Vector Database!โ14Jan 30, 2026Updated last month
- Code for COLING 2022 paper "FactMix: Using a Few Labeled In-domain Examples to Generalize to Cross-domain Named Entity Recognition"โ15Jan 15, 2023Updated 3 years ago
- This is the official repo for our paper: "Generative Knowledge-Guided Retrieval System for Construction Disclosure Documents Reviewing"โ21Nov 17, 2025Updated 3 months ago
- Paper: โMEMRL: SELF-EVOLVING AGENTS VIA RUNTIME REINFORCEMENT LEARNING ON EPISODIC MEMORYโ Open-Source Codeโ36Feb 27, 2026Updated last week
- Source code of our paper "Focus on the Targetโs Vocabulary: Masked Label Smoothing for Machine Translation" @ ACL 2022โ13Apr 13, 2022Updated 3 years ago
- Official implementation of Recurrent Action Transformer with Memory, an offline RL agent with memory mechanisms. https://sites.google.comโฆโ18Nov 23, 2025Updated 3 months ago
- Official implementation of our paper at ACL 2023: Pre-training Multi-party Dialogue Models with Latent Discourse Inferenceโ10Jul 10, 2023Updated 2 years ago
- The official implementation of the paper "Self-Updatable Large Language Models by Integrating Context into Model Parameters"โ15May 18, 2025Updated 9 months ago
- Vehicle direction identification consists of three module detection , tracking and direction recognization.โ11Nov 13, 2021Updated 4 years ago
- Network Flows Optimization - Shortest Path, Max Flow and Min Cost Flow Algorithms in Pythonโ11Sep 13, 2019Updated 6 years ago
- Templates and examples for ACL and EMNLP conference posters.โ14Oct 5, 2024Updated last year
- โ19Jun 11, 2025Updated 8 months ago
- Dataset and baseline for Coling 2022 long paper (oral): "ConFiguRe: Exploring Discourse-level Chinese Figures of Speech"โ13Jul 27, 2023Updated 2 years ago
- โ13Feb 24, 2025Updated last year
- UAI paper 'Expressive Priors in Bayesian Neural Networks: Kernel Combinations and Periodic Functions'โ11Jun 26, 2019Updated 6 years ago
- โ11Feb 20, 2025Updated last year
- Unofficial implementation of Chain of Hindsight (https://arxiv.org/abs/2302.02676) using pytorch and huggingface Trainers.โ11Apr 5, 2023Updated 2 years ago
- A fast CUDA accelerated implementation for MVS evaluation.โ12Dec 1, 2022Updated 3 years ago
- LLMAD codeโ23Oct 31, 2024Updated last year
- Useful code for querying the NHSBSA Open Data Portal API.โ15Jun 28, 2022Updated 3 years ago
- Gated Pretrained Transformer model for robust denoised sequence-to-sequence modellingโ10May 29, 2021Updated 4 years ago
- Graph Data Processing with Cypher, published by Packtโ13Sep 27, 2023Updated 2 years ago
- โ26Jan 4, 2026Updated 2 months ago