Fine-tunes a student LLM using teacher feedback for improved reasoning and answer quality. Implements GRPO with teacher-provided evaluations.
☆54May 7, 2025Updated 11 months ago
Alternatives and similar repositories for grpo-llm-evaluator
Users that are interested in grpo-llm-evaluator are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- this is based on the paper Chain-of-Retrieval Augmented Generation☆14Mar 29, 2025Updated last year
- ☆19Mar 10, 2025Updated last year
- Train and finutune text-to-speech models for Bengali and many other languages!☆18Apr 2, 2025Updated last year
- A simple one file python script that executes AI processes defined in YML.☆14Mar 26, 2023Updated 3 years ago
- Easy to deploy your LLM(large language model) server with no public address GPU machine.☆15Apr 30, 2024Updated last year
- Serverless GPU API endpoints on Runpod - Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- Optimizing Causal LMs through GRPO with weighted reward functions and automated hyperparameter tuning using Optuna☆60Oct 18, 2025Updated 6 months ago
- An easy-to-understand framework for LLM samplers that rewind and revise generated tokens☆150Jan 7, 2026Updated 3 months ago
- ☆15Jan 26, 2025Updated last year
- Local LLM Agent with Guidance☆13May 26, 2023Updated 2 years ago
- Interpreting Learned Search and Planning: Reverse-engineering recurrent convolutional networks (DRC) that play Sokoban☆19Jun 29, 2025Updated 9 months ago
- Seamless Voice Interactions with LLMs☆12Oct 28, 2023Updated 2 years ago
- Generating Easy-to-Understand Referring Expressions for Target Identifications☆18Aug 30, 2019Updated 6 years ago
- OpenCoconut implements a latent reasoning paradigm where we generate thoughts before decoding.☆175Jan 16, 2025Updated last year
- [ACL 2025] Knowledge Unlearning for Large Language Models☆49Sep 18, 2025Updated 7 months ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- Tcurtsni: Reverse Instruction Chat, ever wonder what your LLM wants to ask you?☆23Jun 25, 2024Updated last year
- Learning adapter weights from task descriptions☆19Nov 12, 2023Updated 2 years ago
- Tiny Agent: Production-Ready LLM Agent SDK for Every Developer☆38Sep 29, 2025Updated 6 months ago
- Hercules: Attributable and Scalable Opinion Summarization (ACL 2023)☆20Nov 8, 2023Updated 2 years ago
- ☆20Aug 1, 2024Updated last year
- A locally trained model of Stoney Nakoda has been developed and released. You can access the working model here or train your own instanc…☆10Oct 29, 2025Updated 5 months ago
- An introduction to LLM Sampling☆80Dec 15, 2024Updated last year
- minimal pytorch implementation of bm25 (with sparse tensors)☆104Oct 28, 2025Updated 5 months ago
- Official Project Page for HLA: Higher-order Linear Attention (https://arxiv.org/abs/2510.27258)☆48Jan 6, 2026Updated 3 months ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- A fast vector database written in C.☆32Apr 1, 2026Updated 2 weeks ago
- Modified Beam Search with periodical restart☆12Sep 12, 2024Updated last year
- Visualization of Haskell data structures☆16Feb 13, 2024Updated 2 years ago
- Exploring Applications of GRPO☆252Aug 25, 2025Updated 7 months ago
- Approximating the joint distribution of language models via MCTS☆22Nov 3, 2024Updated last year
- Explore training for quantized models☆26Jul 12, 2025Updated 9 months ago
- ☆28Dec 16, 2025Updated 4 months ago
- Simulation of job offers and CVs with real-time processing, classification, and analytics using Kafka, Ray, Spark, and Databricks. Includ…☆14Dec 25, 2024Updated last year
- ☆10Apr 25, 2024Updated last year
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- [AAAI26] LongLLaDA: Unlocking Long Context Capabilities in Diffusion LLMs☆56Dec 7, 2025Updated 4 months ago
- Entropy Based Sampling and Parallel CoT Decoding☆17Oct 9, 2024Updated last year
- Codebase for decoding compressed trust.☆26May 7, 2024Updated last year
- A web client for Linux from scratch in C for a variety of alternative web protocols☆17Nov 4, 2023Updated 2 years ago
- coded with and corrected by Google Anti-Gravity☆13Nov 23, 2025Updated 4 months ago
- Lego for GRPO☆30May 27, 2025Updated 10 months ago
- A browser based bitcoin miner☆16Dec 15, 2013Updated 12 years ago