Group-relative Trajectory-based Policy Optimization: Increasing Quality and Training Stability
☆40Feb 23, 2026Updated 2 months ago
Alternatives and similar repositories for GTPO
Users who are interested in GTPO are comparing it to the libraries listed below.
- The Official PyTorch implementation of Shared LoRA Subspaces for almost Strict Continual Learning☆30Mar 19, 2026Updated last month
- Curriculum training of instruction-following LLMs with Unsloth☆14Dec 15, 2025Updated 4 months ago
- Running LLMs against a sandbox airport to see if they can make the correct decisions in real time☆26Jul 22, 2025Updated 9 months ago
- Vector functions and indexing for SQLite☆10Mar 26, 2023Updated 3 years ago
- ROSA+: RWKV's ROSA implementation with fallback statistical predictor☆34Oct 13, 2025Updated 6 months ago
- Documentation:☆18May 2, 2025Updated last year
- ☆13Oct 5, 2025Updated 6 months ago
- ☆13Apr 17, 2024Updated 2 years ago
- An AI tool designed to generate explanations for every file in a project☆14Mar 7, 2025Updated last year
- Cleanai (https://github.com/willmil11/cleanai) except I'm making it in c now. Fast and clean from the start this time :)☆16Mar 6, 2026Updated last month
- Code for minimum-entropy coupling.☆33Jan 6, 2026Updated 3 months ago
- A web-app to explore topics using LLM (less typing and more clicks)☆67Mar 15, 2026Updated last month
- Reasoning-based Evaluation and Ranking of Translations.☆20Jul 18, 2025Updated 9 months ago
- NICE: Neurogenesis Inspired Contextual Encoding for Replay-free Class Incremental Learning☆28Jul 28, 2024Updated last year
- A Model Agnostic function to directly remove specified layers from the LLM☆10May 23, 2024Updated last year
- Correlation-aware Change-point Detection via Graph Neural Networks☆16Sep 28, 2020Updated 5 years ago
- 5X faster 60% less memory QLoRA finetuning☆21May 28, 2024Updated last year
- Mamba R1 represents a novel architecture that combines the efficiency of Mamba's state space models with the scalability of Mixture of Ex…☆25Oct 13, 2025Updated 6 months ago
- Multilingual Entity Linking model by BELA model☆12Jul 20, 2023Updated 2 years ago
- Open-sourced evaluation suite from the Monitoring Monitorability paper☆69Apr 22, 2026Updated last week
- Simple Tool Caller for llama.cpp☆11Aug 12, 2024Updated last year
- ☆31Aug 27, 2024Updated last year
- The AI Code Cartographer: A Prompt for Self-Generating Knowledge Graphs☆29Jan 4, 2026Updated 4 months ago
- A chat UI for Llama.cpp☆16Apr 20, 2026Updated 2 weeks ago
- High-Performance Text Deduplication Toolkit☆62Aug 25, 2025Updated 8 months ago
- EmbeDB is a small Python wrapper around LMDB built as key-value storage for embeddings.☆14Nov 4, 2022Updated 3 years ago
- High-performance Text-to-Speech server with OpenAI-compatible API, 8 voices, emotion tags, and modern web UI. Optimized for RTX GPUs.☆25May 16, 2025Updated 11 months ago
- Explorations into the proposal from the paper "Grokfast, Accelerated Grokking by Amplifying Slow Gradients"☆104Dec 22, 2024Updated last year
- A text-based, 5e-compatible RPG with an AI Dungeon Master that rolls real dice, tracks real stats, and plays by the rules. Built on the S…☆30Updated this week
- Official Implementation of "Transferring Inductive Biases Through Knowledge Distillation"☆15Jun 3, 2020Updated 5 years ago
- jQuery, React and Streamlit applications written by LLMs☆15Dec 24, 2023Updated 2 years ago
- ☆17Mar 28, 2025Updated last year
- [EMNLP2025] Remedy: Learning Machine Translation Evaluation from Human Preferences with Reward Modeling☆15Nov 20, 2025Updated 5 months ago
- ☆15Mar 18, 2026Updated last month
- ☆10Apr 22, 2024Updated 2 years ago
- Authenticated independently verifiable agent delegation.☆33Dec 17, 2025Updated 4 months ago
- Code for Accelerated Linearized Laplace Approximation for Bayesian Deep Learning (ELLA, NeurIPS 22')☆17Oct 12, 2022Updated 3 years ago
- A single repo with all scripts and utils to train / fine-tune the Mamba model with or without FIM☆61Apr 8, 2024Updated 2 years ago
- EfficientSAM + YOLO World base model for use with Autodistill.☆10Feb 21, 2024Updated 2 years ago