Group-relative Trajectory-based Policy Optimization: Increasing Quality and Training Stability
☆41 · Feb 23, 2026 · Updated 3 months ago
Alternatives and similar repositories for GTPO
Users who are interested in GTPO are comparing it to the libraries listed below.
- ☆35 · Mar 6, 2026 · Updated 2 months ago
- An experimental desktop client for using Claude Desktop's MCP with Novelcrafter codices. ☆11 · Dec 3, 2024 · Updated last year
- Hill Space is All You Need ☆17 · Jul 11, 2025 · Updated 10 months ago
- Curriculum training of instruction-following LLMs with Unsloth ☆14 · Dec 15, 2025 · Updated 5 months ago
- Vector functions and indexing for SQLite ☆10 · Mar 26, 2023 · Updated 3 years ago
- ROSA+: RWKV's ROSA implementation with fallback statistical predictor ☆35 · Oct 13, 2025 · Updated 7 months ago
- This is an AI implementation (not official) of the DreamGym framework from the paper "Scaling Agent Learning via Experience Synthesis" (arXi… ☆41 · Nov 9, 2025 · Updated 6 months ago
- ☆15 · Apr 26, 2025 · Updated last year
- An AI tool designed to generate explanations for every file in a project ☆14 · Mar 7, 2025 · Updated last year
- Code for minimum-entropy coupling. ☆33 · Jan 6, 2026 · Updated 4 months ago
- A web-app to explore topics using LLM (less typing and more clicks) ☆67 · Mar 15, 2026 · Updated 2 months ago
- ☆10 · Dec 17, 2020 · Updated 5 years ago
- Reasoning-based Evaluation and Ranking of Translations. ☆20 · Jul 18, 2025 · Updated 10 months ago
- Machine translation with tinygrad ☆19 · Apr 7, 2024 · Updated 2 years ago
- A Model Agnostic function to directly remove specified layers from the LLM ☆10 · May 23, 2024 · Updated 2 years ago
- Single-file, pure CUDA C implementation for running inference on Qwen3 0.6B GGUF. No Dependencies. ☆24 · Nov 26, 2025 · Updated 5 months ago
- Mamba R1 represents a novel architecture that combines the efficiency of Mamba's state space models with the scalability of Mixture of Ex… ☆25 · Oct 13, 2025 · Updated 7 months ago
- Multilingual Entity Linking model by BELA model ☆12 · Jul 20, 2023 · Updated 2 years ago
- Implementation of a Hierarchical Mamba as described in the paper: "Hierarchical State Space Models for Continuous Sequence-to-Sequence Mo… ☆15 · Nov 11, 2024 · Updated last year
- [ICLR 2026] Generative View Stitching ☆108 · Nov 7, 2025 · Updated 6 months ago
- Open-sourced evaluation suite from the Monitoring Monitorability paper ☆75 · Apr 22, 2026 · Updated last month
- Can we estimate the economic impact of EIP-1559 on miners? This repository tries to estimate the loss of miners' revenue coming from transa… ☆13 · Mar 15, 2021 · Updated 5 years ago
- Simple Tool Caller for llama.cpp ☆11 · Aug 12, 2024 · Updated last year
- [EMNLP2022] Source code for Neural Machine Translation with Contrastive Translation Memories ☆12 · Feb 15, 2023 · Updated 3 years ago
- A framework for evaluating Machine Translation models. ☆12 · Apr 21, 2026 · Updated last month
- The RAG pipeline for optimizing dynamic data editing. ☆21 · Oct 30, 2025 · Updated 6 months ago
- High-Performance Text Deduplication Toolkit ☆61 · Aug 25, 2025 · Updated 9 months ago
- High-performance Text-to-Speech server with OpenAI-compatible API, 8 voices, emotion tags, and modern web UI. Optimized for RTX GPUs. ☆25 · May 16, 2025 · Updated last year
- L2E llama2.c on Commodore C-64 ☆18 · Feb 22, 2025 · Updated last year
- Official Implementation of "Transferring Inductive Biases Through Knowledge Distillation" ☆15 · Jun 3, 2020 · Updated 5 years ago
- This Streamlit application allows users to upload images and engage in interactive conversations about them using the Ollama Vision Model… ☆15 · Nov 11, 2024 · Updated last year
- ☆15 · Mar 18, 2026 · Updated 2 months ago
- ☆17 · Mar 28, 2025 · Updated last year
- [EMNLP2025] Remedy: Learning Machine Translation Evaluation from Human Preferences with Reward Modeling ☆17 · Nov 20, 2025 · Updated 6 months ago
- ☆10 · Apr 22, 2024 · Updated 2 years ago
- Yumdocs is a template engine to automate Word, PowerPoint and Excel documents. ☆15 · Sep 23, 2025 · Updated 8 months ago
- docker for HF wav2vec2-sprint ☆13 · Mar 26, 2021 · Updated 5 years ago
- Applies ROME and MEMIT on Mamba-S4 models ☆15 · Apr 5, 2024 · Updated 2 years ago
- Code for "Discovering Non-monotonic Autoregressive Orderings with Variational Inference" (paper and code updated from ICLR 2021) ☆12 · Mar 7, 2024 · Updated 2 years ago