Group-relative Trajectory-based Policy Optimization: Increasing Quality and Training Stability
☆39 · Feb 23, 2026 · Updated last month
Alternatives and similar repositories for GTPO
Users who are interested in GTPO are comparing it to the libraries listed below.
- A C++ framework for efficient training and fine-tuning of LLMs ☆27 · Mar 14, 2026 · Updated last week
- An experimental desktop client for using Claude Desktop's MCP with Novelcrafter codices. ☆10 · Dec 3, 2024 · Updated last year
- Running LLMs against a sandbox airport to see if they can make the correct decisions in real time ☆26 · Jul 22, 2025 · Updated 8 months ago
- Curriculum training of instruction-following LLMs with Unsloth ☆14 · Dec 15, 2025 · Updated 3 months ago
- Controllable Language Model Interactions in TypeScript ☆10 · May 17, 2024 · Updated last year
- ☆13 · Oct 5, 2025 · Updated 5 months ago
- An AI tool designed to generate explanations for every file in a project ☆14 · Mar 7, 2025 · Updated last year
- Cleanai (https://github.com/willmil11/cleanai) except I'm making it in C now. Fast and clean from the start this time :) ☆17 · Mar 6, 2026 · Updated 2 weeks ago
- Code for minimum-entropy coupling. ☆33 · Jan 6, 2026 · Updated 2 months ago
- NICE: Neurogenesis Inspired Contextual Encoding for Replay-free Class Incremental Learning ☆27 · Jul 28, 2024 · Updated last year
- ☆10 · Dec 17, 2020 · Updated 5 years ago
- Mamba R1 represents a novel architecture that combines the efficiency of Mamba's state space models with the scalability of Mixture of Ex… ☆25 · Oct 13, 2025 · Updated 5 months ago
- Reasoning-based Evaluation and Ranking of Translations. ☆20 · Jul 18, 2025 · Updated 8 months ago
- [EMNLP2022] Source code for Neural Machine Translation with Contrastive Translation Memories ☆12 · Feb 15, 2023 · Updated 3 years ago
- Implementation of a Hierarchical Mamba as described in the paper: "Hierarchical State Space Models for Continuous Sequence-to-Sequence Mo… ☆15 · Nov 11, 2024 · Updated last year
- Single-file, pure CUDA C implementation for running inference on Qwen3 0.6B GGUF. No dependencies. ☆23 · Nov 26, 2025 · Updated 3 months ago
- A blueprint for next-gen AI. Project Infinity uses a token-efficient, Codified Agent Protocol to create specialized, secure, and imaginat… ☆26 · Mar 13, 2026 · Updated last week
- Multilingual entity linking using the BELA model ☆12 · Jul 20, 2023 · Updated 2 years ago
- Automated LLM novelist ☆46 · Apr 11, 2024 · Updated last year
- [ICLR 2026] Generative View Stitching ☆106 · Nov 7, 2025 · Updated 4 months ago
- The AI Code Cartographer: A Prompt for Self-Generating Knowledge Graphs ☆29 · Jan 4, 2026 · Updated 2 months ago
- ☆17 · Mar 28, 2025 · Updated 11 months ago
- L2E llama2.c on Commodore C-64 ☆18 · Feb 22, 2025 · Updated last year
- A lightweight Python utility that aggregates and exports comprehensive system information to JSON, specifically designed for feeding syst… ☆13 · Apr 13, 2025 · Updated 11 months ago
- jQuery, React and Streamlit applications written by LLMs ☆16 · Dec 24, 2023 · Updated 2 years ago
- ☆10 · Apr 22, 2024 · Updated last year
- ☆15 · Mar 18, 2026 · Updated last week
- A single repo with all scripts and utils to train / fine-tune the Mamba model with or without FIM ☆61 · Apr 8, 2024 · Updated last year
- BitNet a4.8 implementation in a single PyTorch file ☆21 · Jan 13, 2025 · Updated last year
- Code for Accelerated Linearized Laplace Approximation for Bayesian Deep Learning (ELLA, NeurIPS '22) ☆17 · Oct 12, 2022 · Updated 3 years ago
- Docker setup for HF wav2vec2-sprint ☆13 · Mar 26, 2021 · Updated 4 years ago
- Code for "Discovering Non-monotonic Autoregressive Orderings with Variational Inference" (paper and code updated from ICLR 2021) ☆12 · Mar 7, 2024 · Updated 2 years ago
- Applies ROME and MEMIT on Mamba-S4 models ☆14 · Apr 5, 2024 · Updated last year
- Yumdocs is a template engine to automate Word, PowerPoint and Excel documents. ☆15 · Sep 23, 2025 · Updated 6 months ago
- ☆13 · Aug 20, 2021 · Updated 4 years ago
- ☆14 · Apr 29, 2025 · Updated 10 months ago
- Code for the NAACL 2024 HCI+NLP Workshop paper "LLMCheckup: Conversational Examination of Large Language Models via Interpretability Tool… ☆13 · Mar 24, 2024 · Updated 2 years ago
- Code to reproduce key results accompanying "SAEs (usually) Transfer Between Base and Chat Models" ☆13 · Jul 18, 2024 · Updated last year
- Stable Diffusion in TensorRT 8.5+ ☆14 · Mar 19, 2023 · Updated 3 years ago