winstonsmith1897/GTPO

Group-relative Trajectory-based Policy Optimization: Increasing Quality and Training Stability

☆41

Alternatives and similar repositories for GTPO

Users who are interested in GTPO are comparing it to the libraries listed below.

slashml / awesome-finetuning
☆31 · Aug 27, 2024 · Updated last year
architehc / nanochat-rs-ternary
Production-ready ternary quantized (1.58-bit) Rust code generation model with mHC-lite, MaxRL training, and comprehensive benchmarking
☆21 · Mar 7, 2026 · Updated 4 months ago
Pleias / RL-Reasoning
Collection of resources for RL and Reasoning
☆27 · Feb 3, 2025 · Updated last year
porespellar / Zer0Fit
Zero-shot forecasting, tabular classification, and regression via MCP — exposes Google TimesFM 2.5 and TabFM v1.0.0 to AI assistants. Jus…
☆24 · Jul 12, 2026 · Updated last week
ThomasVuNguyen / MakeMe
Create 3D files in the CLI with Small Language Model
☆44 · Oct 15, 2025 · Updated 9 months ago
thad0ctor / KrunchWrapper
☆18 · Jul 1, 2025 · Updated last year
Pavankunchala / Reinforcement-learning-with-verifable-rewards-Learnings
RLVR Testing and Training
☆22 · Aug 28, 2025 · Updated 10 months ago
juzi5201314 / RepoExplainer
An AI tool designed to generate explanations for every file in a project
☆15 · Mar 7, 2025 · Updated last year
pradyGn / zoof
Zoof is a high-efficiency Small Language Model (SLM) engineered from scratch. It demonstrates how modern architectural choices and high-q…
☆47 · Jan 13, 2026 · Updated 6 months ago
MaggotHATE / Llama_chat
A chat UI for Llama.cpp
☆16 · Jun 4, 2026 · Updated last month
pierrel55 / llama_st
Load and run Llama from safetensors files in C
☆15 · Oct 24, 2024 · Updated last year
ordavid-s / snmf-mlp-decomposition
☆15 · Jul 7, 2026 · Updated 2 weeks ago
marcopoli / LLaMAntino-3-ANITA
The 🌟ANITA project🌟 *(Advanced Natural-based interaction for the ITAlian language)* wants to provide Italian NLP researchers with an im…
☆24 · Sep 11, 2024 · Updated last year
NEUIR / CONAN
Code for our paper: "Building A Coding Assistant via Retrieval-Augmented Language Models"
☆10 · Nov 2, 2024 · Updated last year
EdAbati / outlines-haystack
Use `outlines` generators with Haystack.
☆14 · Updated this week
3-ark / Cognito-AI_Sidekick
Cognito: Supercharge your Chrome browser with AI. Guide, query, and control everything using natural language.
☆57 · Jun 19, 2026 · Updated last month
gruai / koifish
Sparse & quantized LLM training/inference/CPT/SFT/DPO
☆28 · Jul 7, 2026 · Updated 2 weeks ago
daniel3303 / StoryReasoning
Code for the paper: "StoryReasoning Dataset: Using Chain-of-Thought for Scene Understanding and Grounded Story Generation"
☆40 · May 16, 2025 · Updated last year
54rt1n / shardmerge
Using fourier interpolation to merge large language models
☆11 · Jul 11, 2026 · Updated last week
Geralt-Targaryen / MC-Evaluation
☆14 · May 21, 2024 · Updated 2 years ago
reinterpretcat / qwen3-rs
An educational Rust project for exporting and running inference on Qwen3 LLM family
☆44 · Aug 3, 2025 · Updated 11 months ago
CentML / lorafusion
LoRAFusion: Efficient LoRA Fine-Tuning for LLMs
☆28 · Jul 2, 2026 · Updated 3 weeks ago
lucidrains / vit-arc-slot
Explorations into improving ViTArc with Slot Attention
☆43 · Oct 19, 2024 · Updated last year
Zhayr1 / bitmamba.cpp
Ultra-lightweight C++ inference engine for BitMamba-2 (1.58-bit SSM). Runs 1B models on consumer CPUs at 50+ tok/s using <700MB RAM. No h…
☆21 · Jun 2, 2026 · Updated last month
Laszlobeer / Dungeo_ai_lan_play
this is a dungeon ai run locally that use your llm in the terminal with multiple players from 2 to 5
☆17 · Jan 25, 2026 · Updated 5 months ago
ShuHuang / chemdatawriter
ChemDataWriter is a transformer-based library for automatically generating research books in the chemistry area.
☆13 · Oct 7, 2023 · Updated 2 years ago
dejan94it / cc_Rtools
This plugin allows the Cheshire Cat to use tools written in R language
☆10 · Dec 23, 2024 · Updated last year
Deveraux-Parker / Qwen3-Coder-30B-A3B-Monkey-Wrenches
Efforts toward giving Qwen 3 Coder 30B A3B proper agentic tool calling capabilities at or near 100% reliability.
☆63 · Aug 10, 2025 · Updated 11 months ago
timothelaborie / text_classification_scripts
Scripts for text classification with llama and bert
☆35 · Jul 23, 2025 · Updated last year
deadshot465 / novelcrafter-mcp
An experimental desktop client for using Claude Desktop's MCP with Novelcrafter codices.
☆11 · Dec 3, 2024 · Updated last year
TuanaCelik / unstructuredio-haystack
💙 Unstructured Data Connectors for Haystack 2.0
☆18 · Sep 21, 2023 · Updated 2 years ago
ArzelaAscoIi / haystack-github-bot
🤖 A GitHub bot that summarizes your actions throughout a day and lists all your achievements. Built with Haystack and OpenAI
☆16 · Apr 14, 2023 · Updated 3 years ago
oKatanaaa / kolibrify
Curriculum training of instruction-following LLMs with Unsloth
☆14 · Dec 15, 2025 · Updated 7 months ago
ankit-vaidya19 / Share
The Official PyTorch implementation of Shared LoRA Subspaces for almost Strict Continual Learning
☆33 · May 7, 2026 · Updated 2 months ago
anakin87 / who-killed-laura-palmer
Simple Question Answering system, based on data crawled from Twin Peaks Wiki. It is built using 🔍 Haystack, an awesome open-source frame…
☆11 · Jun 22, 2023 · Updated 3 years ago
danielscottjames / dominion
Benchmarking LLMs as Casual Card Game AIs
☆20 · Jan 22, 2025 · Updated last year
severian42 / Proteus-The-Genesis-LLM
Proteus is an experimental platform that combines the power of Large Language Models with the Genesis physics engine
☆25 · Dec 20, 2024 · Updated last year
LostRuins / datasetexplorer
Easily view and modify JSON datasets for large language models
☆90 · May 16, 2025 · Updated last year
tomsherborne / zx-parse
Zero-Shot Cross-Lingual Semantic Parsing (Sherborne & Lapata, ACL 2022)
☆17 · May 16, 2022 · Updated 4 years ago