KhoomeiK/LlamaGym

Fine-tune LLM agents with online reinforcement learning

☆1,252

Alternatives and similar repositories for LlamaGym

Users who are interested in LlamaGym are comparing it to the libraries listed below.

flowersteam / lamorel
Lamorel is a Python library designed for RL practitioners eager to use Large Language Models (LLMs).
☆249 • Dec 11, 2025 • Updated 7 months ago
kingjulio8238 / Memary
The Open Source Memory Layer For Autonomous Agents
☆2,635 • Oct 22, 2024 • Updated last year
lavague-ai / LaVague
Large Action Model framework to develop AI Web Agents
☆6,381 • Jan 21, 2025 • Updated last year
openai / transformer-debugger
☆4,117 • Apr 15, 2026 • Updated 3 months ago
allenai / lumos
Code and data for "Lumos: Learning Agents with Unified Data, Modular Design, and Open-Source LLMs"
☆478 • Mar 19, 2024 • Updated 2 years ago
bananaml / fructose
☆748 • Apr 17, 2024 • Updated 2 years ago
dottxt-ai / outlines
Structured Outputs
☆14,833 • Updated this week
reworkd / tarsier
Vision utilities for web interaction agents 👀
☆1,762 • Nov 25, 2024 • Updated last year
SciPhi-AI / R2R
SoTA production-ready AI retrieval system. Agentic Retrieval-Augmented Generation (RAG) with a RESTful API.
☆7,937 • Nov 7, 2025 • Updated 8 months ago
labmlai / inspectus
LLM Analytics
☆713 • Jul 8, 2026 • Updated last week
axolotl-ai-cloud / axolotl
Go ahead and axolotl questions
☆12,222 • Updated this week
lucidrains / self-rewarding-lm-pytorch
Implementation of the training framework proposed in Self-Rewarding Language Model, from MetaAI
☆1,411 • Apr 11, 2024 • Updated 2 years ago
aymenfurter / microagents
Agents Capable of Self-Editing Their Prompts / Python Code
☆824 • Mar 15, 2024 • Updated 2 years ago
guidance-ai / guidance
A guidance language for controlling large language models.
☆21,688 • May 21, 2026 • Updated 2 months ago
bramses / gpt-to-chatgpt-py
Convert a regular GPT call into a ChatGPT call
☆14 • Mar 2, 2023 • Updated 3 years ago
stanfordnlp / dspy
DSPy: The framework for programming—not prompting—language models
☆36,293 • Updated this week
nilsherzig / LLocalSearch
LLocalSearch is a completely locally running search aggregator using LLM Agents. The user can ask a question and the system will use a ch…
☆5,955 • Mar 24, 2026 • Updated 3 months ago
microsoft / LLMLingua
[EMNLP'23, ACL'24] To speed up LLMs' inference and enhance LLM's perceive of key information, compress the prompt and KV-Cache, which ach…
☆6,459 • Apr 8, 2026 • Updated 3 months ago
WeihaoTan / TWOSOME
Implementation of TWOSOME
☆82 • Jan 11, 2025 • Updated last year
SWE-agent / SWE-agent
SWE-agent takes a GitHub issue and tries to automatically fix it, using your LM of choice. It can also be employed for offensive cybersec…
☆19,867 • Updated this week
letta-ai / letta
Platform for stateful agents: AI with advanced memory that can learn and self-improve over time.
☆23,903 • Jul 3, 2026 • Updated 2 weeks ago
jackmpcollins / magentic
Seamlessly integrate LLMs as Python functions
☆2,413 • Mar 11, 2026 • Updated 4 months ago
banagale / FileKitty
GUI for selecting text files for concatenation and submission to LLMs
☆186 • Nov 19, 2025 • Updated 8 months ago
microsoft / aici
AICI: Prompts as (Wasm) Programs
☆2,077 • Jan 22, 2025 • Updated last year
huggingface / trl
Train transformer language models with reinforcement learning.
☆18,898 • Updated this week
1rgs / jsonformer
A Bulletproof Way to Generate Structured JSON from Language Models
☆4,932 • Feb 24, 2024 • Updated 2 years ago
eth-sri / lmql
A language for constraint-guided and efficient LLM programming.
☆4,202 • May 22, 2025 • Updated last year
facebookresearch / Pearl
A Production-ready Reinforcement Learning AI Agent Library brought by the Applied Reinforcement Learning team at Meta.
☆3,014 • Jul 11, 2026 • Updated last week
FanaHOVA / smol-scheduler
🐣🕐📅 A simple utility to draft scheduling emails.
☆12 • Sep 13, 2023 • Updated 2 years ago
NeumTry / NeumAI
Neum AI is a best-in-class framework to manage the creation and synchronization of vector embeddings at large scale.
☆864 • Jan 15, 2024 • Updated 2 years ago
DLYuanGod / TinyGPT-V
TinyGPT-V: Efficient Multimodal Large Language Model via Small Backbones
☆1,316 • Feb 5, 2026 • Updated 5 months ago
AutoCodeRoverSG / auto-code-rover
A project structure aware autonomous software engineer aiming for autonomous program improvement. Resolved 37.3% tasks (pass@1) in SWE-be…
☆3,095 • Apr 24, 2025 • Updated last year
arcee-ai / mergekit
Tools for merging pretrained large language models.
☆7,250 • Jun 17, 2026 • Updated last month
bigscience-workshop / petals
🌸 Run LLMs at home, BitTorrent-style. Fine-tuning and inference up to 10x faster than offloading
☆10,341 • Sep 7, 2024 • Updated last year
ShishirPatil / gorilla
Gorilla: Training and Evaluating LLMs for Function Calls (Tool Calls)
☆12,955 • Apr 13, 2026 • Updated 3 months ago
elfvingralf / macOSpilot-ai-assistant
Voice + Vision powered AI assistant that answers questions about any application, in context and in audio.
☆1,157 • Dec 21, 2023 • Updated 2 years ago
dleemiller / WordLlama
Things you can do with the token embeddings of an LLM
☆1,450 • Dec 1, 2025 • Updated 7 months ago
semanser / codel
✨ Fully autonomous AI Agent that can perform complicated tasks and projects using terminal, browser, and editor.
☆2,459 • Apr 29, 2024 • Updated 2 years ago
567-labs / instructor
structured outputs for llms
☆13,593 • Jul 13, 2026 • Updated last week