Fine-tune LLM agents with online reinforcement learning
☆1,250Mar 19, 2024Updated 2 years ago
Alternatives and similar repositories for LlamaGym
Users that are interested in LlamaGym are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Lamorel is a Python library designed for RL practitioners eager to use Large Language Models (LLMs).☆249Dec 11, 2025Updated 6 months ago
- The Open Source Memory Layer For Autonomous Agents☆2,624Oct 22, 2024Updated last year
- Large Action Model framework to develop AI Web Agents☆6,375Jan 21, 2025Updated last year
- ☆262Mar 27, 2024Updated 2 years ago
- ☆4,120Apr 15, 2026Updated 2 months ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- ☆750Apr 17, 2024Updated 2 years ago
- Code and data for "Lumos: Learning Agents with Unified Data, Modular Design, and Open-Source LLMs"☆477Mar 19, 2024Updated 2 years ago
- Structured Outputs☆13,984Jun 19, 2026Updated last week
- SoTA production-ready AI retrieval system. Agentic Retrieval-Augmented Generation (RAG) with a RESTful API.☆7,892Nov 7, 2025Updated 7 months ago
- Vision utilities for web interaction agents 👀☆1,764Nov 25, 2024Updated last year
- LLM Analytics☆714Oct 19, 2024Updated last year
- DSPy: The framework for programming—not prompting—language models☆35,310Jun 18, 2026Updated last week
- Go ahead and axolotl questions☆12,082Updated this week
- Agents Capable of Self-Editing Their Prompts / Python Code☆819Mar 15, 2024Updated 2 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Implementation of the training framework proposed in Self-Rewarding Language Model, from MetaAI☆1,410Apr 11, 2024Updated 2 years ago
- A guidance language for controlling large language models.☆21,519May 21, 2026Updated last month
- [EMNLP'23, ACL'24] To speed up LLMs' inference and enhance LLM's perceive of key information, compress the prompt and KV-Cache, which ach…☆6,353Apr 8, 2026Updated 2 months ago
- LLocalSearch is a completely locally running search aggregator using LLM Agents. The user can ask a question and the system will use a ch…☆5,955Mar 24, 2026Updated 3 months ago
- AICI: Prompts as (Wasm) Programs☆2,078Jan 22, 2025Updated last year
- GUI for selecting text files for concatenation and submission to LLMs☆186Nov 19, 2025Updated 7 months ago
- Seamlessly integrate LLMs as Python functions☆2,410Mar 11, 2026Updated 3 months ago
- Neum AI is a best-in-class framework to manage the creation and synchronization of vector embeddings at large scale.☆866Jan 15, 2024Updated 2 years ago
- Implementation of TWOSOME☆82Jan 11, 2025Updated last year
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- Platform for stateful agents: AI with advanced memory that can learn and self-improve over time.☆23,543Updated this week
- A language for constraint-guided and efficient LLM programming.☆4,188May 22, 2025Updated last year
- SWE-agent takes a GitHub issue and tries to automatically fix it, using your LM of choice. It can also be employed for offensive cybersec…☆19,631Updated this week
- A Production-ready Reinforcement Learning AI Agent Library brought by the Applied Reinforcement Learning team at Meta.☆3,007Jun 9, 2026Updated 2 weeks ago
- Train transformer language models with reinforcement learning.☆18,701Updated this week
- A Bulletproof Way to Generate Structured JSON from Language Models☆4,929Feb 24, 2024Updated 2 years ago
- Voice + Vision powered AI assistant that answers questions about any application, in context and in audio.☆1,159Dec 21, 2023Updated 2 years ago
- A project structure aware autonomous software engineer aiming for autonomous program improvement. Resolved 37.3% tasks (pass@1) in SWE-be…☆3,087Apr 24, 2025Updated last year
- Tools for merging pretrained large language models.☆7,173Jun 17, 2026Updated last week
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- TinyGPT-V: Efficient Multimodal Large Language Model via Small Backbones☆1,315Feb 5, 2026Updated 4 months ago
- 🐣🕐📅 A simple utility to draft scheduling emails.☆12Sep 13, 2023Updated 2 years ago
- Gorilla: Training and Evaluating LLMs for Function Calls (Tool Calls)☆12,921Apr 13, 2026Updated 2 months ago
- 🌸 Run LLMs at home, BitTorrent-style. Fine-tuning and inference up to 10x faster than offloading☆10,248Sep 7, 2024Updated last year
- structured outputs for llms☆13,210Updated this week
- Things you can do with the token embeddings of an LLM☆1,451Dec 1, 2025Updated 6 months ago
- ✨ Fully autonomous AI Agent that can perform complicated tasks and projects using terminal, browser, and editor.☆2,453Apr 29, 2024Updated 2 years ago