KhoomeiK / LlamaGymView external linksLinks
Fine-tune LLM agents with online reinforcement learning
☆1,246Mar 19, 2024Updated last year
Alternatives and similar repositories for LlamaGym
Users that are interested in LlamaGym are comparing it to the libraries listed below
Sorting:
- Large Action Model framework to develop AI Web Agents☆6,295Jan 21, 2025Updated last year
- The Open Source Memory Layer For Autonomous Agents☆2,564Oct 22, 2024Updated last year
- SoTA production-ready AI retrieval system. Agentic Retrieval-Augmented Generation (RAG) with a RESTful API.☆7,673Nov 7, 2025Updated 3 months ago
- ☆748Apr 17, 2024Updated last year
- ☆263Mar 27, 2024Updated last year
- Structured Outputs☆13,403Feb 6, 2026Updated last week
- Vision utilities for web interaction agents 👀☆1,753Nov 25, 2024Updated last year
- ☆4,109Jun 4, 2024Updated last year
- Code and data for "Lumos: Learning Agents with Unified Data, Modular Design, and Open-Source LLMs"☆473Mar 19, 2024Updated last year
- LLM Analytics☆705Oct 19, 2024Updated last year
- Go ahead and axolotl questions☆11,289Updated this week
- DSPy: The framework for programming—not prompting—language models☆32,156Updated this week
- Agents Capable of Self-Editing Their Prompts / Python Code☆801Mar 15, 2024Updated last year
- [EMNLP'23, ACL'24] To speed up LLMs' inference and enhance LLM's perceive of key information, compress the prompt and KV-Cache, which ach…☆5,834Oct 28, 2025Updated 3 months ago
- AICI: Prompts as (Wasm) Programs☆2,061Jan 22, 2025Updated last year
- A guidance language for controlling large language models.☆21,270Feb 6, 2026Updated last week
- Seamlessly integrate LLMs as Python functions☆2,388Nov 24, 2025Updated 2 months ago
- Letta is the platform for building stateful agents: AI with advanced memory that can learn and self-improve over time.☆21,125Jan 29, 2026Updated 2 weeks ago
- A Bulletproof Way to Generate Structured JSON from Language Models☆4,902Feb 24, 2024Updated last year
- Neum AI is a best-in-class framework to manage the creation and synchronization of vector embeddings at large scale.☆867Jan 15, 2024Updated 2 years ago
- TinyGPT-V: Efficient Multimodal Large Language Model via Small Backbones☆1,307Feb 5, 2026Updated last week
- LLocalSearch is a completely locally running search aggregator using LLM Agents. The user can ask a question and the system will use a ch…☆5,959Dec 11, 2025Updated 2 months ago
- A language for constraint-guided and efficient LLM programming.☆4,148May 22, 2025Updated 8 months ago
- GUI for selecting text files for concatenation and submission to LLMs☆180Nov 19, 2025Updated 2 months ago
- Tools for merging pretrained large language models.☆6,783Jan 26, 2026Updated 3 weeks ago
- A project structure aware autonomous software engineer aiming for autonomous program improvement. Resolved 37.3% tasks (pass@1) in SWE-be…☆3,053Apr 24, 2025Updated 9 months ago
- 🌸 Run LLMs at home, BitTorrent-style. Fine-tuning and inference up to 10x faster than offloading☆9,930Sep 7, 2024Updated last year
- A Production-ready Reinforcement Learning AI Agent Library brought by the Applied Reinforcement Learning team at Meta.☆2,978Feb 6, 2026Updated last week
- SWE-agent takes a GitHub issue and tries to automatically fix it, using your LM of choice. It can also be employed for offensive cybersec…☆18,478Feb 10, 2026Updated last week
- Visualize the intermediate output of Mistral 7B☆384Jan 22, 2025Updated last year
- Things you can do with the token embeddings of an LLM☆1,454Dec 1, 2025Updated 2 months ago
- Voice + Vision powered AI assistant that answers questions about any application, in context and in audio.☆1,156Dec 21, 2023Updated 2 years ago
- Gorilla: Training and Evaluating LLMs for Function Calls (Tool Calls)☆12,717Feb 9, 2026Updated last week
- structured outputs for llms☆12,357Updated this week
- Llama 2 Everywhere (L2E)☆1,527Aug 27, 2025Updated 5 months ago
- A RAG LLM co-pilot for browsing the web, powered by local LLMs☆1,514Jan 26, 2025Updated last year
- RAG that intelligently adapts to your use case, data, and queries☆3,693Nov 1, 2025Updated 3 months ago
- Train transformer language models with reinforcement learning.☆17,360Updated this week
- AutoChain: Build lightweight, extensible, and testable LLM Agents☆1,865Dec 16, 2025Updated 2 months ago