tensorzero / llmgym
☆14Updated 2 weeks ago
Alternatives and similar repositories for llmgym
Users who are interested in llmgym are comparing it to the libraries listed below.
- Automated Capability Discovery via Foundation Model Self-Exploration☆55Updated 4 months ago
- ☆23Updated 7 months ago
- ☆15Updated this week
- Evolutionary Search for expert-level performance on any task with environmental feedback☆14Updated last year
- A structured framework for defining, verifying and certifying AI systems.☆13Updated 3 months ago
- Run computational experiments using marimo notebooks☆14Updated 2 weeks ago
- Framework for creating reliable LLM-based conversational agents☆45Updated this week
- 🛤️ Pathik - High-Performance Web Crawler ⚡☆26Updated 3 months ago
- The Swarm Ecosystem☆21Updated 10 months ago
- ☆16Updated 9 months ago
- An integration that allows Claude Desktop to interact with Spotify using the Model Context Protocol (MCP).☆12Updated last month
- A demo of Claude computer use playing Minecraft☆22Updated 8 months ago
- OMNI-EPIC: Open-endedness via Models of human Notions of Interestingness with Environments Programmed in Code (ICLR 2025).☆59Updated 6 months ago
- The official Python library for Formulaic☆16Updated last year
- Blueprint to Build Your Own Timeline Algorithm☆58Updated 3 weeks ago
- Intelligent Go-Explore: Standing on the Shoulders of Giant Foundation Models☆58Updated 4 months ago
- ML-Dev-Bench is a benchmark for evaluating AI agents against various ML development tasks.☆36Updated 2 weeks ago
- A framework that makes it effortless to convert any LLM into a reasoning agent like o1 or DeepSeek's R1☆21Updated this week
- Landing page + leaderboard for SWE-Bench benchmark☆6Updated this week
- ☆19Updated 10 months ago
- ☆14Updated 2 months ago
- ☆28Updated this week
- ☆13Updated last week
- GPT-4-based personalized arXiv paper assistant bot☆10Updated last year
- ☆17Updated last month
- anything you want can be built with morph cloud☆19Updated 2 months ago
- OLMost every training recipe you need to perform data interventions with the OLMo family of models.☆34Updated this week
- Simple orchestration for EC2 spot containers☆19Updated 9 months ago
- The developer starter pack for document processing☆15Updated this week
- coloring terminal text with intensities (used for plotting probability, entropy with tokens)☆12Updated 8 months ago