hud-evals / hud-pythonLinks
OSS RL environment + evals toolkit
☆253Updated this week
Alternatives and similar repositories for hud-python
Users that are interested in hud-python are comparing it to the libraries listed below
Sorting:
- AgentTrace is a lightweight observability library to trace and evaluate agentic systems.☆38Updated 8 months ago
- Agent Markdown Language☆106Updated 3 weeks ago
- AGI SDK☆382Updated 2 weeks ago
- Enable LLMs to Program Themselves.☆630Updated 8 months ago
- EFFICIENT AND OPTIMIZED TOKENIZER ENGINE FOR LLM INFERENCE SERVING☆465Updated 3 months ago
- Official Magic UI MCP server.☆157Updated 6 months ago
- An encyclopedia of jailbreaking techniques to make AI models safer.☆532Updated 6 months ago
- Bindu: Turn any AI agent into a living microservice - interoperable, observable, composable.☆276Updated this week
- An open source benchmarking framework for IT automation☆304Updated last month
- [ICML 2025] The Diffusion Duality☆180Updated 2 months ago
- Training-Ready RL Environments + Evals☆190Updated last week
- Deep Research for crypto - free & fully local☆147Updated last month
- Timbal is an open-source python framework for building reliable AI applications, battle-tested in production with simple, transparent arc…☆40Updated this week
- The AI Browser Automation Framework☆356Updated this week
- ⚖️ Awesome LLM Judges ⚖️☆146Updated 7 months ago
- Leap: AI-powered educational animation generator☆62Updated 8 months ago
- Govern & Secure your AI☆432Updated this week
- Agentic AI System for Company Data Enrichment☆95Updated 2 months ago
- MCP-enabled AI conversation engine with MCTS analysis, FastAPI backend, and async operations for building advanced LLM applications☆45Updated 4 months ago
- ContextLoom is the shared "brain" for multi-agent systems. It weaves together memory threads from frameworks like DSPy and CrewAI into a …☆32Updated 2 weeks ago
- Dynamiq is an orchestration framework for agentic AI and LLM applications☆1,011Updated last week
- ☆136Updated 9 months ago
- Real-Time Detection of Hallucinated Entities in Long-Form Generation☆271Updated last month
- rl from zero pretrain, can it be done? yes.☆282Updated 2 months ago
- Open-source generalized AI agent for everyday task automations.☆447Updated 5 months ago
- j1-micro (1.7B) & j1-nano (600M) are absurdly tiny but mighty reward models.☆99Updated 5 months ago
- A comprehensive repository of reasoning tasks for LLMs (and beyond)☆453Updated last year
- ☆68Updated 6 months ago
- DimanDocs public repo☆22Updated last week
- AI benchmark runtime framework that allows you to integrate and evaluate AI tasks using Docker-based benchmarks.☆168Updated last week