All-Hands-AI / openhands-resolver
A system that tries to resolve all issues on a GitHub repo with OpenHands.
☆74Updated this week
Related projects
Alternatives and complementary repositories for openhands-resolver
- Python SDK for experimenting, testing, evaluating & monitoring LLM-powered applications - Parea AI (YC S23)☆74Updated 2 months ago
- Tutorial for building an LLM router☆159Updated 3 months ago
- Just a bunch of benchmark logs for different LLMs☆114Updated 3 months ago
- A toolkit for building multimodal AI agents☆108Updated 2 weeks ago
- RAG example using DSPy, Gradio, FastAPI☆64Updated 7 months ago
- ReDel is a toolkit for researchers and developers to build, iterate on, and analyze recursive multi-agent systems. (EMNLP 2024 Demo)☆62Updated this week
- An automated tool for discovering insights from research paper corpora☆135Updated 5 months ago
- 🤖 Headless IDE for AI agents☆129Updated this week
- Doing simple retrieval from LLM models at various context lengths to measure accuracy☆97Updated 7 months ago
- ☆100Updated 3 weeks ago
- 🚀 A list of Haystack Integrations, maintained by the community or deepset.☆62Updated last week
- Collection of recipes aiding Gen AI model development☆83Updated this week
- Official homepage for "Self-Harmonized Chain of Thought"☆83Updated last month
- Initiative to evaluate and rank the most popular LLMs across common task types based on their propensity to hallucinate.☆100Updated 2 months ago
- Build hours code to share.☆129Updated this week
- Source code for our paper: "SelfGoal: Your Language Agents Already Know How to Achieve High-level Goals".☆64Updated 4 months ago
- ☆61Updated 3 weeks ago
- ☆30Updated 4 months ago
- Automating enterprise workflows with multimodal agents☆94Updated last month
- Python SDK for Llama Stack☆68Updated this week
- Finetune Llama-3-8b on the MathInstruct dataset☆97Updated 3 weeks ago
- LangChain chat model abstractions for dynamic failover, load balancing, chaos engineering, and more!☆79Updated 9 months ago
- Dynamic Metadata based RAG Framework☆71Updated 3 months ago
- Framework for building, orchestrating and deploying multi-agent systems. Managed by OpenAI Solutions team. Experimental framework.☆77Updated last month
- Synthetic Data for LLM Fine-Tuning☆93Updated 11 months ago
- A new benchmark for measuring LLMs' ability to detect bugs in large codebases.☆27Updated 5 months ago
- Red-Teaming Language Models with DSPy☆142Updated 7 months ago
- Sphynx Hallucination Induction☆47Updated 3 months ago
- Client Code Examples, Use Cases and Benchmarks for Enterprise h2oGPTe RAG-Based GenAI Platform☆81Updated 2 months ago
- Foyle is a copilot to help developers deploy and operate their applications.☆106Updated this week
- A subset of jailbreaks automatically discovered by the Haize Labs haizing suite.☆85Updated 5 months ago