unit-mesh / unit-eval
UnitEval is a benchmarking and evaluation tool for AutoDev Coder.
☆12 Updated last year
Alternatives and similar repositories for unit-eval
Users who are interested in unit-eval are comparing it to the libraries listed below.
- ☆40 Updated 11 months ago
- ☆31 Updated last year
- AgentParse is a high-performance parsing library designed to map various structured data formats (such as Pydantic models, JSON, YAML, an… ☆17 Updated last month
- ☆17 Updated 4 months ago
- Contains the model patches and the eval logs from the passing swe-bench-lite run. ☆10 Updated last year
- Data preparation code for CrystalCoder 7B LLM ☆45 Updated last year
- An Implementation of "Orca: Progressive Learning from Complex Explanation Traces of GPT-4" ☆44 Updated last year
- GPT-4 Level Conversational QA Trained In a Few Hours ☆66 Updated last year
- Nexusflow function call, tool use, and agent benchmarks. ☆30 Updated 11 months ago
- Reproducible Language Agent Research ☆31 Updated 5 months ago
- ☆11 Updated last year
- Finetune any model on HF in less than 30 seconds ☆56 Updated last month
- A swarm of LLM agents that will help you test, document, and productionize your code! ☆17 Updated last week
- Luann (fka TypeAgent) allows you to create many LLM-based agents (various types of agents, scaling up) ☆23 Updated last week
- Moatless Testbeds allows you to create isolated testbed environments in a Kubernetes cluster where you can apply code changes through git… ☆14 Updated 8 months ago
- ☆16 Updated last year
- ☆18 Updated last year
- ☆11 Updated last year
- Various agents from all of the top agent frameworks to integrate into swarms! Langchain, Griptape, CrewAI, and more! ☆16 Updated this week
- ☆21 Updated last year
- Welcome to AGI, the cutting-edge project dedicated to building the core components of Artificial General Intelligence. ☆11 Updated 3 weeks ago
- ☆26 Updated 3 weeks ago
- OVALChat is a customizable Web app aimed at conducting user studies with chatbots ☆28 Updated last year
- Measuring RAG solutions throughput and latency ☆18 Updated last year
- The Swarm Ecosystem ☆26 Updated last year
- NewsAgent is an enterprise-grade news aggregation agent designed to fetch, query, and summarize news from multiple sources at scale. ☆23 Updated last month
- Pre-training code for CrystalCoder 7B LLM ☆55 Updated last year
- ☆44 Updated last year
- Official Repo for The Paper "Talk Structurally, Act Hierarchically: A Collaborative Framework for LLM Multi-Agent Systems" ☆58 Updated 9 months ago
- Unleash the full potential of exascale LLMs on consumer-class GPUs, proven by extensive benchmarks, with no long-term adjustments and min… ☆26 Updated last year