unit-mesh / unit-evalLinks
UnitEval is a benchmarking and evaluation tools for AutoDev Coder.
☆13Updated 2 years ago
Alternatives and similar repositories for unit-eval
Users that are interested in unit-eval are comparing it to the libraries listed below
Sorting:
- ☆12Updated last year
- An Implementation of "Orca: Progressive Learning from Complex Explanation Traces of GPT-4"☆43Updated last year
- Moatless Testbeds allows you to create isolated testbed environments in a Kubernetes cluster where you can apply code changes through git…☆14Updated 9 months ago
- OVALChat is a customizable Web app aimed at conducting user studies with chatbots☆29Updated 2 years ago
- Nexusflow function call, tool use, and agent benchmarks.☆30Updated last year
- Data preparation code for CrystalCoder 7B LLM☆45Updated last year
- Contains the model patches and the eval logs from the passing swe-bench-lite run.☆10Updated last year
- ☆31Updated last year
- Open-source examples and guides for building with the Qwen. Browse a collection of snippets, advanced techniques and walkthroughs.☆37Updated last year
- ☆39Updated last year
- ☆11Updated last year
- Probably one of the lightest native RAG + Agent apps out there,experience the power of Agent-powered models and Agent-driven knowledge ba…☆32Updated 8 months ago
- Measuring RAG solutions throughput and latency☆19Updated last year
- Finetune any model on HF in less than 30 seconds☆56Updated last week
- A swarm of LLM agents that will help you test, document, and productionize your code!☆16Updated last week
- ☆16Updated last year
- Luann (fka TypeAgent) allows you to create many LLM based agent(Various types of agent,scale up)☆24Updated 2 months ago
- AgentParse is a high-performance parsing library designed to map various structured data formats (such as Pydantic models, JSON, YAML, an…☆18Updated 3 months ago
- My personal implementation of the model from "Qwen-VL: A Frontier Large Vision-Language Model with Versatile Abilities", they haven't rel…☆12Updated 2 years ago
- ☆18Updated last year
- Pre-training code for CrystalCoder 7B LLM☆57Updated last year
- ☆66Updated this week
- ☆44Updated last year
- Reproducible Language Agent Research☆33Updated 7 months ago
- Multi-Granularity LLM Debugger [ICSE2026]☆95Updated 7 months ago
- The Benefits of a Concise Chain of Thought on Problem Solving in Large Language Models☆24Updated last year
- ☆21Updated last year
- GPT-4 Level Conversational QA Trained In a Few Hours☆65Updated last year
- ☆18Updated last year
- ☆67Updated 10 months ago