unit-mesh / unit-eval
UnitEval is a benchmarking and evaluation tool for AutoDev Coder.
☆13Updated last year
Alternatives and similar repositories for unit-eval
Users who are interested in unit-eval are comparing it to the libraries listed below.
- ☆11Updated last year
- AgentParse is a high-performance parsing library designed to map various structured data formats (such as Pydantic models, JSON, YAML, an…☆17Updated 2 months ago
- Finetune any model on HF in less than 30 seconds☆56Updated 2 months ago
- My personal implementation of the model from "Qwen-VL: A Frontier Large Vision-Language Model with Versatile Abilities", they haven't rel…☆12Updated last year
- A swarm of LLM agents that will help you test, document, and productionize your code!☆17Updated 3 weeks ago
- Various agents from all of the top agent frameworks to integrate into swarms! Langchain, Griptape, CrewAI, and more!☆16Updated this week
- Measuring RAG solutions throughput and latency☆18Updated last year
- Open-source examples and guides for building with Qwen. Browse a collection of snippets, advanced techniques and walkthroughs.☆33Updated last year
- An Implementation of "Orca: Progressive Learning from Complex Explanation Traces of GPT-4"☆44Updated last year
- Data preparation code for CrystalCoder 7B LLM☆45Updated last year
- A suite of finetuned LLMs for atomically precise function calling 🧪☆17Updated 2 weeks ago
- Contains the model patches and the eval logs from the passing swe-bench-lite run.☆10Updated last year
- Probably one of the lightest native RAG + Agent apps out there, experience the power of Agent-powered models and Agent-driven knowledge ba…☆31Updated 6 months ago
- Moatless Testbeds allows you to create isolated testbed environments in a Kubernetes cluster where you can apply code changes through git…☆14Updated 8 months ago
- Solve Geometric & Graph Problems with Large Language Models☆32Updated 2 years ago
- ☆11Updated last year
- ☆16Updated last year
- My implementation of "Algorithm of Thoughts: Enhancing Exploration of Ideas in Large Language Models"☆99Updated 2 years ago
- ☆40Updated last year
- ☆74Updated last year
- Reproducible Language Agent Research☆31Updated 6 months ago
- Luann (fka TypeAgent) allows you to create many LLM-based agents (various types of agents, scale up)☆23Updated 3 weeks ago
- Nexusflow function call, tool use, and agent benchmarks.☆30Updated last year
- OVALChat is a customizable Web app aimed at conducting user studies with chatbots☆28Updated last year
- GPT-4 Level Conversational QA Trained In a Few Hours☆66Updated last year
- Pre-training code for CrystalCoder 7B LLM☆55Updated last year
- ☆31Updated last year
- Code generation with LLMs 🔗☆53Updated 2 years ago
- ☆18Updated 11 months ago
- A multimodal RAG application that enables semantic search on multimedia sources like audio, video and images☆41Updated 2 years ago