unit-mesh / unit-eval
UnitEval is a benchmarking and evaluation tools for AutoDev Coder.
☆12Updated last year
Alternatives and similar repositories for unit-eval:
Users that are interested in unit-eval are comparing it to the libraries listed below
- Unit Minions 的各种数据准备、处理脚本,诸如 OpenAI 处理、格式转换等等。☆14Updated 2 years ago
- TPO 是一个优化 LLM 输出文本的框架,通过迭代反馈和优化提示的方式来“微调模型”,而非直接调整模型的参数,使模型在推理过程中与人类偏好对齐以生成更好的结果。本项目提供了一个友好的 WebUI 来加载模型,实时优化基础模型并展示最佳结果。☆10Updated 2 months ago
- ☆16Updated 11 months ago
- OpenAI compatible API for open source LLMs☆15Updated last year
- Open-source examples and guides for building with the Qwen. Browse a collection of snippets, advanced techniques and walkthroughs.☆21Updated 5 months ago
- Real-time video understanding and interaction through text,audio,image and video with large multi-modal model. 利用多模态大模型的实时视频理解和交互框架,通过文本…☆23Updated last year
- Collection of model-centric MCP servers☆13Updated 2 weeks ago
- Exploration: using technology to aid people who lack both the ability to speak and fine motor control.☆20Updated 6 months ago
- A minimal LLM sales agent framework for sales agent fast deployment and benchmark. Support OpenAI models, Claude, HuggingFace models, Gem…☆17Updated 8 months ago
- Easy to use and open-source unknown stealer☆22Updated last year
- Probably one of the lightest native RAG + Agent apps out there,experience the power of Agent-powered models and Agent-driven knowledge ba…☆26Updated 2 weeks ago
- ☆10Updated last month
- smart chinese LLm☆18Updated last year
- ☆11Updated 3 months ago
- Reproducible Language Agent Research☆23Updated 2 months ago
- CodeAssist is an advanced code completion tool that provides high-quality code completions for Python, Java, C++ and so on. CodeAssist 是一…☆58Updated last year
- Code for "The Whole Truth and Nothing But the Truth: Faithful and Controllable Dialogue Response Generation with Dataflow Transduction an…☆10Updated last year
- Multi-threading, Concurrency, Asynchrony, and various Execution Methods implemented in a Rust backend for bleeding edge performance.☆12Updated 5 months ago
- Perform facts checks on your conversations with LLMs to catch fake-news, misleading information, and LLMs confusion.☆13Updated 2 years ago
- GPT-4 powered code tool with no token limits. Works on repos or files. Can cleanup, optimize, comment, convert languages and more☆11Updated 2 years ago
- Nexusflow function call, tool use, and agent benchmarks.☆19Updated 4 months ago
- 🚀 Time MCP Server: Giving LLMs Time Awareness Capabilities☆13Updated last month
- GPT+神器,简单实用的一站式AGI架构,内置本地化,LLM模型,agent,矢量数据库,智能链chain☆48Updated last year
- A highly contextualized retrieval system integrating Large Language Models (LLMs), embeddings, and a dynamic agent-driven framework. Supp…☆19Updated 3 months ago
- Multi-lingual large voice generation model, providing inference, training and deployment full-stack ability.☆12Updated 9 months ago
- Contains the model patches and the eval logs from the passing swe-bench-lite run.☆10Updated 10 months ago
- BMInf demos.☆14Updated 3 years ago
- AgentParse is a high-performance parsing library designed to map various structured data formats (such as Pydantic models, JSON, YAML, an…☆13Updated 2 weeks ago
- ☆37Updated 2 years ago
- OVALChat is a customizable Web app aimed at conducting user studies with chatbots☆28Updated last year