unit-mesh / unit-eval
UnitEval is a benchmarking and evaluation tool for AutoDev Coder.
☆12Updated last year
Alternatives and similar repositories for unit-eval
Users who are interested in unit-eval are comparing it to the libraries listed below.
- ☆11Updated last month
- ☆16Updated last year
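- TPO is a framework for optimizing LLM output text. It "fine-tunes the model" through iterative feedback and prompt optimization rather than by directly adjusting model parameters, aligning the model with human preferences during inference to produce better results. This project provides a friendly WebUI for loading models, optimizing a base model in real time, and displaying the best result.☆10Updated 4 months ago
- Various data preparation and processing scripts for Unit Minions, such as OpenAI processing, format conversion, and so on.☆14Updated 2 years ago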
- Empowering RAG with a versatile model-driven data interface for all-purpose applications!☆11Updated 9 months ago
- Exploration: using technology to aid people who lack both the ability to speak and fine motor control.☆21Updated 8 months ago
- Real-time video understanding and interaction through text, audio, image and video with a large multi-modal model. A real-time video understanding and interaction framework using large multimodal models, through text…☆23Updated last year
- OpenAI compatible API for open source LLMs☆15Updated last year
- Nexusflow function call, tool use, and agent benchmarks.☆20Updated 6 months ago
- Collection of model-centric MCP servers☆20Updated last month
- GPT+ power tool: a simple, practical one-stop AGI architecture with built-in localization, LLM models, agents, a vector database, and smart chains.☆48Updated last year
- Fast LLM Training CodeBase With dynamic strategy choosing [Deepspeed+Megatron+FlashAttention+CudaFusionKernel+Compiler];☆38Updated last year
- Yet another coding assistant powered by LLM.☆16Updated 9 months ago
- ☆11Updated last month
- Large-scale exact string matching tool☆17Updated 3 months ago
- Probably one of the lightest native RAG + Agent apps out there. Experience the power of Agent-powered models and Agent-driven knowledge ba…☆28Updated 3 weeks ago
- ☆11Updated 2 years ago
- Various LLM Benchmarks☆21Updated 3 weeks ago
- An Implementation of "Orca: Progressive Learning from Complex Explanation Traces of GPT-4"☆43Updated 8 months ago
- A minimal LLM sales agent framework for sales agent fast deployment and benchmark. Support OpenAI models, Claude, HuggingFace models, Gem…☆18Updated 9 months ago
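- agentcp is an Agent SDK based on the ACP protocol that addresses identity authentication and communication between Agents. It is used to create AIDs, connect to the network, build sessions, and send and receive messages. It supports multi-Agent collaboration, asynchronous message handling, NAT traversal, and load balancing for Agent access.☆12Updated last month
- kimi-chat test data☆7Updated last year
- Open-source examples and guides for building with Qwen. Browse a collection of snippets, advanced techniques and walkthroughs.☆21Updated 7 months ago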
- Contains the model patches and the eval logs from the passing swe-bench-lite run.☆10Updated 11 months ago
- OVALChat is a customizable Web app aimed at conducting user studies with chatbots☆28Updated last year
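- UnitGen is a data framework for generating code fine-tuning data directly from your codebase: code completion, test generation, documentation generation, and more. UnitGen is a code fine-tuning data framework that generates data from your ex…☆56Updated 11 months ago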
- AgileGen: Empowering Agile-Based Generative Software Development through Human-AI Teamwork (accepted by ACM TOSEM)☆23Updated 7 months ago
- Easy to use and open-source unknown stealer☆22Updated last year
- AGM: an AI gene-map model that explores the inner workings of AI models and GPT/LLM large models from the perspective of token-weight particles.☆28Updated last year
- ☆12Updated 11 months ago