pandada8 / llm-inference-benchmark
LLM inference service performance benchmark
☆44Dec 17, 2023Updated 2 years ago
Alternatives and similar repositories for llm-inference-benchmark
Users that are interested in llm-inference-benchmark are comparing it to the libraries listed below
- Survey of small language models☆18Jul 23, 2024Updated last year
- Batch-process data with large models; currently supports OCR with a range of large models, including Tongyi Qianwen (Qwen), Moonshot AI, Baidu PaddleOCR, OpenAI, and LLaVA. Use LLM to generate or clean data for academic use. Support OCR with qwen, m…☆16Sep 15, 2024Updated last year
- ☆11Feb 6, 2026Updated last week
- Knowledge-base-driven test case generation tool☆37Jun 11, 2025Updated 8 months ago
- Instruction fine-tuning of Qwen-1.5-7b with ModelScope + Transformers + SwanLab☆23Jun 9, 2024Updated last year
- Deploying Qwen2.5 with FastAPI + vLLM☆25Sep 29, 2024Updated last year
- This is the official implementation for our paper, "LAR: Look Around and Refer".☆30Dec 1, 2022Updated 3 years ago
- Large Language Model (LLM) powered evaluator for Retrieval Augmented Generation (RAG) pipelines.☆33Apr 29, 2024Updated last year
- ☆32Jul 2, 2025Updated 7 months ago
- A simple WeChat Official Account layout tool based on Dify☆16Jun 27, 2025Updated 7 months ago
- A Complete Introduction to Building Generative AI Apps with Dify☆17May 25, 2025Updated 8 months ago
- Tutorial on bypassing the firewall with the "little airplane" (小飞机) client☆24Nov 14, 2019Updated 6 years ago
- Workflow automation, but you just describe what you want and it happens.☆26Nov 22, 2025Updated 2 months ago
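- Large language model applications: RAG, NL2SQL, chatbots, pretraining, MoE (mixture-of-experts) models, fine-tuning, reinforcement learning, Tianchi data competitions☆75Feb 10, 2025Updated last year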
- Write database metadata into the Dify knowledge base☆12Dec 30, 2025Updated last month
- A full-stack AI-powered business intelligence tool for non-experts, featuring serverless backend processing and a secure Streamlit fronte…☆25Jan 6, 2026Updated last month
- 100 Production-Ready Claude Code Skills - The most comprehensive collection of AI skills for sales, business automation, content creation…☆33Oct 22, 2025Updated 3 months ago
- ☆28Dec 4, 2025Updated 2 months ago
- ☆11Aug 29, 2025Updated 5 months ago
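- Large language model evaluation platform supporting multiple evaluation benchmarks, custom datasets, and performance testing. Supports RAG evaluation on custom datasets.☆78Aug 20, 2025Updated 5 months ago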
- A distilled DeepSeek-R1 variant built on Qwen2.5-32B, fine-tuned with curated data for enhanced performance and efficiency. <metadata> gp…☆16Mar 11, 2025Updated 11 months ago
- ☆28Updated this week
- ☆12Jun 28, 2024Updated last year
- Java implementation for the Agent2Agent Protocol (A2A - https://github.com/google/A2A), enabling interaction between AI agents through a …☆11Apr 21, 2025Updated 9 months ago
- Use the knowledge graph generated by GraphRAG as the external knowledge base for the Dify workflow.☆20Jun 4, 2025Updated 8 months ago
- 知予 AI: From Learner to Researcher☆13Jan 20, 2025Updated last year
- This is a fork from Ryan Carson's AI Dev Tasks repository, with some code cleanup and refactoring to enable support for PostgreSQL databa…☆15Sep 8, 2025Updated 5 months ago
- 🤖AI Agents for Financial Trading💰: LLM-Driven Stock Prediction & Investment Recommendation System☆13Apr 14, 2025Updated 10 months ago
- ☆10Apr 30, 2025Updated 9 months ago
- Dify knowledge base retrieval tool☆13Apr 3, 2025Updated 10 months ago
- In this programming assignment you will implement a streaming video server and client that communicate control commands via the Real-Time…☆11Dec 29, 2012Updated 13 years ago
- Python Telegraph API.☆15Mar 22, 2025Updated 10 months ago
- ☆28Jun 27, 2025Updated 7 months ago
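- LangReact is a configuration-driven Planning Agent application development tool that uses configuration and plugins to quickly add Planning capabilities to your GPT application.☆12Apr 23, 2024Updated last year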
- A small framework to benchmark forecasting models via backtesting☆13Nov 25, 2023Updated 2 years ago
- A multi-agent framework to help with your homework.☆10Mar 1, 2025Updated 11 months ago
- ☆10Dec 29, 2023Updated 2 years ago
- An SSH plugin for Dify☆12Jan 16, 2026Updated 3 weeks ago
- A simple ping-pong buffer test☆12Feb 11, 2015Updated 11 years ago