suyoumo / DeepClaude_BenchmarkLinks
This project is designed to evaluate the effectiveness of DeepClaude and other combination models.
☆41Updated 10 months ago
Alternatives and similar repositories for DeepClaude_Benchmark
Users that are interested in DeepClaude_Benchmark are comparing it to the libraries listed below
Sorting:
- DeepClaude Rust的升级版本☆208Updated 9 months ago
- Repo-level benchmark for real-world Code Agents: from repo understanding → env setup → incremental dev/bug-fixing → task delivery, with c…☆245Updated 4 months ago
- ☆293Updated 7 months ago
- LLM Rag Intelligent Q&A Robot☆86Updated 5 months ago
- Marco Search Agent for Realistic and Challenging Agentic Search☆240Updated 3 months ago
- A database operations and data analysis AI agent☆430Updated 5 months ago
- 一个用于分析创业公司数据的综合平台,包含爬虫系统、数据分析工具、创业评估AI模型、Web端和小程序端☆117Updated last month
- Code for "FaithLens: Detecting and Explaining Faithfulness Hallucination"☆97Updated last month
- Valuation of tokens corresponding to influential individuals on social platforms through AI algorithms☆229Updated 4 months ago
- An AI-powered multi-agent platform for automated investment research — combining LLM reasoning, RAG retrieval, and real-time market data …☆154Updated 2 months ago
- 电子赛博游戏xAutoGLM 📦 标准 Android APK 一键安装 (无需电脑/Root)。🏗 核心: 基于 Chaquopy 引擎,将 Python Agent 直接嵌入安卓原生进程。⚡️ 特性: 赛博朋克叙事 UI (Native AutoGLM Clien…☆171Updated this week
- EvaLearn is a pioneering benchmark designed to evaluate large language models (LLMs) on their learning capability and efficiency in chall…☆430Updated 2 weeks ago
- 🌐Web Agent Protocol (WAP) - Record and replay user interactions in the browser with MCP support☆490Updated 7 months ago
- JittorGeometric is a Jittor-based graph machine learning library.☆603Updated 5 months ago
- Dataset and evaluation code of ISDrama(ACM-MM 2025): Immersive Spatial Drama Generation through Multimodal Prompting☆236Updated 5 months ago
- 这是一个MCP客户端,让你轻松配置各个大模型,对接各种MCP Server而开发。This is an MCP client that allows you to easily configure various large models and develop inter…☆139Updated 3 months ago
- AI 笔试助手,解题助手,在编码笔试或面试时,借助AI实时提供解题思路和答案。A interview assistant that leverages AI to provide real-time solutions during coding interviews.☆266Updated 3 weeks ago
- ☆218Updated last week
- Open-source framework for automatic video annotation.☆38Updated 8 months ago
- ☆185Updated 6 months ago
- Fast and free zeroshot lipsync MCP server☆93Updated 8 months ago
- ☆38Updated 9 months ago
- 4th Place Solution for the Kaggle Competition: LMSYS - Chatbot Arena Human Preference Predictions☆170Updated last year
- 面向飞书聊天机器人的全功能AI服务器端实现,用一个容器,实现在飞书对话框里操作属于自己的Manus。☆512Updated last month
- 从0训练类 o1 大语言模型。☆132Updated 3 weeks ago
- Official code of the paper "MIND: Multi-rationale INtegrated Discriminative Reasoning Framework for Multi-modal Large Models"☆31Updated last month
- Link: https://kc-li.com/mytools , Weekly update trending AI apps.☆87Updated 3 weeks ago
- A toolkit enhances PyTorch with specialized functions for low-bit quantized neural networks.☆196Updated last year
- Siray ComfyUI Nodes☆86Updated last month
- (EMNLP 2025 Findings) Source Evaluation scripts for Humanity's Last Code Exam☆95Updated 5 months ago