suyoumo / DeepClaude_BenchmarkLinks
This project is designed to evaluate the effectiveness of DeepClaude and other combination models.
☆41Updated 9 months ago
Alternatives and similar repositories for DeepClaude_Benchmark
Users that are interested in DeepClaude_Benchmark are comparing it to the libraries listed below
Sorting:
- DeepClaude Rust的升级版本☆208Updated 9 months ago
- LLM Rag Intelligent Q&A Robot☆84Updated 4 months ago
- A database operations and data analysis AI agent☆431Updated 4 months ago
- Repo-level benchmark for real-world Code Agents: from repo understanding → env setup → incremental dev/bug-fixing → task delivery, with c…☆243Updated 3 months ago
- Code for "FaithLens: Detecting and Explaining Faithfulness Hallucination"☆95Updated last week
- A powerful multi-format file parsing, data cleaning, and AI annotation toolkit.☆144Updated last month
- 本项目是一款结合15 个主流平台的 26 个榜单实时数据与大模型分析能力的舆情分析助手。通过前端页面,用户可实现对话式热搜榜单查询、特定主题搜索、话题聚类分析及情感倾向分析。系统支持快捷键控制爬虫启停、多平台数据快速查询与跳转,并能基于新闻详情页内容(即使是视频信息也能挖掘…☆41Updated last week
- EvaLearn is a pioneering benchmark designed to evaluate large language models (LLMs) on their learning capability and efficiency in chall…☆431Updated 3 months ago
- ☆293Updated 6 months ago
- DPO-Shift: Shifting the Distribution of Direct Preference Optimization☆59Updated 10 months ago
- 一个用于分析创业公司数据的综合平台,包含爬虫系统、数据分析工具、创业评估AI模型、Web端和小程序端☆117Updated last week
- ☆124Updated last week
- ☆223Updated 2 weeks ago
- AI 笔试助手,解题助手,在编码笔试或面试时,借助AI实时提供解题思路和答案。A interview assistant that leverages AI to provide real-time solutions during coding interviews.☆258Updated this week
- 电子赛博游戏xAutoGLM 📦 标准 Android APK 一键安装 (无需电脑/Root)。🏗 核心: 基于 Chaquopy 引擎,将 Python Agent 直接嵌入安卓原生进程。⚡️ 特性: 赛博朋克叙事 UI (Native AutoGLM Clien…☆130Updated this week
- Marco Search Agent for Realistic and Challenging Agentic Search☆240Updated 2 months ago
- A toolkit enhances PyTorch with specialized functions for low-bit quantized neural networks.☆196Updated last year
- 4th Place Solution for the Kaggle Competition: LMSYS - Chatbot Arena Human Preference Predictions☆171Updated last year
- Dataset and evaluation code of ISDrama(ACM-MM 2025): Immersive Spatial Drama Generation through Multimodal Prompting☆237Updated 4 months ago
- ☆220Updated 7 months ago
- ☆185Updated 5 months ago
- Valuation of tokens corresponding to influential individuals on social platforms through AI algorithms☆229Updated 3 months ago
- (EMNLP 2025 Findings) Source Evaluation scripts for Humanity's Last Code Exam☆95Updated 4 months ago
- 超能 文献|AI驱动的文档翻译与学术搜索服务。支持PDF、DOCX、PPTX等多格式文档的高质量翻译(支持11种语言),特别优化了数学公式翻译。同时提供PubMed学术文献智能搜索功能。更多访问:https://suppr.wilddata.cn☆246Updated 2 months ago
- 从0训练类 o1 大语言模型。☆132Updated last week
- 这是一个MCP客户端,让你轻松配置各个大模型,对接各种MCP Server而开发。This is an MCP client that allows you to easily configure various large models and develop inter…☆138Updated 2 months ago
- 🌐Web Agent Protocol (WAP) - Record and replay user interactions in the browser with MCP support☆488Updated 6 months ago
- ☆392Updated 8 months ago
- ☆205Updated 3 weeks ago
- Link: https://kc-li.com/mytools , Weekly update trending AI apps.☆87Updated last week