suyoumo / DeepClaude_BenchmarkLinks
This project is designed to evaluate the effectiveness of DeepClaude and other combination models.
☆40Updated 5 months ago
Alternatives and similar repositories for DeepClaude_Benchmark
Users that are interested in DeepClaude_Benchmark are comparing it to the libraries listed below
Sorting:
- DeepClaude Rust的升级版本☆208Updated 4 months ago
- 解题助手,面试助手,在「编码笔试」或「面试」时,借助AI实时提供解题思路和答案。A problem-solving and interview assistant that leverages AI to provide real-time solution approac…☆43Updated last week
- JittorGeometric is a Jittor-based graph machine learning library.☆160Updated this week
- ☆281Updated 2 months ago
- EvaLearn is a pioneering benchmark designed to evaluate large language models (LLMs) on their learning capability and efficiency in chall…☆422Updated last month
- DPO-Shift: Shifting the Distribution of Direct Preference Optimization☆60Updated 5 months ago
- 🌐Web Agent Protocol (WAP) - Record and replay user interactions in the browser with MCP support☆478Updated 2 months ago
- Official Repository for Paper: The Curse of CoT: On the Limitations of Chain-of-Thought in In-Context Learning☆51Updated 4 months ago
- 智川x-agent☆308Updated last week
- 一个用于分析创业公司数据的综合平台,包含爬虫系统、数据分析工具、创业评估AI模型、Web端和小程序端☆112Updated 3 months ago
- A powerful multi-format file parsing, data cleaning, and AI annotation toolkit.☆139Updated last week
- 医学中文RAG项目,使用langchain+milvus,支持快速一键式部署,支持无缝领域迁移☆123Updated this week
- Tokenize The Virtual Agents Onchain☆243Updated 2 months ago
- Vite plugin to help LLMs to interact with your React App☆128Updated 4 months ago
- A toolkit enhances PyTorch with specialized functions for low-bit quantized neural networks.☆197Updated last year
- This repo collects research papers that use AI tools and are in the field of scientific research (including computer science, agronomy, c…☆98Updated 5 months ago
- Source Evaluation scripts for Humanity's Last Code Exam☆90Updated last week
- ☆219Updated 2 months ago
- 4th Place Solution for the Kaggle Competition: LMSYS - Chatbot Arena Human Preference Predictions☆172Updated 10 months ago
- [MM 2025] EventVAD: Training-Free Event-Aware Video Anomaly Detection☆340Updated last month
- Link: https://kc-li.com/mytools , Weekly update trending AI apps.☆85Updated 3 weeks ago
- ☆38Updated 4 months ago
- LLM-FuzzX is a user-friendly fuzz testing tool for Large Language Models (e.g., GPT, Claude, LLaMA), featuring advanced task-aware mutati…☆114Updated 3 months ago
- ☆282Updated 2 months ago
- a multiscale multimodal large language models for radiology report generation (RRG) tasks☆261Updated 2 weeks ago
- ☆87Updated this week
- support all servers in Ai☆213Updated last week
- A multimodal personal assistant that allows Large Language Models (LLMs) to run code locally, acting as an autonomous agent capable of co…☆206Updated 7 months ago
- Fast and free zeroshot lipsync MCP server☆90Updated 3 months ago
- 一个功能强大的小红书自动化运营系统,支持多账号管理、定时批量智能发文、素材二创等功能。☆354Updated 2 months ago