Chinese-native retrieval-augmented generation (RAG) evaluation benchmark
☆126Apr 18, 2024Updated last year
Alternatives and similar repositories for SuperCLUE-RAG
Users that are interested in SuperCLUE-RAG are comparing it to the libraries listed below.
- Math24o: High School Olympiad Mathematics Chinese Benchmark☆11Mar 27, 2025Updated 11 months ago
- CRUD-RAG: A Comprehensive Chinese Benchmark for Retrieval-Augmented Generation of Large Language Models☆363May 20, 2025Updated 10 months ago
- KuaiSearch PERKS☆12Nov 16, 2021Updated 4 years ago
- ☆18Apr 18, 2025Updated 11 months ago
- Chinese-native industrial evaluation benchmark☆15Mar 21, 2024Updated 2 years ago
- [ICML 2024] "LoCoCo: Dropping In Convolutions for Long Context Compression", Ruisi Cai, Yuandong Tian, Zhangyang Wang, Beidi Chen☆17Sep 7, 2024Updated last year
- Knowledge Graph based Question Answering benchmark.☆10Feb 1, 2020Updated 6 years ago
- Chinese-native multi-level text-to-video generation evaluation benchmark☆18Jul 8, 2024Updated last year
- ☆358May 17, 2024Updated last year
- SuperCLUE-Math6: Exploring a new generation of Chinese-native multi-turn, multi-step mathematical reasoning datasets☆58Feb 5, 2024Updated 2 years ago
- A high-throughput and memory-efficient inference and serving engine for LLMs☆141Dec 6, 2024Updated last year
- Recursive Abstractive Processing for Tree-Organized Retrieval☆10May 30, 2024Updated last year
- ☆10Mar 18, 2024Updated 2 years ago
- Local DeepSearch (Advantage: Low Threshold): an implementation of Agentic RAG based on DeepSeek-R1 API and Tavily API☆17Jun 21, 2025Updated 9 months ago
- A Massive Multi-Level Multi-Subject Knowledge Evaluation benchmark☆105Jul 20, 2023Updated 2 years ago
- This includes the original implementation of SELF-RAG: Learning to Retrieve, Generate and Critique through self-reflection by Akari Asai,…☆2,340May 25, 2024Updated last year
- Code for the MTEB leaderboard☆30Feb 4, 2025Updated last year
- The code in "SlideCoder: Layout-aware RAG-enhanced Hierarchical Slide Generation from Design"☆44Oct 20, 2025Updated 5 months ago
- Code for "An Empirical Study of Retrieval Augmented Generation with Chain-of-Thought"☆17Jul 27, 2024Updated last year
- PromptCLUE, a zero-shot learning model supporting all Chinese-language tasks☆665Jun 16, 2023Updated 2 years ago
- This is the implementation of LeCo☆32Jan 20, 2025Updated last year
- Firefly: a large-model training tool that supports training Qwen2.5, Qwen2, Yi1.5, Phi-3, Llama3, Gemma, MiniCPM, Yi, Deepseek, Orion, Xverse, Mixtral-8x7B, Zephyr, Mistral, Baichuan2, Llama2, …☆6,652Oct 24, 2024Updated last year
- Comprehensive evaluation of the Chinese version of the open-source Llama3 model, based on the SuperCLUE benchmark | Llama3 Chinese Evaluation with SuperCLUE☆16Apr 21, 2024Updated last year
- ☆19Sep 19, 2024Updated last year
- ☆2,126May 8, 2024Updated last year
- Commonsense Reasoning and Moral Understanding Evaluation in Children's Stories (CRMU)☆18Oct 22, 2024Updated last year
- code for piccolo embedding model from SenseTime☆145May 21, 2024Updated last year
- Retrieval and Retrieval-augmented LLMs☆11,443Mar 10, 2026Updated 2 weeks ago
- ☆21Aug 19, 2024Updated last year
- Chinese safety prompts for evaluating and improving the safety of LLMs.☆1,136Feb 27, 2024Updated 2 years ago
- ☆38Jan 9, 2026Updated 2 months ago
- machine reading comprehension with deep learning☆19Feb 6, 2018Updated 8 years ago
- [COLING 2024] CMNEE: A Large-Scale Document-Level Event Extraction Dataset based on Open-Source Chinese Military News☆46Jan 26, 2026Updated last month
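- YAYI information extraction large model: instruction-tuned on millions of manually constructed, high-quality information extraction examples, developed by the Zhongke Wenge (中科闻歌) algorithm team. (Repo for YAYI Unified Information Extraction Model)☆316Aug 8, 2024Updated last year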
- ☆36Jul 8, 2025Updated 8 months ago
- SuperCLUE automatic machine grading system for Gaokao (college entrance exam) essays☆18Jun 8, 2023Updated 2 years ago
- ☆62Oct 29, 2024Updated last year
- Repository of LV-Eval Benchmark☆74Aug 31, 2024Updated last year
- [CVPR 2025] Docopilot: Improving Multimodal Models for Document-Level Understanding☆36Jul 22, 2025Updated 8 months ago