Laoyu84 / 4onebench

A minimalist benchmarking tool designed to test the routine-generation capabilities of LLMs.

☆25

Alternatives and similar repositories for 4onebench

Users who are interested in 4onebench are comparing it to the repositories listed below.

Lightblues / AgentRE
Repo for the paper "AgentRE: An Agent-Based Framework for Navigating Complex Information Landscapes in Relation Extraction".
☆67 Updated 11 months ago
linancn / TianGong-AI-Unstructure
TianGong-AI-Unstructure
☆68 Updated last month
shibing624 / agentica
Agentica: Effortlessly Build Intelligent, Reflective, and Collaborative Multimodal AI Agents! Build intelligent multimodal AI agents.
☆187 Updated last week
Yangjiaxi / Sense
[ACL24] Official repo for "Synthesizing Text-to-SQL Data from Weak and Strong LLMs"
☆67 Updated 11 months ago
SmartFlowAI / Hand-on-RAG
As the name suggests: a hand-built RAG
☆125 Updated last year
zzlgreat / smart_agent
☆105 Updated last year
llm-factory / imitater
Imitate OpenAI with Local Models
☆87 Updated 10 months ago
CLUEbenchmark / SuperCLUE-RAG
A native Chinese benchmark for evaluating retrieval-augmented generation
☆119 Updated last year
Alannikos / edg4llm
A unified tool to generate fine-tuning datasets for LLMs, including questions, answers, and dialogues. ✨🤖📚💬
☆60 Updated 4 months ago
DocAILab / XRAG
XRAG: eXamining the Core - Benchmarking Foundational Component Modules in Advanced Retrieval-Augmented Generation
☆103 Updated 2 weeks ago
liangwq / deepSeekRecurrence
An implementation of DeepSeek's tree-of-thought mode
☆15 Updated 5 months ago
Alibaba-NLP / CoFE-RAG
☆37 Updated 3 months ago
ArtificialZeng / llama3_explained
The newest version of llama3, with the source code explained line by line in Chinese
☆22 Updated last year
tpoisonooo / ROGRAG
[ACL2025 demo track] ROGRAG: A Robustly Optimized GraphRAG Framework
☆163 Updated 2 weeks ago
MigoXLab / dingo
Dingo: A Comprehensive AI Data Quality Evaluation Tool
☆288 Updated this week
yongzhuo / qwen2-sft
Qwen1.5-SFT (Alibaba): fine-tuning (transformers), LoRA (peft), and inference for Qwen_Qwen1.5-2B-Chat/Qwen_Qwen1.5-7B-Chat
☆63 Updated last year
riddle911 / SuperInsights
☆66 Updated 9 months ago
aliyun / qwen-dianjin
Qwen DianJin: LLMs for the Financial Industry by Alibaba Cloud
☆119 Updated last month
MetaGLM / LawGLM
Exploring the application potential of LLMs in the legal industry
☆90 Updated 7 months ago
HITsz-TMG / YiZhao
YiZhao: A 2TB Open Financial Corpus. Data and tools for generating and inspecting YiZhao, a safe, high-quality, open-source bilingual fin…
☆28 Updated last week
chaoql / rag-best-practices
Best practices for retrieval-augmented generation with large models.
☆80 Updated 10 months ago
ictnlp / FlexRAG
FlexRAG: A RAG Framework for Information Retrieval and Generation.
☆194 Updated last month
wey-gu / grpo-graph-extraction
Qwen GRPO Graph Extraction RL Finetune
☆51 Updated 3 months ago
t6am3 / law_glm_baseline
☆15 Updated last year
Alibaba-NLP / MaskSearch
Repo for "MaskSearch: A Universal Pre-Training Framework to Enhance Agentic Search Capability"
☆133 Updated last month
shibing624 / deep-research
Python implementation of AI-powered research assistant that performs iterative, deep research on any topic by combining search engines, w…
☆46 Updated 3 months ago
shuyhere / all-about-llm
A survey of large language model training and serving
☆37 Updated last year
yanqiangmiffy / tree2retriever
Recursive Abstractive Processing for Tree-Organized Retrieval
☆9 Updated last year
yiyepiaoling0715 / codellm-data-preprocess-pipeline
State-of-the-art industry data-processing pipeline for code LLM pre-training, fine-tuning, and DPO
☆42 Updated 11 months ago
zjrwtx / SFT-data-builder
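Use free LLM APIs together with your private-domain data to generate SFT training data (entirely free of charge). Supports the training data formats of tools such as llamafactory. Synthetic data.
☆169 Updated 7 months ago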