Laoyu84 / 4onebench
A minimalist benchmarking tool designed to test the routine-generation capabilities of LLMs.
☆17Updated 2 weeks ago
Related projects ⓘ
Alternatives and complementary repositories for 4onebench
- 中文原生检索增强生成测评基准☆100Updated 7 months ago
- the newest version of llama3,source code explained line by line using Chinese☆22Updated 7 months ago
- TianGong-AI-Unstructure☆51Updated this week
- Here is a demo for PDF parser (Including OCR, object detection tools)☆30Updated last month
- Imitate OpenAI with Local Models☆85Updated 2 months ago
- Repo for for paper "AgentRE: An Agent-Based Framework for Navigating Complex Information Landscapes in Relation Extraction".☆50Updated 3 months ago
- SELF-GUIDE: Better Task-Specific Instruction Following via Self-Synthetic Finetuning. COLM 2024 Accepted Paper☆28Updated 5 months ago
- ☆78Updated last month
- Qwen-WisdomVast is a large model trained on 1 million high-quality Chinese multi-turn SFT data, 200,000 English multi-turn SFT data, and …☆18Updated 7 months ago
- Recursive Abstractive Processing for Tree-Organized Retrieval☆11Updated 5 months ago
- pretrain a wiki llm using transformers☆10Updated 2 months ago
- Meta-Chunking: Learning Efficient Text Segmentation via Logical Perception☆68Updated last week
- ☆105Updated last year
- 大模型检索增强生成技术最佳实践。☆46Updated 2 months ago
- ☆129Updated 4 months ago
- 通用版面分析 | 中文文档解析 |Document Layout Analysis | layout paser☆45Updated 5 months ago
- ☆60Updated 2 months ago
- 文本去重☆67Updated 5 months ago
- ☆40Updated 5 months ago
- The official implementation of paper "ToolGen: Unified Tool Retrieval and Calling via Generation"☆99Updated 3 weeks ago
- 通用简单工具项目☆14Updated last month
- CLongEval: A Chinese Benchmark for Evaluating Long-Context Large Language Models☆38Updated 8 months ago
- ☆22Updated last month
- ☆11Updated 6 months ago
- Qwen1.5-SFT(阿里, Ali), Qwen_Qwen1.5-2B-Chat/Qwen_Qwen1.5-7B-Chat微调(transformers)/LORA(peft)/推理☆44Updated 6 months ago
- Light local website for displaying performances from different chat models.☆85Updated last year
- 大语言模型训练和服务调研☆34Updated last year
- 本项目用于Embedding模型的相关实验,包括Embedding模型评估、Embedding模型微调、Embedding模型量化等。☆29Updated 4 months ago
- Hammer: Robust Function-Calling for On-Device Language Models via Function Masking☆33Updated this week
- SmartSearch: Building a quick conversation-based search engine with LLMs.☆43Updated 6 months ago