malody2014 / llm_benchmark
☆98Updated last month
Alternatives and similar repositories for llm_benchmark:
Users who are interested in llm_benchmark are comparing it to the repositories listed below
- LLM Arena by KCORES team☆688Updated last week
- ☆232Updated 2 months ago
- TurtleBench: Evaluating Top Language Models via Real-World Yes/No Puzzles.☆145Updated 6 months ago
- ☆201Updated this week
- ☆698Updated last week
- ☆179Updated last week
- ☆105Updated 4 months ago
- Train a 1B LLM with 1T tokens from scratch as an individual☆613Updated last month
- website☆410Updated last month
- ☆999Updated 8 months ago
- ☆62Updated this week
- ☆703Updated last year
- High-performance inference framework for large language models, focusing on efficiency, flexibility, and availability.☆1,097Updated this week
- Scaling Deep Research via Reinforcement Learning in Real-world Environments.☆282Updated 2 weeks ago
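- [Item-by-item processing complete] A QA dataset of selected questions from Ruozhiba (弱智吧), with every entry manually reviewed and revised☆193Updated 2 weeks ago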
- A visualization tool for deeper understanding and easier debugging of RLHF training.☆187Updated 2 months ago
- ☆168Updated last year
- An open-source solution for full parameter fine-tuning of DeepSeek-V3/R1 671B, including complete code and scripts from training to infer…☆659Updated last month
- ☆405Updated this week
- Distributed RL System for LLM Reasoning☆1,164Updated this week
- Cool Papers - Immersive Paper Discovery☆518Updated 3 weeks ago
- Interpretation, extension, and reproduction of the DeepSeek series of work.☆627Updated 3 weeks ago
- Repo for Benchmarking Multimodal Retrieval Augmented Generation with Dynamic VQA Dataset and Self-adaptive Planning Agent☆305Updated this week
- Unleashing the Power of Reinforcement Learning for Math and Code Reasoners☆517Updated this week
- LongBench v2 and LongBench (ACL 2024)☆850Updated 3 months ago
- A collection of phenomena observed during the scaling of big foundation models, which may be developed into consensus, principles, or l…☆279Updated last year
- A multi-dimensional Chinese alignment evaluation benchmark for large language models (ACL 2024)☆382Updated 8 months ago
- ☆673Updated last week
- LLM101n: Let's build a Storyteller (Chinese edition)☆131Updated 8 months ago
- Chinese Mixtral-8x7B (Chinese-Mixtral-8x7B)☆650Updated 8 months ago