math-eval / TAL-SCQ5K
☆143Updated last year
Alternatives and similar repositories for TAL-SCQ5K:
Users that are interested in TAL-SCQ5K are comparing it to the libraries listed below
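- Dataset synthesis, model training, and evaluation for the mathematical problem-solving ability of large models, plus notes on related articles.☆73Updated 5 months ago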
- Official PyTorch Implementation for MathGLM☆320Updated last year
- ☆209Updated 9 months ago
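- Alpaca Chinese Dataset -- Chinese instruction fine-tuning dataset [human-curated + GPT4o, continuously updated]☆192Updated 4 months ago
- SuperCLUE-Agent: a benchmark for evaluating core agent capabilities based on native Chinese tasks☆81Updated last year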
- XVERSE-65B: A multilingual large language model developed by XVERSE Technology Inc.☆135Updated 10 months ago
- A Massive Multi-Level Multi-Subject Knowledge Evaluation benchmark☆100Updated last year
- SOTA Math Opensource LLM☆331Updated last year
- Gaokao Benchmark for AI☆105Updated 2 years ago
- SUS-Chat: Instruction tuning done right☆48Updated last year
- ☆225Updated 9 months ago
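- Li Lulu's hands-on practice with the Chinese version of Andrew Ng's course "ChatGPT Prompt Engineering for Developers"☆132Updated last year
- A multi-dimensional benchmark for evaluating the Chinese alignment of large models (ACL 2024)☆359Updated 6 months ago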
- MathEval is a benchmark dedicated to the holistic evaluation on mathematical capacities of LLMs.☆73Updated 3 months ago
- deep learning☆150Updated 8 months ago
- Imitate OpenAI with Local Models☆86Updated 5 months ago
- ☆218Updated 3 months ago
- How to train an LLM tokenizer☆140Updated last year
- ☆307Updated 7 months ago
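- Uses langchain for task planning and builds conversational scenario resources for each subtask. An MCTS task executor lets each subtask use the resources in its context and explore through self-reflection to reach its own best answer to the problem. This approach relies on the model's alignment preferences; for each preference we designed an engineering framework that applies a sampling strategy over the rewards the model assigns to different answers.☆27Updated this week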
- Unleashing the Power of Cognitive Dynamics on Large Language Models☆60Updated 4 months ago
- The newest version of llama3, with the source code explained line by line in Chinese☆22Updated 10 months ago
- It's an open-source LLM based on an MoE structure.☆58Updated 7 months ago
- ☆139Updated 7 months ago
- ☆81Updated 10 months ago
- ☆104Updated 3 months ago
- Mixture-of-Experts (MoE) Language Model☆184Updated 5 months ago
- ☆62Updated last year
- qwen-7b and qwen-14b finetuning☆90Updated 10 months ago
- SuperCLUE-Math6: exploring a new generation of native Chinese multi-turn, multi-step mathematical reasoning datasets☆53Updated last year