THUDM / ChatGLM-Math
☆78Updated 6 months ago
Related projects ⓘ
Alternatives and complementary repositories for ChatGLM-Math
- Offical Repo for "Programming Every Example: Lifting Pre-training Data Quality Like Experts at Scale"☆190Updated 3 weeks ago
- Reformatted Alignment☆112Updated last month
- ☆77Updated last month
- ☆129Updated 4 months ago
- ToolEyes: Fine-Grained Evaluation for Tool Learning Capabilities of Large Language Models in Real-world Scenarios☆62Updated 6 months ago
- [EMNLP 2024] LongAlign: A Recipe for Long Context Alignment of LLMs☆216Updated 6 months ago
- ☆48Updated 8 months ago
- NaturalCodeBench (Findings of ACL 2024)☆56Updated 3 weeks ago
- CLongEval: A Chinese Benchmark for Evaluating Long-Context Large Language Models☆38Updated 8 months ago
- A simple toolkit for benchmarking LLMs on mathematical reasoning tasks. 🧮✨☆94Updated 6 months ago
- ☆55Updated this week
- The implementation of paper "LLM Critics Help Catch Bugs in Mathematics: Towards a Better Mathematical Verifier with Natural Language Fee…☆32Updated 3 months ago
- SciGLM: Training Scientific Language Models with Self-Reflective Instruction Annotation and Tuning (NeurIPS D&B Track 2024)☆65Updated 8 months ago
- ☆37Updated 4 months ago
- [ACL'24] Superfiltering: Weak-to-Strong Data Filtering for Fast Instruction-Tuning☆123Updated 2 months ago
- ☆211Updated 3 months ago
- Unofficial implementation of AlpaGasus☆84Updated last year
- The official repository of the Omni-MATH benchmark.☆45Updated last week
- Collection of papers for scalable automated alignment.☆71Updated 2 weeks ago
- Code implementation of synthetic continued pretraining☆54Updated last month
- [NeurIPS 2024 Oral] Aligner: Efficient Alignment by Learning to Correct☆111Updated last week
- Codes and Data for Scaling Relationship on Learning Mathematical Reasoning with Large Language Models☆215Updated last month
- trending projects & awesome papers about data-centric llm studies.☆32Updated this week
- Source code of "Reasons to Reject? Aligning Language Models with Judgments"☆56Updated 8 months ago
- Official implementation for 'Extending LLMs’ Context Window with 100 Samples'☆73Updated 9 months ago
- We aim to provide the best references to search, select, and synthesize high-quality and large-quantity data for post-training your LLMs.☆46Updated last month
- ☆67Updated 4 months ago
- We introduce ScaleQuest, a scalable, novel and cost-effective data synthesis method to unleash the reasoning capability of LLMs.☆43Updated 2 weeks ago
- [EMNLP 2024 (Oral)] Leave No Document Behind: Benchmarking Long-Context LLMs with Extended Multi-Doc QA☆89Updated last month
- ☆37Updated 3 weeks ago