trotsky1997 / MathBlackBox
☆940Updated 2 weeks ago
Related projects ⓘ
Alternatives and complementary repositories for MathBlackBox
- Code for Quiet-STaR☆653Updated 3 months ago
- System 2 Reasoning Link Collection☆694Updated 3 weeks ago
- Automated Design of Agentic Systems☆1,040Updated this week
- Large Reasoning Models☆612Updated this week
- MLE-bench is a benchmark for measuring how well AI agents perform at machine learning engineering☆523Updated 3 weeks ago
- A library for advanced large language model reasoning☆1,451Updated last week
- A reading list on LLM based Synthetic Data Generation 🔥☆798Updated 2 weeks ago
- ☆453Updated this week
- Official repository for "Alignment Data Synthesis from Scratch by Prompting Aligned LLMs with Nothing". Your efficient and high-quality s…☆495Updated 2 weeks ago
- Sharing both practical insights and theoretical knowledge about LLM evaluation that we gathered while managing the Open LLM Leaderboard a…☆812Updated this week
- Official repository for ORPO☆421Updated 5 months ago
- A bibliography and survey of the papers surrounding o1☆780Updated last week
- ☆520Updated this week
- Automatically evaluate your LLMs in Google Colab☆559Updated 6 months ago
- Evaluate your LLM's response with Prometheus and GPT4 💯☆798Updated 2 months ago
- Distilabel is a framework for synthetic data and AI feedback for engineers who need fast, reliable and scalable pipelines based on verifi…☆1,647Updated this week
- [NeurIPS 2024 Spotlight] Buffer of Thoughts: Thought-Augmented Reasoning with Large Language Models☆535Updated 3 weeks ago
- [ICML 2024] Official repository for "Language Agent Tree Search Unifies Reasoning Acting and Planning in Language Models"☆685Updated 3 months ago
- ☆1,274Updated this week
- ☆322Updated 4 months ago
- A library with extensible implementations of DPO, KTO, PPO, ORPO, and other human-aware loss functions (HALOs).☆745Updated this week
- Lighteval is your all-in-one toolkit for evaluating LLMs across multiple backends☆813Updated this week
- [ACL'24] Selective Reflection-Tuning: Student-Selected Data Recycling for LLM Instruction-Tuning☆340Updated 2 months ago
- Official implementation of "Samba: Simple Hybrid State Space Models for Efficient Unlimited Context Language Modeling"☆804Updated 3 months ago
- AIDE: the state-of-the-art machine learning engineer agent, generating machine learning solution code from natural language descriptions.☆599Updated 2 weeks ago
- Memory optimization and training recipes to extrapolate language models' context length to 1 million tokens, with minimal hardware.☆648Updated last month
- An Open Source Toolkit For LLM Distillation☆358Updated 2 months ago
- Minimalistic large language model 3D-parallelism training☆1,265Updated this week
- Official repo for the paper "Scaling Synthetic Data Creation with 1,000,000,000 Personas"☆891Updated last month