LAION-AI / math_problems-step-by-step_solutions
Here we provide and collect many functions to generate math problems and step-by-step solutions for LLM training
☆17 Updated 2 years ago
Alternatives and similar repositories for math_problems-step-by-step_solutions
Users who are interested in math_problems-step-by-step_solutions are comparing it to the repositories listed below
- Implementation of CALM from the paper "LLM Augmented LLMs: Expanding Capabilities through Composition", out of Google DeepMind ☆179 Updated last year
- Small and Efficient Mathematical Reasoning LLMs ☆73 Updated last year
- ☆70 Updated last year
- This is the official repository for Inheritune. ☆116 Updated 10 months ago
- Script for processing OpenAI's PRM800K process supervision dataset into an Alpaca-style instruction-response format ☆27 Updated 2 years ago
- Scaling Data-Constrained Language Models ☆342 Updated 5 months ago
- Language models scale reliably with over-training and on downstream tasks ☆100 Updated last year
- Scripts for generating synthetic finetuning data for reducing sycophancy. ☆117 Updated 2 years ago
- [ICLR 2023] Guess the Instruction! Flipped Learning Makes Language Models Stronger Zero-Shot Learners ☆116 Updated 5 months ago
- Flacuna was developed by fine-tuning Vicuna on Flan-mini, a comprehensive instruction collection encompassing various tasks. Vicuna is al… ☆111 Updated 2 years ago
- ☆150 Updated last year
- ☆129 Updated last year
- Code accompanying the paper Pretraining Language Models with Human Preferences ☆180 Updated last year
- Multi-Domain Expert Learning ☆67 Updated last year
- ☆49 Updated 2 years ago
- Exploring finetuning public checkpoints on filtered 8K sequences on Pile ☆116 Updated 2 years ago
- Plug-and-play implementation of "Textbooks Are All You Need", ready for training, inference, and dataset generation ☆74 Updated 2 years ago
- ☆17 Updated 8 months ago
- Reference implementation for Reward-Augmented Decoding: Efficient Controlled Text Generation With a Unidirectional Reward Model ☆45 Updated 2 months ago
- A repository for transformer critique learning and generation ☆89 Updated 2 years ago
- ☆78 Updated 2 years ago
- [ACL 2024 Findings & ICLR 2024 WS] An Evaluator VLM that is open-source, offers reproducible evaluation, and is inexpensive to use. Specific… ☆78 Updated last year
- ModuleFormer is a MoE-based architecture that includes two different types of experts: stick-breaking attention heads and feedforward exp… ☆226 Updated 3 months ago
- This is the repo for the paper Shepherd -- A Critic for Language Model Generation ☆220 Updated 2 years ago
- Collection of autoregressive model implementations ☆85 Updated 7 months ago
- Code used for the creation of OBELICS, an open, massive and curated collection of interleaved image-text web documents, containing 141M d… ☆210 Updated last year
- Astraios: Parameter-Efficient Instruction Tuning Code Language Models ☆63 Updated last year
- M4 experiment logbook ☆58 Updated 2 years ago
- Finetune Falcon, LLaMA, MPT, and RedPajama on consumer hardware using PEFT LoRA ☆104 Updated 7 months ago
- Code for Zero-Shot Tokenizer Transfer ☆142 Updated 11 months ago