TIGER-AI-Lab / MAmmoTH

Code and data for "MAmmoTH: Building Math Generalist Models through Hybrid Instruction Tuning" [ICLR 2024]

☆376

Alternatives and similar repositories for MAmmoTH

Users who are interested in MAmmoTH are comparing it to the repositories listed below.

anchen1011 / FireAct
FireAct: Toward Language Agent Fine-tuning
☆281 Updated last year
chuanyang-Zheng / Progressive-Hint
This is the official implementation of "Progressive-Hint Prompting Improves Reasoning in Large Language Models"
☆209 Updated last year
GAIR-NLP / abel
SOTA Math Opensource LLM
☆333 Updated last year
OFA-Sys / gsm8k-ScRel
Codes and Data for Scaling Relationship on Learning Mathematical Reasoning with Large Language Models
☆268 Updated 10 months ago
OpenBMB / UltraFeedback
A large-scale, fine-grained, diverse preference dataset (and models).
☆345 Updated last year
iiis-ai / cumulative-reasoning
Official implementation of TMLR paper "Cumulative Reasoning With Large Language Models" (https://arxiv.org/abs/2308.04371)
☆297 Updated this week
OpenBMB / Eurus
☆320 Updated 10 months ago
GAIR-NLP / MathPile
[NeurIPS D&B 2024] Generative AI for Math: MathPile
☆415 Updated 4 months ago
MARIO-Math-Reasoning / Super_MARIO
☆337 Updated 2 months ago
Ber666 / ToolkenGPT
ToolkenGPT: Augmenting Frozen Language Models with Massive Tools via Tool Embeddings - NeurIPS 2023 (oral)
☆264 Updated last year
Re-Align / URIAL
☆311 Updated last year
liutiedong / goat
a Fine-tuned LLaMA that is Good at Arithmetic Tasks
☆178 Updated last year
hkust-nlp / deita
Deita: Data-Efficient Instruction Tuning for Alignment [ICLR2024]
☆561 Updated 7 months ago
jayelm / gisting
Learning to Compress Prompts with Gist Tokens - https://arxiv.org/abs/2304.08467
☆289 Updated 5 months ago
GAIR-NLP / auto-j
Generative Judge for Evaluating Alignment
☆244 Updated last year
suzgunmirac / BIG-Bench-Hard
Challenging BIG-Bench Tasks and Whether Chain-of-Thought Can Solve Them
☆506 Updated last year
OpenLMLab / LEval
[ACL'24 Outstanding] Data and code for L-Eval, a comprehensive long context language models evaluation benchmark
☆388 Updated last year
sail-sg / lorahub
[COLM 2024] LoraHub: Efficient Cross-Task Generalization via Dynamic LoRA Composition
☆645 Updated last year
FranxYao / Long-Context-Data-Engineering
Implementation of paper Data Engineering for Scaling Language Models to 128K Context
☆467 Updated last year
QwenLM / AutoIF
☆298 Updated last year
TencentARC / LLaMA-Pro
[ACL 2024] Progressive LLaMA with Block Expansion.
☆507 Updated last year
TIGER-AI-Lab / Program-of-Thoughts
Data and Code for Program of Thoughts [TMLR 2023]
☆280 Updated last year
facebookresearch / Shepherd
This is the repo for the paper Shepherd -- A Critic for Language Model Generation
☆219 Updated last year
nlpxucan / evol-instruct
☆270 Updated 2 years ago
neelsjain / NEFTune
Official repository of NEFTune: Noisy Embeddings Improves Instruction Finetuning
☆397 Updated last year
declare-lab / instruct-eval
This repository contains code to quantitatively evaluate instruction-tuned models such as Alpaca and Flan-T5 on held-out tasks.
☆547 Updated last year
itsnamgyu / reasoning-teacher
Large Language Models Are Reasoning Teachers (ACL 2023)
☆341 Updated 4 months ago
allenai / reward-bench
RewardBench: the first evaluation tool for reward models.
☆622 Updated last month
voidism / DoLa
Official implementation for the paper "DoLa: Decoding by Contrasting Layers Improves Factuality in Large Language Models"
☆504 Updated 6 months ago
princeton-nlp / AutoCompressors
[EMNLP 2023] Adapting Language Models to Compress Long Contexts
☆309 Updated 10 months ago