umass-ml4ed / mathGPTLinks
A GPT-based generative LM for combined text and math formulas, leveraging tree-based formula encoding. Published as "Tree-Based Representation and Generation of Natural and Mathematical Language" (ACL 2023)
☆40Updated 2 years ago
Alternatives and similar repositories for mathGPT
Users that are interested in mathGPT are comparing it to the libraries listed below
Sorting:
- The multilingual variant of GLM, a general language model trained with autoregressive blank infilling objective☆62Updated 3 years ago
- MultilingualShareGPT, the free multi-language corpus for LLM training☆73Updated 2 years ago
- OpenLLMDE: An open source data engineering framework for LLMs☆18Updated 2 years ago
- An Implementation of "Orca: Progressive Learning from Complex Explanation Traces of GPT-4"☆43Updated last year
- ⏳ ChatLog: Recording and Analysing ChatGPT Across Time☆103Updated last year
- Large Scale Distributed Model Training strategy with Colossal AI and Lightning AI☆56Updated 2 years ago
- a Fine-tuned LLaMA that is Good at Arithmetic Tasks☆178Updated 2 years ago
- A Python implementation of Toolformer using Huggingface Transformers☆14Updated 2 years ago
- A full pipeline to finetune Alpaca LLM with LoRA and RLHF on consumer hardware. Implementation of RLHF (Reinforcement Learning with Human…☆60Updated 2 years ago
- This is a personal reimplementation of Google's Infini-transformer, utilizing a small 2b model. The project includes both model and train…☆58Updated last year
- Gaokao Benchmark for AI☆109Updated 3 years ago
- Retrieves parquet files from Hugging Face, identifies and quantifies junky data, duplication, contamination, and biased content in datase…☆53Updated 2 years ago
- ☆49Updated 2 years ago
- Safety Score for Pre-Trained Language Models☆95Updated 2 years ago
- [EMNLP 2023 Industry Track] A simple prompting approach that enables the LLMs to run inference in batches.☆76Updated last year
- ☆83Updated last year
- ☆56Updated 2 years ago
- 中文原生等级化代码能力测试基准☆15Updated last year
- Source code for GreaTer ICLR 2025 - Gradient Over Reasoning makes Smaller Language Models Strong Prompt Optimizers☆33Updated 7 months ago
- Pretraining Efficiently on S2ORC!☆172Updated last year
- ☆35Updated 2 years ago
- ☆92Updated 3 years ago
- Inference script for Meta's LLaMA models using Hugging Face wrapper☆110Updated 2 years ago
- Python tools for processing the stackexchange data dumps into a text dataset for Language Models☆83Updated last year
- Fast LLM Training CodeBase With dynamic strategy choosing [Deepspeed+Megatron+FlashAttention+CudaFusionKernel+Compiler];☆41Updated last year
- The "GPT-API-Accelerate" project provides a set of Python classes for accelerating the process of generating responses to prompts using t…☆23Updated last year
- Transformers at any scale☆41Updated last year
- The data processing pipeline for the Koala chatbot language model☆118Updated 2 years ago
- A framework for human-readable prompt-based method with large language models. Specially designed for researchers. (Deprecated, check out…☆131Updated 2 years ago
- Code for paper titled "Towards the Law of Capacity Gap in Distilling Language Models"☆102Updated last year