umass-ml4ed / mathGPT
A GPT-based generative LM for combined text and math formulas, leveraging tree-based formula encoding. Published as "Tree-Based Representation and Generation of Natural and Mathematical Language" (ACL 2023)
☆41Updated 2 years ago
Alternatives and similar repositories for mathGPT
Users who are interested in mathGPT are comparing it to the repositories listed below.
- The multilingual variant of GLM, a general language model trained with autoregressive blank infilling objective☆62Updated 3 years ago
- A full pipeline to finetune Alpaca LLM with LoRA and RLHF on consumer hardware. Implementation of RLHF (Reinforcement Learning with Human…☆60Updated 2 years ago
- Retrieves parquet files from Hugging Face, identifies and quantifies junky data, duplication, contamination, and biased content in datase…☆53Updated 2 years ago
- a Fine-tuned LLaMA that is Good at Arithmetic Tasks☆178Updated 2 years ago
- An Implementation of "Orca: Progressive Learning from Complex Explanation Traces of GPT-4"☆43Updated last year
- MultilingualShareGPT, the free multi-language corpus for LLM training☆73Updated 2 years ago
- Safety Score for Pre-Trained Language Models☆96Updated 2 years ago
- The aim of this repository is to utilize LLaMA to reproduce and enhance the Stanford Alpaca☆98Updated 2 years ago
- This is a personal reimplementation of Google's Infini-transformer, utilizing a small 2b model. The project includes both model and train…☆58Updated last year
- OpenLLMDE: An open source data engineering framework for LLMs☆18Updated 2 years ago
- Implementation of the paper: "Leave No Context Behind: Efficient Infinite Context Transformers with Infini-attention" from Google in pyTO…☆58Updated 2 weeks ago
- [EMNLP 2023 Industry Track] A simple prompting approach that enables the LLMs to run inference in batches.☆77Updated last year
- Large Scale Distributed Model Training strategy with Colossal AI and Lightning AI☆56Updated 2 years ago
- Inference script for Meta's LLaMA models using Hugging Face wrapper☆110Updated 2 years ago
- Unofficial implementation of AlpaGasus☆94Updated 2 years ago
- The data processing pipeline for the Koala chatbot language model☆118Updated 2 years ago
- Inspired by google c4, here is a series of colossal clean data cleaning scripts focused on CommonCrawl data processing. Including Chinese…☆135Updated 2 years ago
- ☆92Updated 3 years ago
- Gaokao Benchmark for AI☆109Updated 3 years ago
- ⏳ ChatLog: Recording and Analysing ChatGPT Across Time☆103Updated last year
- Scalable PaLM implementation of PyTorch☆189Updated 3 years ago
- In-Context Alignment: Chat with Vanilla Language Models Before Fine-Tuning☆35Updated 2 years ago
- ☆95Updated last year
- Code for KaLM-Embedding models☆107Updated 6 months ago
- ☆98Updated 2 years ago
- GRAIN: Gradient-based Intra-attention Pruning on Pre-trained Language Models☆19Updated 2 years ago
- An Experiment on Dynamic NTK Scaling RoPE☆64Updated 2 years ago
- Self-Controlled Memory System for LLMs☆49Updated last year
- Code for paper titled "Towards the Law of Capacity Gap in Distilling Language Models"☆102Updated last year
- Python tools for processing the stackexchange data dumps into a text dataset for Language Models☆85Updated 2 years ago