umass-ml4ed / mathGPTLinks
A GPT-based generative LM for combined text and math formulas, leveraging tree-based formula encoding. Published as "Tree-Based Representation and Generation of Natural and Mathematical Language" (ACL 2023)
☆41Updated 2 years ago
Alternatives and similar repositories for mathGPT
Users that are interested in mathGPT are comparing it to the libraries listed below
Sorting:
- The multilingual variant of GLM, a general language model trained with autoregressive blank infilling objective☆62Updated 3 years ago
- a Fine-tuned LLaMA that is Good at Arithmetic Tasks☆178Updated 2 years ago
- OpenLLMDE: An open source data engineering framework for LLMs☆18Updated 2 years ago
- Fast LLM Training CodeBase With dynamic strategy choosing [Deepspeed+Megatron+FlashAttention+CudaFusionKernel+Compiler];☆40Updated 2 years ago
- A full pipeline to finetune Alpaca LLM with LoRA and RLHF on consumer hardware. Implementation of RLHF (Reinforcement Learning with Human…☆60Updated 2 years ago
- ⏳ ChatLog: Recording and Analysing ChatGPT Across Time☆103Updated last year
- ☆84Updated last year
- Retrieves parquet files from Hugging Face, identifies and quantifies junky data, duplication, contamination, and biased content in datase…☆53Updated 2 years ago
- This is a personal reimplementation of Google's Infini-transformer, utilizing a small 2b model. The project includes both model and train…☆58Updated last year
- Official repository for RAGViz: Diagnose and Visualize Retrieval-Augmented Generation [EMNLP 2024]☆88Updated last year
- An Implementation of "Orca: Progressive Learning from Complex Explanation Traces of GPT-4"☆43Updated last year
- ☆49Updated 2 years ago
- Source code for GreaTer ICLR 2025 - Gradient Over Reasoning makes Smaller Language Models Strong Prompt Optimizers☆34Updated 9 months ago
- ☆34Updated last year
- The code and data for "StructGPT: A general framework for Large Language Model to Reason on Structured Data"☆103Updated last year
- GRAIN: Gradient-based Intra-attention Pruning on Pre-trained Language Models☆19Updated 2 years ago
- ☆31Updated last year
- ☆96Updated last year
- A Toolkit for Table-based Question Answering☆115Updated 2 years ago
- 中文原生等级化代码能力测试基准☆15Updated last year
- code for Scaling Laws of RoPE-based Extrapolation☆73Updated 2 years ago
- MultilingualShareGPT, the free multi-language corpus for LLM training☆73Updated 2 years ago
- A Python implementation of Toolformer using Huggingface Transformers☆14Updated 2 years ago
- The aim of this repository is to utilize LLaMA to reproduce and enhance the Stanford Alpaca☆98Updated 2 years ago
- FuseAI Project☆87Updated last year
- Open Implementations of LLM Analyses☆107Updated last year
- ☆33Updated 2 years ago
- Self-Controlled Memory System for LLMs☆49Updated last year
- This is a meta-model distilled from LLMs for information extraction. This is an intermediate checkpoint that can be well-transferred to a…☆28Updated 11 months ago
- Summarize all open source Large Languages Models and low-cost replication methods for Chatgpt.☆136Updated 2 years ago