umass-ml4ed / mathGPTLinks
A GPT-based generative LM for combined text and math formulas, leveraging tree-based formula encoding.
☆40Updated 2 years ago
Alternatives and similar repositories for mathGPT
Users that are interested in mathGPT are comparing it to the libraries listed below
Sorting:
- Fast LLM Training CodeBase With dynamic strategy choosing [Deepspeed+Megatron+FlashAttention+CudaFusionKernel+Compiler];☆40Updated last year
- Retrieves parquet files from Hugging Face, identifies and quantifies junky data, duplication, contamination, and biased content in datase…☆53Updated 2 years ago
- A Python implementation of Toolformer using Huggingface Transformers☆14Updated 2 years ago
- In-Context Alignment: Chat with Vanilla Language Models Before Fine-Tuning☆35Updated last year
- The multilingual variant of GLM, a general language model trained with autoregressive blank infilling objective☆62Updated 2 years ago
- This is a new metric that can be used to evaluate faithfulness of text generated by LLMs. The work behind this repository can be found he…☆31Updated last year
- 中文原生等级化代码能力测试基准☆14Updated last year
- OpenLLMDE: An open source data engineering framework for LLMs☆17Updated last year
- Large Scale Distributed Model Training strategy with Colossal AI and Lightning AI☆57Updated last year
- An Implementation of "Orca: Progressive Learning from Complex Explanation Traces of GPT-4"☆43Updated 9 months ago
- SWIM-IR is a Synthetic Wikipedia-based Multilingual Information Retrieval training set with 28 million query-passage pairs spanning 33 la…☆48Updated last year
- A collection of models built with ColossalAI☆32Updated 2 years ago
- Official repository for RAGViz: Diagnose and Visualize Retrieval-Augmented Generation [EMNLP 2024]☆85Updated 6 months ago
- ☆82Updated last year
- A full pipeline to finetune Alpaca LLM with LoRA and RLHF on consumer hardware. Implementation of RLHF (Reinforcement Learning with Human…☆58Updated 2 years ago
- The paper list of multilingual pre-trained models (Continual Updated).☆22Updated last year
- ☆19Updated last week
- ☆17Updated last year
- Reasoning by Communicating with Agents☆29Updated 2 months ago
- ☆31Updated last year
- Implementation of the model: "Reka Core, Flash, and Edge: A Series of Powerful Multimodal Language Models" in PyTorch☆30Updated 2 weeks ago
- [EMNLP 2023 Industry Track] A simple prompting approach that enables the LLMs to run inference in batches.☆74Updated last year
- ⏳ ChatLog: Recording and Analysing ChatGPT Across Time☆100Updated last year
- [COLM 2024] Early Weight Averaging meets High Learning Rates for LLM Pre-training☆16Updated 9 months ago
- [ACL 2023] Few-shot Reranking for Multi-hop QA via Language Model Prompting☆27Updated 2 years ago
- [NAACL 2024] Enhancing Chain-of-Thoughts Prompting with Iterative Bootstrapping in Large Language Models☆85Updated last year
- Tools for content datamining and NLP at scale☆43Updated last year
- minimal scripts for 24GB VRAM GPUs. training, inference, whatever☆41Updated last month
- Implementation of "LM-Infinite: Simple On-the-Fly Length Generalization for Large Language Models"☆40Updated 8 months ago
- [NAACL 2024 Findings] Evaluation suite for the systematic evaluation of instruction selection methods.☆22Updated last year