hpcaitech / TitansLinks
A collection of models built with ColossalAI
☆32Updated 2 years ago
Alternatives and similar repositories for Titans
Users that are interested in Titans are comparing it to the libraries listed below
Sorting:
- Scalable PaLM implementation of PyTorch☆190Updated 2 years ago
- A memory efficient DLRM training solution using ColossalAI☆105Updated 2 years ago
- The data processing pipeline for the Koala chatbot language model☆117Updated 2 years ago
- Fast LLM Training CodeBase With dynamic strategy choosing [Deepspeed+Megatron+FlashAttention+CudaFusionKernel+Compiler];☆40Updated last year
- Repository for analysis and experiments in the BigCode project.☆120Updated last year
- Python tools for processing the stackexchange data dumps into a text dataset for Language Models☆83Updated last year
- [EMNLP 2023 Industry Track] A simple prompting approach that enables the LLMs to run inference in batches.☆74Updated last year
- ☆104Updated 2 years ago
- The multilingual variant of GLM, a general language model trained with autoregressive blank infilling objective☆62Updated 2 years ago
- Open Implementations of LLM Analyses☆105Updated 9 months ago
- The aim of this repository is to utilize LLaMA to reproduce and enhance the Stanford Alpaca☆97Updated 2 years ago
- Ouroboros: Speculative Decoding with Large Model Enhanced Drafting (EMNLP 2024 main)☆107Updated 3 months ago
- A Python implementation of Toolformer using Huggingface Transformers☆14Updated 2 years ago
- a Fine-tuned LLaMA that is Good at Arithmetic Tasks☆178Updated last year
- ☆82Updated last year
- distill chatGPT coding ability into small model (1b)☆30Updated last year
- Implementation of NAACL 2024 Outstanding Paper "LM-Infinite: Simple On-the-Fly Length Generalization for Large Language Models"☆148Updated 4 months ago
- ⏳ ChatLog: Recording and Analysing ChatGPT Across Time☆100Updated last year
- A GPT-based generative LM for combined text and math formulas, leveraging tree-based formula encoding.☆40Updated 2 years ago
- Retrieves parquet files from Hugging Face, identifies and quantifies junky data, duplication, contamination, and biased content in datase…☆53Updated 2 years ago
- Examples of training models with hybrid parallelism using ColossalAI☆340Updated 2 years ago
- LLaMa Tuning with Stanford Alpaca Dataset using Deepspeed and Transformers☆51Updated 2 years ago
- A (somewhat) minimal library for finetuning language models with PPO on human feedback.☆85Updated 2 years ago
- Large Scale Distributed Model Training strategy with Colossal AI and Lightning AI☆57Updated last year
- Distributed IO-aware Attention algorithm☆20Updated 10 months ago
- An Implementation of "Orca: Progressive Learning from Complex Explanation Traces of GPT-4"☆43Updated 9 months ago
- setup the env for vllm users☆16Updated last year
- Techniques used to run BLOOM at inference in parallel☆37Updated 2 years ago
- fastertransformer for codegeex model☆63Updated 2 years ago
- Adversarial Training and SFT for Bot Safety Models☆40Updated 2 years ago