hpcaitech / TitansLinks
A collection of models built with ColossalAI
☆32Updated 2 years ago
Alternatives and similar repositories for Titans
Users that are interested in Titans are comparing it to the libraries listed below
Sorting:
- Scalable PaLM implementation of PyTorch☆190Updated 2 years ago
- A memory efficient DLRM training solution using ColossalAI☆104Updated 2 years ago
- Fast LLM Training CodeBase With dynamic strategy choosing [Deepspeed+Megatron+FlashAttention+CudaFusionKernel+Compiler];☆39Updated last year
- Large Scale Distributed Model Training strategy with Colossal AI and Lightning AI☆57Updated last year
- MultilingualShareGPT, the free multi-language corpus for LLM training☆72Updated 2 years ago
- An Experiment on Dynamic NTK Scaling RoPE☆64Updated last year
- Repository for analysis and experiments in the BigCode project.☆119Updated last year
- A unified tokenization tool for Images, Chinese and English.☆152Updated 2 years ago
- Ouroboros: Speculative Decoding with Large Model Enhanced Drafting (EMNLP 2024 main)☆106Updated 3 months ago
- ☆105Updated 2 years ago
- This is a text generation method which returns a generator, streaming out each token in real-time during inference, based on Huggingface/…☆95Updated last year
- LLaMa Tuning with Stanford Alpaca Dataset using Deepspeed and Transformers☆51Updated 2 years ago
- A plug-in of Microsoft DeepSpeed to fix the bug of DeepSpeed pipeline☆26Updated 4 years ago
- NAACL '24 (Best Demo Paper RunnerUp) / MlSys @ NeurIPS '23 - RedCoast: A Lightweight Tool to Automate Distributed Training and Inference☆66Updated 6 months ago
- Data preparation code for CrystalCoder 7B LLM☆45Updated last year
- The aim of this repository is to utilize LLaMA to reproduce and enhance the Stanford Alpaca☆97Updated 2 years ago
- setup the env for vllm users☆16Updated last year
- Simple implementation of Speculative Sampling in NumPy for GPT-2.☆95Updated last year
- The multilingual variant of GLM, a general language model trained with autoregressive blank infilling objective☆62Updated 2 years ago
- A Python implementation of Toolformer using Huggingface Transformers☆14Updated 2 years ago
- Ongoing research training transformer language models at scale, including: BERT & GPT-2☆69Updated last year
- Implementation of NAACL 2024 Outstanding Paper "LM-Infinite: Simple On-the-Fly Length Generalization for Large Language Models"☆146Updated 3 months ago
- ☆23Updated last year
- Simple and efficient pytorch-native transformer training and inference (batched)☆76Updated last year
- An Implementation of "Orca: Progressive Learning from Complex Explanation Traces of GPT-4"☆43Updated 8 months ago
- Nano repo for RL training of LLMs☆61Updated 2 weeks ago
- Open Implementations of LLM Analyses☆104Updated 8 months ago
- [ICML 2023] SmoothQuant: Accurate and Efficient Post-Training Quantization for Large Language Models☆11Updated last year
- Async pipelined version of Verl☆100Updated 2 months ago
- Source code for ACL 2023 paper Decoder Tuning: Efficient Language Understanding as Decoding☆50Updated 2 years ago