hpcaitech / Titans
A collection of models built with ColossalAI
☆32Updated 2 years ago
Alternatives and similar repositories for Titans:
Users that are interested in Titans are comparing it to the libraries listed below
- Scalable PaLM implementation of PyTorch☆192Updated 2 years ago
- A memory efficient DLRM training solution using ColossalAI☆101Updated 2 years ago
- Fast LLM Training CodeBase With dynamic strategy choosing [Deepspeed+Megatron+FlashAttention+CudaFusionKernel+Compiler];☆36Updated last year
- Large Scale Distributed Model Training strategy with Colossal AI and Lightning AI☆58Updated last year
- An experimental implementation of the retrieval-enhanced language model☆74Updated 2 years ago
- OpenLLMDE: An open source data engineering framework for LLMs☆17Updated last year
- Ouroboros: Speculative Decoding with Large Model Enhanced Drafting (EMNLP 2024 main)☆84Updated 3 months ago
- LLaMa Tuning with Stanford Alpaca Dataset using Deepspeed and Transformers☆50Updated last year
- ☆74Updated last year
- [ICLR2025] Breaking Throughput-Latency Trade-off for Long Sequences with Speculative Decoding☆105Updated last month
- Linear Attention Sequence Parallelism (LASP)☆76Updated 7 months ago
- An Experiment on Dynamic NTK Scaling RoPE☆62Updated last year
- ☆106Updated last year
- ☆24Updated 2 years ago
- [AAAI 2024] Investigating the Effectiveness of Task-Agnostic Prefix Prompt for Instruction Following☆79Updated 4 months ago
- Techniques used to run BLOOM at inference in parallel☆37Updated 2 years ago
- Transformers at any scale☆41Updated last year
- A Python library transfers PyTorch tensors between CPU and NVMe☆102Updated 2 months ago
- SparseGPT + GPTQ Compression of LLMs like LLaMa, OPT, Pythia☆41Updated last year
- Open Implementations of LLM Analyses☆98Updated 3 months ago
- Repository for analysis and experiments in the BigCode project.☆117Updated 10 months ago
- Official implementation for 'Extending LLMs’ Context Window with 100 Samples'☆76Updated last year
- Implementation of "LM-Infinite: Simple On-the-Fly Length Generalization for Large Language Models"☆42Updated 2 months ago
- Data preparation code for CrystalCoder 7B LLM☆44Updated 8 months ago
- This is a text generation method which returns a generator, streaming out each token in real-time during inference, based on Huggingface/…☆96Updated 10 months ago
- LMTuner: Make the LLM Better for Everyone☆33Updated last year
- Retrieves parquet files from Hugging Face, identifies and quantifies junky data, duplication, contamination, and biased content in datase…☆51Updated last year
- The data processing pipeline for the Koala chatbot language model☆117Updated last year
- MultilingualShareGPT, the free multi-language corpus for LLM training☆73Updated last year
- Repository for CPU Kernel Generation for LLM Inference☆25Updated last year