hpcaitech / Titans
A collection of models built with ColossalAI
☆32Updated 2 years ago
Related projects ⓘ
Alternatives and complementary repositories for Titans
- Ouroboros: Speculative Decoding with Large Model Enhanced Drafting (EMNLP 2024 main)☆76Updated last month
- Scalable PaLM implementation of PyTorch☆192Updated last year
- A memory efficient DLRM training solution using ColossalAI☆100Updated 2 years ago
- Fast LLM Training CodeBase With dynamic strategy choosing [Deepspeed+Megatron+FlashAttention+CudaFusionKernel+Compiler];☆34Updated 10 months ago
- A plug-in of Microsoft DeepSpeed to fix the bug of DeepSpeed pipeline☆26Updated 3 years ago
- ☆23Updated 2 years ago
- A LLaMA1/LLaMA12 Megatron implement.☆27Updated 11 months ago
- NaturalCodeBench (Findings of ACL 2024)☆56Updated last month
- ☆78Updated 7 months ago
- ☆29Updated last year
- Reasoning by Communicating with Agents☆21Updated last month
- Performance benchmarking with ColossalAI☆39Updated 2 years ago
- An Experiment on Dynamic NTK Scaling RoPE☆61Updated 11 months ago
- [EMNLP 2023 Industry Track] A simple prompting approach that enables the LLMs to run inference in batches.☆69Updated 8 months ago
- [EMNLP'23] Execution-Based Evaluation for Open Domain Code Generation☆44Updated 10 months ago
- Large Scale Distributed Model Training strategy with Colossal AI and Lightning AI☆58Updated last year
- Official implementation for 'Extending LLMs’ Context Window with 100 Samples'☆74Updated 10 months ago
- ☆111Updated 8 months ago
- Simple and efficient pytorch-native transformer training and inference (batched)☆61Updated 7 months ago
- Implementation of NAACL 2024 Outstanding Paper "LM-Infinite: Simple On-the-Fly Length Generalization for Large Language Models"☆128Updated last month
- ☆103Updated last year
- Official repository for LightSeq: Sequence Level Parallelism for Distributed Training of Long Context Transformers☆195Updated 3 months ago
- ⏳ ChatLog: Recording and Analysing ChatGPT Across Time☆94Updated 5 months ago
- Techniques used to run BLOOM at inference in parallel☆37Updated 2 years ago
- Code and models for BERT on STILTs☆53Updated last year
- Experiments on speculative sampling with Llama models☆118Updated last year
- Easy control for Key-Value Constrained Generative LLM Inference(https://arxiv.org/abs/2402.06262)☆58Updated 9 months ago
- ☆55Updated 5 months ago
- Open Implementations of LLM Analyses☆94Updated last month