snowflakedb / ArcticTrainingLinks
ArcticTraining is a framework designed to simplify and accelerate the post-training process for large language models (LLMs)
☆261Updated this week
Alternatives and similar repositories for ArcticTraining
Users that are interested in ArcticTraining are comparing it to the libraries listed below
Sorting:
- ArcticInference: vLLM plugin for high-throughput, low-latency inference☆354Updated this week
- A unified library for building, evaluating, and storing speculative decoding algorithms for LLM inference in vLLM☆160Updated last week
- A scalable asynchronous reinforcement learning implementation with in-flight weight updates.☆336Updated this week
- Accelerating your LLM training to full speed! Made with ❤️ by ServiceNow Research☆272Updated this week
- ☆610Updated last week
- ☆225Updated last month
- Cold Compress is a hackable, lightweight, and open-source toolkit for creating and benchmarking cache compression methods built on top of…☆146Updated last year
- Pytorch Distributed native training library for LLMs/VLMs with OOTB Hugging Face support☆209Updated this week
- Load compute kernels from the Hub☆352Updated last week
- Code for "LayerSkip: Enabling Early Exit Inference and Self-Speculative Decoding", ACL 2024☆349Updated 7 months ago
- 🚀 Efficiently (pre)training foundation models with native PyTorch features, including FSDP for training and SDPA implementation of Flash…☆275Updated last month
- [NeurIPS 2025] Simple extension on vLLM to help you speed up reasoning model without training.☆215Updated 6 months ago
- Reproducible, flexible LLM evaluations☆305Updated last month
- Memory optimized Mixture of Experts☆72Updated 4 months ago
- Manage scalable open LLM inference endpoints in Slurm clusters☆278Updated last year
- An extension of the nanoGPT repository for training small MOE models.☆219Updated 9 months ago
- Efficient LLM Inference over Long Sequences☆394Updated 5 months ago
- BABILong is a benchmark for LLM evaluation using the needle-in-a-haystack approach.☆230Updated 3 months ago
- ☆219Updated 11 months ago
- HuggingFace conversion and training library for Megatron-based models☆295Updated this week
- LM engine is a library for pretraining/finetuning LLMs☆102Updated this week
- Official repository for "Scaling Retrieval-Based Langauge Models with a Trillion-Token Datastore".☆221Updated last week
- Memory layers use a trainable key-value lookup mechanism to add extra parameters to a model without increasing FLOPs. Conceptually, spars…☆362Updated last year
- A high-throughput and memory-efficient inference and serving engine for LLMs☆267Updated 2 weeks ago
- A family of compressed models obtained via pruning and knowledge distillation☆361Updated last month
- Storing long contexts in tiny caches with self-study☆226Updated 2 weeks ago
- Async pipelined version of Verl☆125Updated 8 months ago
- REST: Retrieval-Based Speculative Decoding, NAACL 2024☆212Updated 3 months ago
- A project to improve skills of large language models☆715Updated this week
- Experiments on speculative sampling with Llama models☆127Updated 2 years ago