ServiceNow / Fast-LLM
Accelerating your LLM training to full speed! Made with ❤️ by ServiceNow Research
☆178Updated this week
Alternatives and similar repositories for Fast-LLM:
Users that are interested in Fast-LLM are comparing it to the libraries listed below
- Manage scalable open LLM inference endpoints in Slurm clusters☆254Updated 9 months ago
- Official repository for "Scaling Retrieval-Based Langauge Models with a Trillion-Token Datastore".☆196Updated last week
- EvolKit is an innovative framework designed to automatically enhance the complexity of instructions used for fine-tuning Large Language M…☆211Updated 5 months ago
- ☆113Updated 2 weeks ago
- ☆151Updated 4 months ago
- Code for the paper "Rethinking Benchmark and Contamination for Language Models with Rephrased Samples"☆299Updated last year
- OpenCoconut implements a latent reasoning paradigm where we generate thoughts before decoding.☆170Updated 3 months ago
- Set of scripts to finetune LLMs☆37Updated last year
- Code for "LayerSkip: Enabling Early Exit Inference and Self-Speculative Decoding", ACL 2024☆284Updated this week
- Train your own SOTA deductive reasoning model☆86Updated last month
- PyTorch building blocks for the OLMo ecosystem☆197Updated this week
- Let's build better datasets, together!☆259Updated 4 months ago
- A framework to study AI models in Reasoning, Alignment, and use of Memory (RAM).☆217Updated 2 weeks ago
- experiments with inference on llama☆104Updated 10 months ago
- Evaluate and Enhance Your LLM Deployments for Real-World Inference Needs☆266Updated this week
- Q-GaLore: Quantized GaLore with INT4 Projection and Layer-Adaptive Low-Rank Gradients.☆197Updated 9 months ago
- A pipeline for LLM knowledge distillation☆100Updated 2 weeks ago
- An easy-to-understand framework for LLM samplers that rewind and revise generated tokens☆139Updated 2 months ago
- A family of compressed models obtained via pruning and knowledge distillation☆334Updated 5 months ago
- Complex Function Calling Benchmark.☆96Updated 3 months ago
- The official repo for "LLoCo: Learning Long Contexts Offline"☆116Updated 10 months ago
- code for training & evaluating Contextual Document Embedding models☆180Updated this week
- Archon provides a modular framework for combining different inference-time techniques and LMs with just a JSON config file.☆168Updated last month
- Codebase accompanying the Summary of a Haystack paper.☆77Updated 7 months ago
- Maya: An Instruction Finetuned Multilingual Multimodal Model using Aya☆107Updated last month
- A Lightweight Library for AI Observability☆241Updated 2 months ago
- ☆122Updated last month
- awesome synthetic (text) datasets☆272Updated 5 months ago
- Experiments on speculative sampling with Llama models☆125Updated last year
- minimal GRPO implementation from scratch☆72Updated last month