ServiceNow / Fast-LLM
Accelerating your LLM training to full speed! Made with ❤️ by ServiceNow Research
☆149 Updated this week
Alternatives and similar repositories for Fast-LLM:
Users who are interested in Fast-LLM are comparing it to the libraries listed below.
- Manage scalable open LLM inference endpoints in Slurm clusters ☆253 Updated 8 months ago
- The code for the paper ROUTERBENCH: A Benchmark for Multi-LLM Routing System ☆107 Updated 9 months ago
- EvolKit is an innovative framework designed to automatically enhance the complexity of instructions used for fine-tuning Large Language M… ☆207 Updated 4 months ago
- Evaluate and Enhance Your LLM Deployments for Real-World Inference Needs ☆224 Updated this week
- ☆113 Updated 6 months ago
- Archon provides a modular framework for combining different inference-time techniques and LMs with just a JSON config file. ☆165 Updated 2 weeks ago
- ☆73 Updated 2 months ago
- Official repository for "Scaling Retrieval-Based Language Models with a Trillion-Token Datastore". ☆196 Updated this week
- Mixing Language Models with Self-Verification and Meta-Verification ☆102 Updated 3 months ago
- experiments with inference on llama ☆104 Updated 9 months ago
- Codebase accompanying the Summary of a Haystack paper. ☆75 Updated 6 months ago
- OpenCoconut implements a latent reasoning paradigm where we generate thoughts before decoding. ☆168 Updated 2 months ago
- The official repo for "LLoCo: Learning Long Contexts Offline" ☆116 Updated 9 months ago
- XTR/WARP is an extremely fast and accurate retrieval engine based on Stanford's ColBERTv2/PLAID and Google DeepMind's XTR. ☆120 Updated 5 months ago
- ☆160 Updated 7 months ago
- Complex Function Calling Benchmark. ☆85 Updated 2 months ago
- EvaByte: Efficient Byte-level Language Models at Scale ☆85 Updated this week
- Tutorial for building LLM router ☆187 Updated 8 months ago
- Code for "LayerSkip: Enabling Early Exit Inference and Self-Speculative Decoding", ACL 2024 ☆277 Updated last month
- A pipeline for LLM knowledge distillation ☆98 Updated last month
- ☆106 Updated this week
- Train your own SOTA deductive reasoning model ☆81 Updated 2 weeks ago
- A framework to study AI models in Reasoning, Alignment, and use of Memory (RAM). ☆209 Updated this week
- 🔧 Compare how Agent systems perform on several benchmarks. 📊🚀 ☆90 Updated 5 months ago
- Just a bunch of benchmark logs for different LLMs ☆119 Updated 7 months ago
- Solving data for LLMs - Create quality synthetic datasets! ☆145 Updated 2 months ago
- Benchmark and research code for the paper SWEET-RL Training Multi-Turn LLM Agents on Collaborative Reasoning Tasks ☆83 Updated this week
- ☆114 Updated 6 months ago
- Routing on Random Forest (RoRF) ☆135 Updated 6 months ago
- Experimental Code for StructuredRAG: JSON Response Formatting with Large Language Models ☆104 Updated 3 months ago