eqimp / hogwild_llmLinks
Official PyTorch implementation for Hogwild! Inference: Parallel LLM Generation with a Concurrent Attention Cache
☆105Updated last month
Alternatives and similar repositories for hogwild_llm
Users that are interested in hogwild_llm are comparing it to the libraries listed below
Sorting:
- Tina: Tiny Reasoning Models via LoRA☆245Updated last week
- OpenCoconut implements a latent reasoning paradigm where we generate thoughts before decoding.☆172Updated 4 months ago
- EvaByte: Efficient Byte-level Language Models at Scale☆98Updated last month
- Scaling Data for SWE-agents☆220Updated this week
- PyTorch building blocks for the OLMo ecosystem☆222Updated this week
- Archon provides a modular framework for combining different inference-time techniques and LMs with just a JSON config file.☆173Updated 2 months ago
- A scalable asynchronous reinforcement learning implementation with in-flight weight updates.☆119Updated this week
- Simple extension on vLLM to help you speed up reasoning model without training.☆152Updated this week
- PyTorch implementation of models from the Zamba2 series.☆182Updated 4 months ago
- [ACL 2024] Do Large Language Models Latently Perform Multi-Hop Reasoning?☆67Updated 2 months ago
- Agentic Reward Modeling: Integrating Human Preferences with Verifiable Correctness Signals for Reliable Reward Systems☆90Updated 2 months ago
- ☆190Updated 3 months ago
- ☆50Updated 11 months ago
- accompanying material for sleep-time compute paper☆90Updated last month
- Memory layers use a trainable key-value lookup mechanism to add extra parameters to a model without increasing FLOPs. Conceptually, spars…☆333Updated 5 months ago
- Training an LLM to use a calculator with multi-turn reinforcement learning, achieving a **62% absolute increase in evaluation accuracy**.☆38Updated last month
- ☆126Updated 2 months ago
- ☆89Updated last week
- Repo for "LoLCATs: On Low-Rank Linearizing of Large Language Models"☆237Updated 4 months ago
- Code for "LayerSkip: Enabling Early Exit Inference and Self-Speculative Decoding", ACL 2024☆301Updated last month
- code for training & evaluating Contextual Document Embedding models☆191Updated 3 weeks ago
- Train your own SOTA deductive reasoning model☆92Updated 2 months ago
- Benchmark and research code for the paper SWEET-RL Training Multi-Turn LLM Agents onCollaborative Reasoning Tasks☆207Updated last month
- Q-GaLore: Quantized GaLore with INT4 Projection and Layer-Adaptive Low-Rank Gradients.☆199Updated 10 months ago
- [ICML 2025] Reward-guided Speculative Decoding (RSD) for efficiency and effectiveness.☆31Updated last month
- Complex Function Calling Benchmark.☆109Updated 4 months ago
- 🌾 OAT: A research-friendly framework for LLM online alignment, including reinforcement learning, preference learning, etc.☆367Updated last week
- minimal GRPO implementation from scratch☆90Updated 2 months ago
- A simplified implementation for experimenting with RLVR on GSM8K, This repository provides a starting point for exploring reasoning.☆95Updated 3 months ago
- Official repository for "Scaling Retrieval-Based Langauge Models with a Trillion-Token Datastore".☆201Updated 3 weeks ago