Official PyTorch implementation for Hogwild! Inference: Parallel LLM Generation with a Concurrent Attention Cache
☆140Aug 13, 2025Updated 6 months ago
Alternatives and similar repositories for hogwild_llm
Users that are interested in hogwild_llm are comparing it to the libraries listed below
Sorting:
- ☆19Nov 5, 2025Updated 4 months ago
- Esoteric Language Models☆111Feb 8, 2026Updated last month
- ☆66Nov 4, 2024Updated last year
- Code and data release of the paper Enhancing LLM Complex Problem-Solving with Hybrid Thinking and Dynamic Workflows☆14Oct 4, 2024Updated last year
- Agentic Research and Evaluation Suite☆77Feb 26, 2026Updated last week
- This code accompanies the paper "Leveraging Skills from Unlabeled Prior Data for Efficient Online Exploration."☆37Jul 11, 2025Updated 7 months ago
- Enable R to source data from an OLAP cube via XMLA by specifying an MDX query.☆25Aug 6, 2015Updated 10 years ago
- Collaborative inference of latent diffusion via hivemind☆12May 29, 2023Updated 2 years ago
- ☆14Dec 28, 2025Updated 2 months ago
- Lottery Ticket Adaptation☆40Nov 20, 2024Updated last year
- [NeurIPS'2024] Invertible Consistency Distillation for Text-Guided Image Editing in Around 7 Steps☆101Jul 4, 2024Updated last year
- Persys desktop. Electron based application to access your Persys server.☆16May 16, 2025Updated 9 months ago
- An extention to the GaLore paper, to perform Natural Gradient Descent in low rank subspace☆18Oct 21, 2024Updated last year
- A pure and fast NumPy implementation of Mamba with cache support.☆18Jun 16, 2024Updated last year
- Memory-efficient transformer. Work in progress.☆19Sep 17, 2022Updated 3 years ago
- Parameter-Efficient Sparsity Crafting From Dense to Mixture-of-Experts for Instruction Tuning on General Tasks (EMNLP'24)☆145Sep 20, 2024Updated last year
- Optimizing Anytime Reasoning via Budget Relative Policy Optimization☆52Jul 15, 2025Updated 7 months ago
- Multiplex Thinking: Reasoning via Token-wise Branch-and-Merge☆109Jan 30, 2026Updated last month
- ☆23Sep 29, 2024Updated last year
- Code for NeurIPS 2024 Spotlight: "Scaling Laws and Compute-Optimal Training Beyond Fixed Training Durations"☆92Oct 30, 2024Updated last year
- Unofficial Implementation of Selective Attention Transformer☆21Oct 31, 2024Updated last year
- ☆34Jun 5, 2025Updated 9 months ago
- ☆32Oct 13, 2025Updated 4 months ago
- ☆23Jan 5, 2025Updated last year
- Visualize any repo or codebase into diagram or animation☆20Oct 14, 2024Updated last year
- (CVPR 2025) Switti: Designing Scale-Wise Transformers for Text-to-Image Synthesis☆201Jul 13, 2025Updated 7 months ago
- ☆19Mar 3, 2025Updated last year
- LLM that can analyze stocks☆24Nov 14, 2024Updated last year
- ☆32Aug 11, 2025Updated 6 months ago
- ☆36Oct 9, 2025Updated 5 months ago
- SIFT: Grounding LLM Reasoning in Contexts via Stickers☆57Mar 6, 2025Updated last year
- [ICLR 2026] Tina: Tiny Reasoning Models via LoRA☆323Sep 23, 2025Updated 5 months ago
- RWKV is an RNN with transformer-level LLM performance. It can be directly trained like a GPT (parallelizable). So it's combining the best…☆59Mar 17, 2025Updated 11 months ago
- ☆231Feb 24, 2025Updated last year
- QAlign is a new test-time alignment approach that improves language model performance by using Markov chain Monte Carlo methods.☆26Mar 2, 2026Updated last week
- Locally hosted AI Agent Python Tool To Generate Novel Research Hypothesis + Titles + Abstracts☆30Apr 30, 2025Updated 10 months ago
- ☆28Nov 10, 2025Updated 3 months ago
- j1-micro (1.7B) & j1-nano (600M) are absurdly tiny but mighty reward models.☆102Jul 19, 2025Updated 7 months ago
- Google Gemini AI model w/speech recognition and voice.☆26Nov 26, 2025Updated 3 months ago