eqimp/hogwild_llm

Official PyTorch implementation for Hogwild! Inference: Parallel LLM Generation with a Concurrent Attention Cache

☆142

Alternatives and similar repositories for hogwild_llm

Users who are interested in hogwild_llm are comparing it to the repositories listed below.

yandex-research / AsyncReasoning
☆24 · Jun 25, 2026 · Updated 3 weeks ago
goodevening13 / aquakv
☆21 · Apr 27, 2026 · Updated 2 months ago
Multiverse4FM / Multiverse
☆88 · Jun 16, 2025 · Updated last year
yandex-research / invertible-cd
[NeurIPS'2024] Invertible Consistency Distillation for Text-Guided Image Editing in Around 7 Steps
☆101 · Jul 4, 2024 · Updated 2 years ago
garipovroma / autojudge
[NeurIPS 2025] Official PyTorch implementation for the paper AutoJudge: Judge Decoding Without Manual Annotation
☆21 · Dec 22, 2025 · Updated 6 months ago
yandex-research / specexec
☆68 · Nov 4, 2024 · Updated last year
Parallel-Reasoning / APR
[COLM 2025] Code for Paper: Learning Adaptive Parallel Reasoning with Language Models
☆144 · Dec 17, 2025 · Updated 7 months ago
yandex-research / On-Efficient-Scaling-Of-GNNs
☆63 · Jun 10, 2026 · Updated last month
mi150 / VaLoRA
☆11 · May 19, 2025 · Updated last year
stanis-morozov / prodige
A supplementary code for Beyond Vector Spaces: Compact Data Representation as Differentiable Weighted Graphs.
☆47 · Nov 2, 2019 · Updated 6 years ago
tanmoyio / sahajBERT
☆14 · Dec 28, 2021 · Updated 4 years ago
dbaranchuk / VisualGenAI
Materials for the VisualGenAI course at YSDA2026
☆53 · Updated this week
Linzwcs / AFT
☆13 · Jan 22, 2025 · Updated last year
learning-at-home / collaborative-latent-diffusion
Collaborative inference of latent diffusion via hivemind
☆12 · May 29, 2023 · Updated 3 years ago
Tomorrowdawn / top_nsigma
The official code repo and data hub of top_nsigma sampling strategy for LLMs.
☆26 · Feb 11, 2025 · Updated last year
yandex-research / btard
Code for the paper "Secure Distributed Training at Scale" (ICML 2022)
☆16 · Feb 4, 2025 · Updated last year
LINs-lab / ELICIT
[ICLR 2025] ELICIT: LLM Augmentation Via External In-context Capability
☆14 · Mar 11, 2025 · Updated last year
Zcchill / Value-Residual-Learning
☆15 · Mar 20, 2025 · Updated last year
sail-sg / AnytimeReasoner
Optimizing Anytime Reasoning via Budget Relative Policy Optimization
☆54 · Jul 15, 2025 · Updated last year
chr26195 / PENCIL
This is the official implementation for paper "PENCIL: Long Thoughts with Short Memory".
☆81 · May 9, 2025 · Updated last year
Infini-AI-Lab / Multiverse
☆119 · Sep 13, 2025 · Updated 10 months ago
MasterVito / SwS
Official Repo for SwS: A Weakness-driven Problem Synthesis Framework in RL for LLM Reasoning
☆41 · Nov 11, 2025 · Updated 8 months ago
janhq / ReZero
☆160 · Apr 17, 2025 · Updated last year
sukanto-m / directory-monitor
☆16 · Oct 28, 2025 · Updated 8 months ago
luka-group / FaviComp
[EMNLP 2025 Findings] Familiarity-aware Evidence Compression for Retrieval Augmented Generation
☆15 · Aug 20, 2025 · Updated 11 months ago
wuhy68 / Parameter-Efficient-MoE
Parameter-Efficient Sparsity Crafting From Dense to Mixture-of-Experts for Instruction Tuning on General Tasks (EMNLP'24)
☆145 · Sep 20, 2024 · Updated last year
amazon-science / PrefEval
☆38 · May 30, 2025 · Updated last year
tilde-research / momoe-release
Memory optimized Mixture of Experts
☆80 · Jul 25, 2025 · Updated 11 months ago
LINs-lab / LIE
[preprint] Think Longer to Explore Deeper: Learn to Explore In-Context via Length-Incentivized Reinforcement Learning
☆19 · Feb 18, 2026 · Updated 5 months ago
fangyuan-ksgk / repo-viewer
Visualize any repo or codebase into diagram or animation
☆24 · Oct 14, 2024 · Updated last year
tegridydev / abstract-agent
Locally hosted AI Agent Python Tool To Generate Novel Research Hypothesis + Titles + Abstracts
☆30 · Apr 30, 2025 · Updated last year
AlexGoldie / discogen
Official implementation of DiscoGen, for "Procedural Generation of Algorithm Discovery Tasks in Machine Learning"
☆47 · Jul 2, 2026 · Updated 2 weeks ago
fangyuan-ksgk / selective-attention-transformer
Unofficial Implementation of Selective Attention Transformer
☆20 · Oct 31, 2024 · Updated last year
google-deepmind / simulation_streams
Simulation Streams is a programming paradigm designed to efficiently control and leverage Large Language Models (LLMs) for complex, dynam…
☆27 · Jul 2, 2026 · Updated 2 weeks ago
letta-ai / sleep-time-compute
accompanying material for sleep-time compute paper
☆134 · Apr 30, 2025 · Updated last year
SalesforceAIResearch / swecomm
☆28 · Jun 2, 2026 · Updated last month
luka-group / MrCoD
Multi-hop Evidence Retrieval for Cross-document Relation Extraction
☆12 · Sep 1, 2023 · Updated 2 years ago
NimbleEdge / sparse_transformers
Sparse Inferencing for transformer based LLMs
☆219 · Mar 25, 2026 · Updated 3 months ago
rail-berkeley / SUPE
This code accompanies the paper "Leveraging Skills from Unlabeled Prior Data for Efficient Online Exploration."
☆39 · Jul 11, 2025 · Updated last year