eqimp / hogwild_llm

Official PyTorch implementation for Hogwild! Inference: Parallel LLM Generation with a Concurrent Attention Cache
89 · Updated this week

Alternatives and similar repositories for hogwild_llm:

Users who are interested in hogwild_llm compare it to the libraries listed below.