godaai/llm-inference

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/godaai/llm-inference)

godaai / llm-inference

Resources for Large Language Model Inference

☆17

Alternatives and similar repositories for llm-inference

Users that are interested in llm-inference are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

ntrdma / ntrdma
View on GitHub
Linux tree for ntrdma driver development.
☆11Jun 29, 2017Updated 9 years ago
thirdgerb / ghost-in-shells
View on GitHub
WIP: project for engineering automatic bot (chatbot mainly)
☆13Sep 3, 2023Updated 2 years ago
tunib-ai / joker
View on GitHub
AI model designed to test the effectiveness in handling external ethical attacks.
☆11Feb 9, 2026Updated 5 months ago
alexeigor / sd-benchmarks
View on GitHub
Stable Diffusion inference benchmarks
☆10Jun 14, 2024Updated 2 years ago
yybit / pllm
View on GitHub
Portable LLM - A rust library for LLM inference
☆12Apr 13, 2024Updated 2 years ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
JiaYaobo / stamox
View on GitHub
make your statistical research faster
☆12Jul 7, 2023Updated 3 years ago
JiaYaobo / fenbux
View on GitHub
A Simple Statistical Distribution Library in JAX
☆16Mar 30, 2024Updated 2 years ago
tommie / go-coopsched
View on GitHub
A benchmark and playground for Completely Fair Scheduling in Go
☆11Feb 12, 2022Updated 4 years ago
systems-nuts / dynamic_to_static
View on GitHub
Convert a dynamically linked binary to a statically linked binary going thorugh LLVM IR, using mcsema
☆12May 27, 2019Updated 7 years ago
EdgeSimPy / edgesimpy-tutorials
View on GitHub
Hands-on tutorials to help you understand how EdgeSimPy works and how to use it in your research
☆25Aug 28, 2025Updated 11 months ago
lightsocks / lightsocks-go
View on GitHub
lightsocks client implements by golang
☆13Sep 11, 2015Updated 10 years ago
pierreHmbt / FedCP-QQ
View on GitHub
Federated Conformal Prediction with Quantile-of-Quantiles (FedCP-QQ)
☆11May 6, 2026Updated 2 months ago
deep-spin / non-exchangeable-crc
View on GitHub
☆11Sep 25, 2025Updated 10 months ago
Embucket / embucket
View on GitHub
BYOC option for your dbt Snowflake workloads
☆24Apr 24, 2026Updated 3 months ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
haormj / llama2.go
View on GitHub
Inference Llama 2 in one file of pure go
☆15Jul 25, 2023Updated 3 years ago
bluca / valgrind-dpdk
View on GitHub
Valgrind patched with support for DPDK (and rte_*alloc). STATICALLY LINKED: use --soname-synonyms=somalloc=NONE - DINAMICALLY LINKED: SON…
☆27Mar 10, 2022Updated 4 years ago
qingstor / neonio
View on GitHub
☆25Jul 13, 2021Updated 5 years ago
camenduru / notebooks
View on GitHub
☆23Dec 18, 2023Updated 2 years ago
tchinso / MekiCopy
View on GitHub
☆17Jul 10, 2026Updated 2 weeks ago
AUGMXNT / speed-benchmarking
View on GitHub
☆19Oct 18, 2025Updated 9 months ago
clu5 / federated-conformal
View on GitHub
Conformal Prediction + Federated Learning
☆16Mar 16, 2024Updated 2 years ago
bhagya-hettige / GLACE
View on GitHub
Gaussian Embedding of Large-scale Attributed Graphs
☆10Mar 13, 2020Updated 6 years ago
swaggy-TN / EfficientVLM
View on GitHub
EfficientVLM: Fast and Accurate Vision-Language Models via Knowledge Distillation and Modal-adaptive Pruning (ACL 2023)
☆33Jul 18, 2023Updated 3 years ago
GPU virtual machines on DigitalOcean Gradient AI • Ad
Get to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
luweizheng / finax
View on GitHub
High Performance Quantative Finance on JAX
☆13Jun 28, 2022Updated 4 years ago
CongWeilin / DGCN
View on GitHub
☆10Aug 13, 2021Updated 4 years ago
handhand / lightsocks-androidx
View on GitHub
你也可以做个Shadowsocks(Android篇)
☆14Jan 30, 2022Updated 4 years ago
princeton-pli / LongProc
View on GitHub
LongProc: Benchmarking Long-Context Language Models on Long Procedural Generation
☆36Feb 26, 2026Updated 5 months ago
wille1101 / sttg
View on GitHub
TUI-klient för SVTs text-tv skriven i Go.
☆23May 28, 2021Updated 5 years ago
adamydwang / mobilellama
View on GitHub
a lightweight C++ LLaMA inference engine for mobile devices
☆15Oct 28, 2023Updated 2 years ago
replicate / cog-flan-models
View on GitHub
Code for training & inference with FLAN family of models
☆17May 23, 2023Updated 3 years ago
sttich / dl-recommendation
View on GitHub
code for "Deep Learning for Sequential Recommendation: Algorithms, Influential Factors, and Evaluations"
☆12Sep 7, 2020Updated 5 years ago
suzgunmirac / marnns
View on GitHub
MARNNs Can Learn Generalized Dyck Languages
☆12Nov 11, 2019Updated 6 years ago
Managed Database hosting by DigitalOcean • Ad
PostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
Oneflow-Inc / oneflow-lite
View on GitHub
☆17Jan 1, 2024Updated 2 years ago
wonglkd / Baleen-FAST24
View on GitHub
Code for "Baleen: ML Admission & Prefetching for Flash Caches" (FAST 2024).
☆27Feb 29, 2024Updated 2 years ago
cns-iu / cjobs
View on GitHub
CJOBS
☆16Dec 22, 2018Updated 7 years ago
a3794110 / ns3-SUMO-Interface
View on GitHub
Network Simulation for Urban Mobility: Interface between ns-3 and SUMO. The project allow user to control ns-3 LTE module and the UE mobi…
☆44Sep 8, 2019Updated 6 years ago
rakyll / go-numa
View on GitHub
NUMA bindings for Go, requires libnuma.
☆26Nov 18, 2019Updated 6 years ago
TheEastKoi / celest
View on GitHub
DeepSeek TUI VS Code extension
☆16Jun 17, 2026Updated last month
nuxlear / keras-audio
View on GitHub
Example codes for Audio Processing with Deep Learning & Keras || Presentation ->
☆18Jul 16, 2019Updated 7 years ago