Nero10578/LLM-Inference-Benchmark

☆14

Alternatives and similar repositories for LLM-Inference-Benchmark

Users interested in LLM-Inference-Benchmark are comparing it to the repositories listed below.

HabermannR / Fantasy-Tribe-Game
LLM backed Fantasy Tribe Game
☆19 · Nov 21, 2024 · Updated last year
severian42 / Proteus-The-Genesis-LLM
Proteus is an experimental platform that combines the power of Large Language Models with the Genesis physics engine
☆25 · Dec 20, 2024 · Updated last year
Krisseck / Phrasing-Bot
A bot that checks your grammar and phrasing using LLM of choice
☆35 · Feb 6, 2025 · Updated last year
fishiatee / Tumera
Yet another frontend for LLM, written using .NET and WinUI 3
☆11 · Sep 14, 2025 · Updated 10 months ago
coder543 / llm-speed-benchmark
A tool that can be used to measure the sequential performance of any OpenAI-compatible LLM API
☆25 · Aug 1, 2024 · Updated last year
kkacsh321 / st-multi-gpu
Basic Streamlit Application for testing, and displaying Multi-GPU LLM timings
☆10 · Mar 30, 2024 · Updated 2 years ago
De-Panther / webxr-input-profiles-loader
WebXR Input Profiles Loader in Unity. Based on https://github.com/immersive-web/webxr-input-profiles
☆20 · May 10, 2026 · Updated 2 months ago
7etsuo / anvil
Anvil, an agent-native multi-genre game engine for AI coding agents
☆24 · Updated this week
calmstate / VisualTagger
Visual Tagger is a JavaScript tool that visually highlights HTML elements for AIs, aiding in identifying interactive components on web pa…
☆12 · Oct 28, 2024 · Updated last year
sarkiisov / startpage-frontend
☆10 · Aug 11, 2025 · Updated 11 months ago
transferwise / wise-topic
LLM-only topic extraction and classification
☆11 · Jun 3, 2026 · Updated last month
HugoLePicard / tutorial-triton-inference-server
☆16 · Jul 17, 2025 · Updated last year
PavAI-Research / pavai-c3po
reimagine the implementation of C-3PO droid voice synthesizer and multilingual translation and communication capabilities with the latest…
☆12 · Mar 6, 2024 · Updated 2 years ago
kozistr / triton-grpc-proxy-rs
Proxy server for triton gRPC server that inferences embedding model in Rust
☆21 · Aug 10, 2024 · Updated last year
zextras / tech-doc
Zextras Technical Documentation
☆13 · Jul 10, 2026 · Updated last week
iclr-blogposts / 2025
ICLR Blog Track 2025
☆19 · Sep 21, 2025 · Updated 9 months ago
databricks-industry-solutions / csrd_assistant
In this solution accelerator, we demonstrate how generative AI, retrieval augmented generation (RAG) and multi stage reasoning can be use…
☆14 · Nov 4, 2024 · Updated last year
keeeeenw / TinyLlama
The TinyLlama project is an open endeavor to pretrain a 1.1B Llama model on 3 trillion tokens.
☆14 · Mar 30, 2024 · Updated 2 years ago
LambdaLabsML / shaderunner
Ctrl + F but fancy.
☆16 · Sep 30, 2024 · Updated last year
solo5star / dashboard
Dashboard system (Grafana+Prometheus+Node-Exporter+Cadvisor)
☆18 · Jul 25, 2021 · Updated 4 years ago
extrawest / ev_stations_map_showcase
This project is an app that shows a map with Electric Charging Stations and their information. The app supports station markers clusterin…
☆12 · Jan 15, 2024 · Updated 2 years ago
leeex1 / Quillan-Ronin
Quillan-Ronin - an attempt at a mini Software 3.0 runtime on Universal BitNet 1.58-bit logic and a 9B EGGROLL Swarm. v6.0.3 Quantum featu…
☆26 · Updated this week
LambdaLabsML / unitree-retarget
☆14 · Mar 17, 2025 · Updated last year
BUTSpeechFIT / SOT-DiCoW
Multi-talker ASR based on DiCoW with Serialized Output Training
☆20 · Sep 18, 2025 · Updated 10 months ago
tijo95 / piper_tts
Piper speech synthesis for oobabooga
☆14 · Feb 24, 2024 · Updated 2 years ago
withlang-dev / with
☆29 · Updated this week
Neuroengine-vulns / neuroengine
Neuroengine is a service to share LLMs in the form of a webchat and API.
☆45 · Oct 21, 2024 · Updated last year
supersjimmie / wifi_ducky_keylogger
Keystroke injection and Keylogger with an ESP8266 + Arduino Pro Micro + USB Host Shield
☆17 · Nov 7, 2017 · Updated 8 years ago
BoredBrownBear / text-generation-webui-model_ducking
An extension for oobabooga/text-generation-webui that automatically unloads and reloads your model.
☆17 · Apr 22, 2024 · Updated 2 years ago
Zetaphor / Dead-Internet
Y'all thought the dead internet theory wasn't real, but HERE IT IS
☆18 · Apr 27, 2024 · Updated 2 years ago
BarthPaleologue / WebTide
WebTide is an ocean simulation based on Jerry Tessendorf's paper, implemented on WebGPU with BabylonJS.
☆35 · Jan 3, 2025 · Updated last year
krea-ai / CogVideo-lambda
Text-to-video generation.
☆20 · Jul 18, 2022 · Updated 4 years ago
gallen881 / Physics_Master
Physics Master is a model fine-tuned from llama3-8B-Instruct. It can answer your physics question!
☆16 · Aug 24, 2024 · Updated last year
TengHu / AutoCoder
☆11 · Jan 28, 2024 · Updated 2 years ago
modular / workshops
Modular workshop contents
☆16 · Updated this week
confident-ai / blog-examples
☆15 · Oct 22, 2023 · Updated 2 years ago
danielscottjames / dominion
Benchmarking LLMs as Casual Card Game AIs
☆20 · Jan 22, 2025 · Updated last year
Apsu / flue
Fast, Lightweight, Unified Engine for Text2Image Diffusion Models
☆20 · Apr 13, 2025 · Updated last year
matatonic / openedai-images
An OpenAI API compatible images server to generate or manipulate images.
☆18 · Feb 2, 2025 · Updated last year