Michaelvll/llm-ie-benchmarks

A collection of reproducible inference engine benchmarks

☆38

Alternatives and similar repositories for llm-ie-benchmarks

Users who are interested in llm-ie-benchmarks are comparing it to the repositories listed below.

simon-mo / vLLM-Benchmark
☆33 · Apr 19, 2025 · Updated last year
skypilot-org / skypilot-catalog
☆37 · Updated this week
keeeeenw / TinyLlama
The TinyLlama project is an open endeavor to pretrain a 1.1B Llama model on 3 trillion tokens.
☆14 · Mar 30, 2024 · Updated 2 years ago
Etamin / TSED
TSED with Flexible Parser
☆21 · Jan 22, 2026 · Updated 6 months ago
skypilot-org / skypilot-tutorial
Tutorial to get started with SkyPilot!
☆60 · May 15, 2024 · Updated 2 years ago
ucbrise / hypersched
Deadline-based hyperparameter tuning on RayTune.
☆32 · Jan 16, 2020 · Updated 6 years ago
jiazhihao / sosp19ae
Artifacts for SOSP'19 paper Optimizing Deep Learning Computation with Automatic Generation of Graph Substitutions
☆21 · Apr 15, 2022 · Updated 4 years ago
MeetElise / surprise-similarity
A context-aware embedding similarity score
☆11 · Aug 23, 2023 · Updated 2 years ago
TJUSSE / sseweb
The main repository of sse.tongji.edu.cn
☆16 · Oct 28, 2015 · Updated 10 years ago
HPAI-BSC / prompt_engine
Evaluate your model using advanced prompt strategies
☆21 · Jan 30, 2026 · Updated 5 months ago
AmanPriyanshu / GPT-OSS-MoE-ExpertFingerprinting
ExpertFingerprinting: Behavioral Pattern Analysis and Specialization Mapping of Experts in GPT-OSS-20B's Mixture-of-Experts Architecture
☆27 · Feb 3, 2026 · Updated 5 months ago
skypilot-org / sky-llama
☆28 · May 2, 2023 · Updated 3 years ago
ray-project / ray-legacy
An experimental distributed execution engine
☆23 · Jul 23, 2020 · Updated 6 years ago
mcoavoux / pnet
☆12 · Apr 18, 2019 · Updated 7 years ago
richarddwang / hugdatafast
The elegant integration of huggingface/nlp and fastai2 and handy transforms using pure huggingface/nlp
☆19 · Oct 6, 2020 · Updated 5 years ago
gtfintechlab / HYPHEN-ACL
Codebase for HYPHEN, accepted at ACL 2022 (main)
☆12 · May 17, 2022 · Updated 4 years ago
StigLidu / TURN
[ICML2025] Official Repo for Paper "Optimizing Temperature for Language Models with Multi-Sample Inference"
☆23 · Feb 16, 2025 · Updated last year
Nana-Lv / TF_SPLERGE
☆14 · Jan 11, 2022 · Updated 4 years ago
young-geng / koala_data_pipeline
The data processing pipeline for the Koala chatbot language model
☆118 · Apr 6, 2023 · Updated 3 years ago
dburihabwa / sgx-fs
Experimental encrypted file system using SGX and FUSE
☆12 · Oct 9, 2018 · Updated 7 years ago
abcdabcd987 / LLIRInterpreter
Single file interpreter (or naive virtual machine) for my intermediate representation. SSA support has been added.
☆15 · Apr 27, 2016 · Updated 10 years ago
YisongMiao / CS5228-project
Winning 2nd place🥈at NUS CS5228 in-class Kaggle competition 2018!
☆13 · Nov 13, 2018 · Updated 7 years ago
SebiSebi / AI2-Reasoning-Challenge-ARC
Source code for the AI2 Reasoning Challenge (ARC) submission.
☆16 · Dec 8, 2022 · Updated 3 years ago
cw75 / torchMojiBot
Building a Deep Learning Powered Emoji Slackbot!
☆15 · Jul 23, 2020 · Updated 6 years ago
Engineev / compiler-offline-judge
The offline version of acm-compiler-judge
☆13 · May 16, 2019 · Updated 7 years ago
lapp0 / lm-inference-engines
Comparison of Language Model Inference Engines
☆240 · Dec 16, 2024 · Updated last year
romilbhardwaj / kube-tutorial
Kubernetes Tutorial for the PS2 group meetings at UC Berkeley
☆18 · Mar 23, 2023 · Updated 3 years ago
coder543 / llm-speed-benchmark
A tool that can be used to measure the sequential performance of any OpenAI-compatible LLM API
☆25 · Aug 1, 2024 · Updated last year
sradc / patchless_mlp_mixer
A patchless architecture, based on MLP-Mixer
☆18 · Dec 30, 2021 · Updated 4 years ago
drbh / yamoe
🔀 yet another mixture of experts
☆23 · Jun 5, 2026 · Updated last month
foundation-model-stack / vllm-triton-backend
A Triton-only attention backend for vLLM
☆27 · Jul 14, 2026 · Updated 2 weeks ago
ibm-aur-nlp / EDD
☆19 · Jun 11, 2024 · Updated 2 years ago
dbindel / sjtu-summer2018
Course material for "Numerical Methods for Data Science" (SJTU, summer 2018)
☆40 · Jul 6, 2018 · Updated 8 years ago
grycap / ansible-role-hadoop
Ansible Role to install a Hadoop Cluster
☆16 · Apr 1, 2026 · Updated 3 months ago
simveit / effective_transpose
Effective transpose on Hopper GPU
☆29 · Sep 6, 2025 · Updated 10 months ago
levipereira / yolo_e2e
Implementation of End-to-End YOLO Models
☆10 · Dec 30, 2025 · Updated 6 months ago
KULeuven-MICAS / snax-mlir
Driving Snax with MLIR
☆23 · Apr 22, 2026 · Updated 3 months ago
RLsys-Foundation / TritonForge
🔥 LLM-powered GPU kernel synthesis: Train models to convert PyTorch ops into optimized Triton kernels via SFT+RL. Multi-turn compilation…
☆146 · Nov 10, 2025 · Updated 8 months ago
Oefenweb / ansible-pycharm
Ansible role to set up PyCharm
☆13 · Jun 2, 2025 · Updated last year