☆21Oct 1, 2024Updated last year
Alternatives and similar repositories for LLM-inference-speed-benchmarks
Users that are interested in LLM-inference-speed-benchmarks are comparing it to the libraries listed below.
- Attend - to what matters.☆17Feb 22, 2025Updated last year
- ☆16Jul 18, 2023Updated 2 years ago
- Modified Beam Search with periodical restart☆12Sep 12, 2024Updated last year
- LLM inference in C/C++☆13Apr 21, 2024Updated last year
- Note about running ollama 🦙☆36May 2, 2024Updated last year
- A free and open-source GUI tool that simplifies combining multiple code files into one, with automatic labeling and support for various p…☆14Jan 3, 2025Updated last year
- Comparison of Language Model Inference Engines☆242Dec 16, 2024Updated last year
- Explore semantic caching to reduce your OpenAI/LLM API bill☆11Jul 21, 2023Updated 2 years ago
- SYN flood implementation using Boost.Asio☆12Nov 20, 2014Updated 11 years ago
- openai-proxy-vercel☆12Aug 11, 2023Updated 2 years ago
- Work with your business data using natural language☆19Nov 20, 2024Updated last year
- ☆14Sep 24, 2024Updated last year
- Code repository for the paper on "Predicting the Performance of Black-Box LLMs through Self-Queries".☆12Jan 9, 2025Updated last year
- Efficient Finetuning for OpenAI GPT-OSS☆23Oct 2, 2025Updated 6 months ago
- Unified System Interface, framework, server, GUI and Remote API☆15Apr 9, 2026Updated last week
- Simple implementation of an AABB Tree (Axis Aligned Bounding Box Tree) to optimize 3d collision detection☆10Oct 22, 2024Updated last year
- cursor logs with gpt-4o using litellm proxy☆14Sep 9, 2025Updated 7 months ago
- A curated collection of OpenClaw resources: GCP installation guide, best practices, and use cases☆19Feb 19, 2026Updated last month
- Load and run Llama from safetensors files in C☆15Oct 24, 2024Updated last year
- a character-ai like UI for LLM☆10Dec 3, 2024Updated last year
- ☆22Mar 25, 2025Updated last year
- ☆14Nov 23, 2018Updated 7 years ago
- self implementation of DPPO, Distributed Proximal Policy Optimization, by using tensorflow☆12Sep 1, 2017Updated 8 years ago
- SpyGame: An interactive multi-agent framework to evaluate intelligence with large language models :D☆15Nov 9, 2023Updated 2 years ago
- Parameter-Efficient Sparsity Crafting From Dense to Mixture-of-Experts for Instruction Tuning on General Tasks☆31May 22, 2024Updated last year
- A Tiny, Pure Python implementation of Gradient Boosted Trees.☆14Dec 28, 2022Updated 3 years ago
- ☆14Aug 22, 2024Updated last year
- Project on how to integrate django with data science libraries (i.e. pandas, matplotlib, numpy)☆14Jul 6, 2023Updated 2 years ago
- ☆14Mar 18, 2025Updated last year
- Logseq Plugin to streamline Youtube note taking☆12Apr 9, 2022Updated 4 years ago
- Inverse kinematic solver (FABRIK) for a simple 3D chain☆12Apr 23, 2021Updated 4 years ago
- ☆11May 20, 2022Updated 3 years ago
- Implementation of SoundStream from the paper: "SoundStream: An End-to-End Neural Audio Codec"☆13Jan 27, 2025Updated last year
- KMM: Key Frame Mask Mamba for Extended Motion Generation☆19Sep 22, 2025Updated 6 months ago
- Script that converts 7.1 surround sound files to virtual surround stereo using HeSuVi. For Dolby Atmos, check out: https://github.com/Thr…☆17Dec 26, 2020Updated 5 years ago
- Build your own offline AI from any documents. Free. No coding. LoRA fine-tuning + RAG + GGUF export.☆89Mar 21, 2026Updated 3 weeks ago
- A simple Python + Tkinter + Tesseract-based GUI image-to-text copypaste pad application☆10Sep 14, 2023Updated 2 years ago
- ☆11Mar 5, 2024Updated 2 years ago
- Sources for a Medium article☆11Dec 2, 2021Updated 4 years ago