☆21Oct 1, 2024Updated last year
Alternatives and similar repositories for LLM-inference-speed-benchmarks
Users that are interested in LLM-inference-speed-benchmarks are comparing it to the libraries listed below.
- Attend - to what matters.☆17Feb 22, 2025Updated last year
- Steam Cloud Save Manager☆11Sep 26, 2025Updated 6 months ago
- Note about running ollama 🦙☆36May 2, 2024Updated last year
- A free and open-source GUI tool that simplifies combining multiple code files into one, with automatic labeling and support for various p…☆14Jan 3, 2025Updated last year
- QLoRA: Efficient Finetuning of Quantized LLMs☆11Jul 22, 2023Updated 2 years ago
- Precision Knowledge Editing (PKE): A novel method to reduce toxicity in LLMs while preserving performance, with robust evaluations and ha…☆11Nov 26, 2024Updated last year
- An RPG Maker MZ plugin☆12Nov 2, 2023Updated 2 years ago
- Pytorch implementation of NASA: NEURAL ARTICULATED SHAPE APPROXIMATION☆12May 4, 2021Updated 4 years ago
- Explore semantic caching to reduce your OpenAI/LLM API bill☆11Jul 21, 2023Updated 2 years ago
- A Lethal Company (EasySave3) Data Editor. It should work with any game that utilizes EasySave, however the project is setup for Lethal Co…☆10Jan 15, 2024Updated 2 years ago
- ☆14Sep 24, 2024Updated last year
- Unified System Interface, framework, server, GUI and Remote API☆15Mar 19, 2026Updated last week
- Simple implementation of an AABB Tree (Axis Aligned Bounding Box Tree) to optimize 3d collision detection☆10Oct 22, 2024Updated last year
- cursor logs with gpt-4o using litellm proxy☆14Sep 9, 2025Updated 6 months ago
- A curated collection of OpenClaw resources: GCP installation guide, best practices, and use cases☆20Feb 19, 2026Updated last month
- Load and run Llama from safetensors files in C☆15Oct 24, 2024Updated last year
- A simple implementation of anti-spam bot for itmo opensource chat☆11Sep 29, 2025Updated 5 months ago
- This project explores my adventures doing a deep dive of OpenAI embeddings with Neo4j during the Fixie AI + LLM Hackathon on Saturday, Se…☆15Sep 19, 2023Updated 2 years ago
- a character-ai like UI for LLM☆10Dec 3, 2024Updated last year
- ☆22Mar 25, 2025Updated last year
- ☆14Nov 23, 2018Updated 7 years ago
- telegram-chat-summariser☆10Nov 24, 2024Updated last year
- self implementation of DPPO, Distributed Proximal Policy Optimization, by using tensorflow☆12Sep 1, 2017Updated 8 years ago
- ☆16Mar 14, 2025Updated last year
- Parameter-Efficient Sparsity Crafting From Dense to Mixture-of-Experts for Instruction Tuning on General Tasks☆31May 22, 2024Updated last year
- ☆14Mar 18, 2025Updated last year
- Logseq Plugin to streamline Youtube note taking☆12Apr 9, 2022Updated 3 years ago
- ☆13Mar 6, 2024Updated 2 years ago
- ☆11May 20, 2022Updated 3 years ago
- Implementation of SoundStream from the paper: "SoundStream: An End-to-End Neural Audio Codec"☆13Jan 27, 2025Updated last year
- KMM: Key Frame Mask Mamba for Extended Motion Generation☆19Sep 22, 2025Updated 6 months ago
- ☆11Mar 5, 2024Updated 2 years ago
- Build your own offline AI from any documents. Free. No coding. LoRA fine-tuning + RAG + GGUF export.☆79Updated this week
- Large language model toolkit☆25Aug 1, 2025Updated 7 months ago
- Local Group Policy Editor plus more, for all Windows editions☆14Sep 29, 2025Updated 5 months ago
- Inference Llama/Llama2/Llama3 Models in NumPy☆21Nov 22, 2023Updated 2 years ago
- Repository mirror of GitLab: https://gitlab.com/rosarior/awesome-django http://awesome-django.com☆16Feb 22, 2018Updated 8 years ago
- Building AI Devops Assistant with Langchain, Postgres, and Ollama☆13Jun 12, 2024Updated last year
- ☆21Sep 20, 2025Updated 6 months ago