KoboldCpp Smart Launcher with GPU Layer and Tensor Override Tuning
☆30May 18, 2025Updated last year
Alternatives and similar repositories for TensorTune
Users who are interested in TensorTune are comparing it to the libraries listed below.
- ☆19Aug 19, 2025Updated 9 months ago
- A simple, "Ollama-like" tool for managing and running GGUF language models from your terminal.☆25Jan 2, 2026Updated 5 months ago
- 🤖 AI-powered CLI for file reorganization. Runs fully locally — no data leaves your machine.☆20Jul 2, 2025Updated 11 months ago
- An OpenAI API compatible images server to generate or manipulate images.☆18Feb 2, 2025Updated last year
- ☆47Apr 29, 2026Updated last month
- ☆54May 28, 2025Updated last year
- ☆16Dec 16, 2024Updated last year
- Stable Diffusion and Flux in pure C/C++☆25Jun 3, 2026Updated last week
- 🗣️ Real‑time, low‑latency voice, vision, and conversational‑memory AI assistant built on LiveKit and local LLMs☆111Jun 25, 2025Updated 11 months ago
- Create text chunks which end at natural stopping points without using a tokenizer☆26Nov 26, 2025Updated 6 months ago
- ☆51Feb 19, 2025Updated last year
- Llama.cpp runner/swapper and proxy that emulates LMStudio / Ollama backends☆58Aug 21, 2025Updated 9 months ago
- CompChomper is a framework for measuring how LLMs perform at code completion.☆21Apr 29, 2025Updated last year
- Cohere Toolkit is a collection of prebuilt components enabling users to quickly build and deploy RAG applications.☆30Jan 19, 2025Updated last year
- 🎮 Material You TUI for monitoring NVIDIA GPUs☆58Jan 16, 2026Updated 4 months ago
- Personal voice assistant, with voice interruption and Twilio support☆18Feb 24, 2025Updated last year
- A public implementation of the ReLoRA pretraining method, built on Lightning-AI's Pytorch Lightning suite.☆34Mar 2, 2024Updated 2 years ago
- Gaussian Splatting for Meshroom. Gaussian splatting enables high-quality novel-view synthesis using explicit 3D Gaussian representations …☆28May 29, 2026Updated last week
- Yet another frontend for LLM, written using .NET and WinUI 3☆11Sep 14, 2025Updated 8 months ago
- Neo AI integrates into the Linux terminal, capable of executing system commands and providing helpful information.☆137Mar 9, 2026Updated 3 months ago
- 3Dprint profiles for Flashforge printers in Ultimate Cura slicer.☆11Jun 23, 2022Updated 3 years ago
- Bookmarklet to pull and run hugging face GGUF models in Ollama☆18Oct 17, 2024Updated last year
- 🚀 Scale your RAG pipeline using Ragswift: A scalable centralized embeddings management platform☆38Jan 29, 2024Updated 2 years ago
- Synthetic data for fine tuning LLM☆27Dec 26, 2024Updated last year
- ☆57Oct 10, 2025Updated 8 months ago
- Eidos – A Self-Growing AI Agent with Long-Term Memory and Environmental Awareness☆23Jul 4, 2025Updated 11 months ago
- Port of Facebook's LLaMA model in C/C++☆23May 4, 2026Updated last month
- The one who calls upon functions - Function-Calling Language Model☆36Oct 2, 2023Updated 2 years ago
- OpenSCAD script for generating pipe fitting automatically.☆19Mar 9, 2025Updated last year
- Your personal ArXiv Feed☆23Dec 18, 2024Updated last year
- My version of an LLM Websearch Agent using a local SearXNG server because SearXNG is great.☆45Jan 27, 2026Updated 4 months ago
- The TinyLlama project is an open endeavor to pretrain a 1.1B Llama model on 3 trillion tokens.☆14Mar 30, 2024Updated 2 years ago
- ☆20Sep 28, 2024Updated last year
- A reference of text models that can be used in the AI Horde☆12May 31, 2026Updated last week
- Simple Tool Caller for llama.cpp☆11Aug 12, 2024Updated last year
- Code for paper: "QuIP: 2-Bit Quantization of Large Language Models With Guarantees" adapted for Llama models☆41Aug 4, 2023Updated 2 years ago
- Forked from ggerganov/llama.cpp☆17Updated this week
- empirically chooses -ngl param for llama.cpp☆20Mar 19, 2025Updated last year
- Run multiple resource-heavy Large Models (LM) on the same machine with limited amount of VRAM/other resources by exposing them on differe…☆90Updated this week