NVIDIA Linux open GPU with P2P support
☆316Jun 2, 2026Updated last month
Alternatives and similar repositories for open-gpu-kernel-modules
Users that are interested in open-gpu-kernel-modules are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- NVIDIA Linux open GPU with P2P support☆1,385Jun 6, 2025Updated last year
- ☆32Jul 2, 2025Updated last year
- ☆12May 30, 2025Updated last year
- SGLang Kernel Wheel Index☆23Jun 26, 2026Updated last week
- Blazingly fast neighborhood attention☆15Nov 28, 2023Updated 2 years ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Desktop application for instant AI-powered text transformation. Translate, correct, summarize, and change the tone of any text, anywhere,…☆35Dec 29, 2025Updated 6 months ago
- ☆108May 31, 2025Updated last year
- Low overhead tracing library and trace visualizer for pipelined CUDA kernels☆136Nov 26, 2025Updated 7 months ago
- An OpenAI API compatible LLM inference server based on ExLlamaV2.☆25Feb 9, 2024Updated 2 years ago
- Your universal AI text processor, powered by local and cloud LLMs. Edit, refactor, and transform text in any application on Windows, macO…☆74Nov 9, 2025Updated 7 months ago
- Tools to develop characters and maps for Churn Vector.☆14Oct 6, 2025Updated 8 months ago
- llama.cpp fork with additional SOTA quants and improved performance☆2,804Jun 26, 2026Updated last week
- No cloudflare or other uncessessary stuff for character card archive. works using the backup torrent from before the site shut down☆29Jan 4, 2026Updated 6 months ago
- ☆245Sep 30, 2025Updated 9 months ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Production-grade agent orchestration for Claude Code - 11 agents, 46 MCP tools, SQLite+FTS5, drift detection, consensus checkpoints☆51Jun 8, 2026Updated 3 weeks ago
- ☆13Jun 18, 2024Updated 2 years ago
- The entire open source TokenRing ecosystem☆19Updated this week
- Crashbench is a LLM benchmark to measure bug-finding and reporting capabilities of LLMs☆14Mar 8, 2026Updated 3 months ago
- An optimized quantization and inference library for running LLMs locally on modern consumer-class GPUs☆987Jun 26, 2026Updated last week
- Testing LLM reasoning abilities with family relationship quizzes.☆63Jan 28, 2025Updated last year
- [ICLR'25] Official repository of paper: Ranking-aware adapter for text-driven image ordering with CLIP☆16Apr 17, 2025Updated last year
- A practical way of learning Swizzle☆42Feb 3, 2025Updated last year
- ☆14May 25, 2023Updated 3 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- 🌳 MCTS-inspired parallel beam search for conversation optimization. Explore multiple dialogue strategies simultaneously, stress-test a…☆36Jan 18, 2026Updated 5 months ago
- A single-line modification to any (dualizer-based) optimizer that allows the optimizer to adapt to the scale of the gradients as they cha…☆19Jan 11, 2025Updated last year
- ☆18Mar 12, 2025Updated last year
- High-performance embedded graph database for analytics and real-time transactions☆118Updated this week
- The official API server for Exllama. OAI compatible, lightweight, and fast.☆1,261Jun 27, 2026Updated last week
- A proxy that hosts multiple single-model runners such as LLama.cpp and vLLM☆12May 30, 2025Updated last year
- ☆21Mar 22, 2021Updated 5 years ago
- ☆13Jun 13, 2025Updated last year
- Linux based GDDR6/GDDR6X VRAM temperature reader for NVIDIA RTX 3000/4000 series GPUs.☆125Apr 25, 2025Updated last year
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- ☆111Aug 21, 2025Updated 10 months ago
- ☆18Dec 2, 2024Updated last year
- AdaLLM is an NVFP4-first inference runtime for Ada Lovelace (RTX 4090) with FP8 KV cache and custom decode kernels. This repo targets NVF…☆134Feb 15, 2026Updated 4 months ago
- code for Towards Data Science article on prompt-loss-weight☆11Jun 4, 2025Updated last year
- Web UI for ExLlamaV2☆513Feb 5, 2025Updated last year
- A frontend for creative writing with LLMs☆167Jul 15, 2024Updated last year
- Simple HTML template library for C++☆14Feb 3, 2021Updated 5 years ago