aikitoria / open-gpu-kernel-modules
NVIDIA Linux open GPU with P2P support
☆50Updated 3 weeks ago
Alternatives and similar repositories for open-gpu-kernel-modules
Users who are interested in open-gpu-kernel-modules are comparing it to the repositories listed below.
- InferX is an Inference Function as a Service Platform☆133Updated last week
- Comparison of the output quality of quantization methods, using Llama 3, transformers, GGUF, EXL2.☆163Updated last year
- Transplants vocabulary between language models, enabling the creation of draft models for speculative decoding WITHOUT retraining.☆42Updated 2 weeks ago
- ☆43Updated 5 months ago
- Run multiple resource-heavy Large Models (LM) on the same machine with limited amount of VRAM/other resources by exposing them on differe…☆82Updated last week
- ☆100Updated last month
- KoboldCpp Smart Launcher with GPU Layer and Tensor Override Tuning☆27Updated 4 months ago
- SLOP Detector and analyzer based on dictionary for shareGPT JSON and text☆74Updated 10 months ago
- Automatically quantize GGUF models☆202Updated this week
- ☆133Updated 4 months ago
- klmbr - a prompt pre-processing technique to break through the barrier of entropy while generating text with LLMs☆80Updated last year
- ☆83Updated this week
- A pipeline parallel training script for LLMs.☆159Updated 4 months ago
- Sparse Inferencing for transformer based LLMs☆197Updated last month
- Distributed Inference for MLX LLM☆95Updated last year
- LLM Inference on consumer devices☆124Updated 6 months ago
- An optimized quantization and inference library for running LLMs locally on modern consumer-class GPUs☆499Updated this week
- Multi-Agent Step Race Benchmark: Assessing LLM Collaboration and Deception Under Pressure. A multi-player “step-race” that challenges LLM…☆73Updated 3 weeks ago
- Train your own small bitnet model☆75Updated 11 months ago
- Testing LLM reasoning abilities with family relationship quizzes.☆63Updated 7 months ago
- Guaranteed Structured Output from any Language Model via Hierarchical State Machines☆146Updated 3 months ago
- Easily view and modify JSON datasets for large language models☆83Updated 4 months ago
- The DPAB-α Benchmark☆29Updated 8 months ago
- An OpenAI API-compatible API for chat with image input and questions about the images, aka multimodal.☆260Updated 6 months ago
- ☆209Updated 2 weeks ago
- A Python package for serving LLMs on OpenAI-compatible API endpoints with prompt caching using MLX.☆95Updated 2 months ago
- GPU Power and Performance Manager☆61Updated 11 months ago
- An OpenAI API compatible LLM inference server based on ExLlamaV2.☆25Updated last year
- Autonomous, agentic, creative story writing system that incorporates stored embeddings and Knowledge Graphs.☆78Updated this week
- An unsupervised model merging algorithm for Transformers-based language models.☆107Updated last year