A comprehensive, step-by-step guide for successfully installing and running llama-cpp-python with CUDA GPU acceleration on Windows. This repository provides a definitive solution to the common installation challenges, including exact version requirements, environment setup, and troubleshooting tips.
☆20 · Jun 2, 2025 · Updated 11 months ago
Alternatives and similar repositories for windows-llama-cpp-python-cuda-guide
Users that are interested in windows-llama-cpp-python-cuda-guide are comparing it to the libraries listed below.
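- LLM application design that works in practice ☆39 · Nov 12, 2025 · Updated 6 months ago
- ☆12 · Feb 5, 2023 · Updated 3 years ago
- Chinese translation table for danbooru tags ☆22 · Mar 21, 2025 · Updated last year
- A simple speech synthesis service based on edge-tts; supports private deployment and seamless integration with the 源阅读 app. ☆20 · Aug 19, 2025 · Updated 9 months ago
- Command-line version of MixFile ☆26 · May 22, 2026 · Updated last week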
- Windows desktop control panel for local llama.cpp server ☆164 · May 4, 2026 · Updated 3 weeks ago
- AutoGen multi AI agent blog post writing using reflection ☆12 · Updated this week
- Effect-ive Programming in Go ☆39 · Oct 21, 2025 · Updated 7 months ago
- ☆39 · Jun 18, 2025 · Updated 11 months ago
- Python bot to automate Quillbot without buying Premium ☆10 · Oct 22, 2020 · Updated 5 years ago
- Gives each individual character their own memory. ☆30 · Jun 1, 2025 · Updated 11 months ago
- A comprehensive MCP server for LightRAG integration with 22 tools for document management, querying, knowledge graph operations, and syst… ☆30 · Aug 22, 2025 · Updated 9 months ago
- Make downloading scientific data much easier ☆13 · Mar 3, 2026 · Updated 2 months ago
- Online AI tag selector, improved from an open-source project with some new features added ☆30 · Mar 9, 2026 · Updated 2 months ago
- Pre-built wheels for llama-cpp-python across platforms and CUDA versions ☆68 · Apr 18, 2026 · Updated last month
- ☆12 · Oct 4, 2025 · Updated 7 months ago
- Free WordPress plugin that unlocks PRO features in the Elementor page builder, including advanced widgets, theme and WooCommerce builders… ☆40 · May 27, 2025 · Updated last year
- A repo about deep learning and PyTorch covering basics, projects and ideas ☆11 · Jan 31, 2021 · Updated 5 years ago
- Implementation of logistic regression using numpy ☆15 · Aug 2, 2019 · Updated 6 years ago
- ☆19 · Updated this week
- ChatGPT integrated into Telegram using the official OpenAI API ☆16 · Mar 2, 2023 · Updated 3 years ago
- Script that makes the homework-checking page on learn.innopolis.university more pleasant to use ☆10 · Feb 24, 2023 · Updated 3 years ago
- Implementation of a proof-of-concept quantum-enhanced reinforcement learning algorithm, able to find the sequence of quantum gates needed to a… ☆15 · Mar 29, 2022 · Updated 4 years ago
- A complete simulator for quantum computing ☆19 · Jan 3, 2026 · Updated 4 months ago
- Contains notes and code from my YouTube PyTorch lessons. ☆12 · May 7, 2025 · Updated last year
- A principled library for tuning, training and evaluating tabular data synthesis on fidelity, privacy and utility. CCS 2025. ☆26 · Aug 17, 2025 · Updated 9 months ago
- Code repository for The Automation Ahead series, showcasing practical examples for GenAI-driven automation in investments. Each installme… ☆55 · May 8, 2026 · Updated 3 weeks ago
- [NeurIPS 2023] Understanding and Improving Feature Learning for Out-of-Distribution Generalization ☆29 · May 27, 2025 · Updated last year
- Probably one of the lightest native RAG + Agent apps out there; experience the power of Agent-powered models and Agent-driven knowledge ba… ☆33 · May 30, 2025 · Updated 11 months ago
- Python SDK for Palabra AI's real-time speech-to-speech translation API. Break down language barriers and enable seamless communication ac… ☆38 · Apr 13, 2026 · Updated last month
- ☆29 · Oct 20, 2025 · Updated 7 months ago
- [NeurIPS 2025 Spotlight] LeMiCa: Lexicographic Minimax Path Caching for Efficient Diffusion-Based Video Generation ☆117 · Apr 28, 2026 · Updated last month
- The Python Implementation of CRISP: Clustering Multi-Vector Representations for Denoising and Pruning ☆27 · Jul 27, 2025 · Updated 10 months ago
- Your friendly AI text adventure! ☆22 · Feb 12, 2025 · Updated last year
- ☆26 · Jun 8, 2023 · Updated 2 years ago
- ☆17 · Dec 11, 2025 · Updated 5 months ago
- Implementation of Dualformer ☆25 · Mar 1, 2025 · Updated last year
- Personalize Anything for Free with Diffusion Transformer; use it in ComfyUI with wrapper mode ☆44 · Mar 26, 2025 · Updated last year
- Quantum-enhanced GPT-2 ☆15 · Mar 19, 2024 · Updated 2 years ago