A comprehensive, step-by-step guide for successfully installing and running llama-cpp-python with CUDA GPU acceleration on Windows. This repository provides a definitive solution to the common installation challenges, including exact version requirements, environment setup, and troubleshooting tips.
☆20Jun 2, 2025Updated 11 months ago
Alternatives and similar repositories for windows-llama-cpp-python-cuda-guide
Users that are interested in windows-llama-cpp-python-cuda-guide are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- 실무로 통하는 LLM 애플리케이션 설계☆39Nov 12, 2025Updated 5 months ago
- ☆12Feb 5, 2023Updated 3 years ago
- danbooru的tag中文对照表☆21Mar 21, 2025Updated last year
- 基于edge-tts的简单语音合 成服务,支持私有化部署,支持和源阅读APP无缝对接。☆20Aug 19, 2025Updated 8 months ago
- MixFile命令行版本☆25Updated this week
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- AutoGen multi AI agent blog post writing using reflection☆12Updated this week
- Effect-ive Programming in Go☆39Oct 21, 2025Updated 6 months ago
- ☆37Jun 18, 2025Updated 10 months ago
- Python bot to automate Quillbot without Buying Premium☆10Oct 22, 2020Updated 5 years ago
- Gives each individual character their own memory.☆30Jun 1, 2025Updated 11 months ago
- A comprehensive MCP server for LightRAG integration with 22 tools for document management, querying, knowledge graph operations, and syst…☆30Aug 22, 2025Updated 8 months ago
- Make downloading scientific data much easier☆13Mar 3, 2026Updated 2 months ago
- Free WordPress plugin that unlocks PRO features in the Elementor page builder, including advanced widgets, theme and WooCommerce builders…☆39May 27, 2025Updated 11 months ago
- AI在线Tag选择器,基于开源项目改进,添加了一些新的功能☆30Mar 9, 2026Updated 2 months ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Pre-built wheels for llama-cpp-python across platforms and CUDA versions☆61Apr 18, 2026Updated 2 weeks ago
- ☆12Oct 4, 2025Updated 7 months ago
- A repo about deep learning and PyTorch covering basics, projects and ideas☆11Jan 31, 2021Updated 5 years ago
- Implementation of logistic regression using numpy☆15Aug 2, 2019Updated 6 years ago
- ☆19Feb 3, 2026Updated 3 months ago
- chatGPT integrated into Telegram using official OpenAI API☆16Mar 2, 2023Updated 3 years ago
- Скрипт для более приятной работы странички с проверкой ДЗ на сайте learn.innopolis.university☆10Feb 24, 2023Updated 3 years ago
- Implementation of proof of concept quantum enhanced reinforced learning algorithm, able to find the sequence of quantum gates needed to a…☆15Mar 29, 2022Updated 4 years ago
- A complete simulator for quantum computing☆19Jan 3, 2026Updated 4 months ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- Contain notes and code from my YT pytorch lessons. Храню записи и код со своих YT pytorch уроков☆11May 7, 2025Updated last year
- A principled library for tuning, training and evaluating tabular data synthesis on fidelity, privacy and utility. CCS 2025.☆26Aug 17, 2025Updated 8 months ago
- Code repository for The Automation Ahead series, showcasing practical examples for GenAI-driven automation in investments. Each installme…☆53Updated this week
- [NeurIPS 2023] Understanding and Improving Feature Learning for Out-of-Distribution Generalization☆29May 27, 2025Updated 11 months ago
- Probably one of the lightest native RAG + Agent apps out there,experience the power of Agent-powered models and Agent-driven knowledge ba…☆33May 30, 2025Updated 11 months ago
- [NeurIPS 2025 Spotlight] LeMiCa: Lexicographic Minimax Path Caching for Efficient Diffusion-Based Video Generation☆112Apr 28, 2026Updated last week
- Python SDK for Palabra AI's real-time speech-to-speech translation API. Break down language barriers and enable seamless communication ac…☆37Apr 13, 2026Updated 3 weeks ago
- ☆29Oct 20, 2025Updated 6 months ago
- The Python Implementation of CRISP: Clustering Multi-Vector Representations for Denoising and Pruning☆27Jul 27, 2025Updated 9 months ago
- Serverless GPU API endpoints on Runpod - Get Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- Your friendly AI text adventure!☆22Feb 12, 2025Updated last year
- ☆26Jun 8, 2023Updated 2 years ago
- ☆17Dec 11, 2025Updated 4 months ago
- Personalize Anything for Free with Diffusion Transformer,use it in comfyUI with wrapper mode☆44Mar 26, 2025Updated last year
- implementation of dualformer☆25Mar 1, 2025Updated last year
- Quantum-enhanced GPT-2☆15Mar 19, 2024Updated 2 years ago
- NextCoder: Robust Adaptation of Code LMs to Diverse Code Edits (ICML'25)☆44Jul 9, 2025Updated 10 months ago