Run fast LLM Inference using Llama.cpp in Python
★19 · Jan 3, 2024 · Updated 2 years ago
Alternatives and similar repositories for llama-cpp-python-bindings
Users that are interested in llama-cpp-python-bindings are comparing it to the libraries listed below.
- Workshop Notebook and assets for the Anthropic Hackathon ★12 · Nov 4, 2023 · Updated 2 years ago
- Generate & query embeddings from VTT files using OpenAI & Pinecone on Andrej Karpathy's latest GPT tutorial ★19 · Oct 9, 2023 · Updated 2 years ago
- Test-Time Memory Framework: Control Hallucinations in Foundation Models ★11 · Nov 4, 2025 · Updated 7 months ago
- LLM-Blender: Ensembling framework that maximizes LLM performance via pairwise ranking. Employs PairRanker to rank candidates and GenFuser… ★37 · Jun 19, 2026 · Updated last week
- Your Python AI Coder! ★36 · May 21, 2025 · Updated last year
- This is "Your Private StackOverflow" app that helps you perform generative search in your code bases. This is built using open-source sta… ★11 · Aug 14, 2023 · Updated 2 years ago
- Tools to convert AIML code into RiveScript code. ★19 · Dec 16, 2016 · Updated 9 years ago
- Perplexity Lite using Langgraph, Tavily, and GPT-4. ★14 · Jan 11, 2024 · Updated 2 years ago
- Use LLMs to clean your gmail inbox ★22 · Dec 23, 2023 · Updated 2 years ago
- ★11 · Jan 7, 2023 · Updated 3 years ago
- All the solutions for the Sliding window Algorithms ★12 · May 5, 2022 · Updated 4 years ago
- A swarm of LLM agents that will help you test, document, and productionize your code! ★19 · Jun 22, 2026 · Updated last week
- Code for the paper "Measuring Bias in Contextualized Word Representations" ★35 · Jul 19, 2019 · Updated 6 years ago
- ★10 · Feb 3, 2020 · Updated 6 years ago
- Repo for my youtube tutorial ★13 · Nov 23, 2023 · Updated 2 years ago
- ★12 · May 17, 2021 · Updated 5 years ago
- Ruby on rails app using aws-sdk-ruby ★12 · Aug 30, 2018 · Updated 7 years ago
- Examples in the MLX framework ★11 · Sep 23, 2024 · Updated last year
- ComfyUI-Direct3D-S2 is now available in ComfyUI, Direct3D-S2 - Gigascale 3D Generation Made Easy with Spatial Sparse Attention. Direct3D-… ★17 · Jun 10, 2025 · Updated last year
- Catalyst example of a grid-based video app that opens videos in secondary windows ★22 · Feb 8, 2022 · Updated 4 years ago
- Mac App Store: Embedding a Command Line tool using paths as arguments ★20 · May 16, 2021 · Updated 5 years ago
- This repository holds files and scripts for incorporating simple CI/CD practices for model training in ML. ★20 · Oct 26, 2021 · Updated 4 years ago
- Update a DNS record on Cloudflare with the public IP of the running machine. ★11 · Mar 2, 2026 · Updated 4 months ago
- UWP app with MinGW-w64 ★17 · Feb 11, 2025 · Updated last year
- An attempt at making it easier to setup tables with SQLite ★14 · Feb 12, 2016 · Updated 10 years ago
- Node.js Gab.com API Client ★12 · Dec 9, 2022 · Updated 3 years ago
- TreeView, UITreeView ★14 · Jun 5, 2017 · Updated 9 years ago
- C++ Dynamic loader generator for C APIs ★18 · Sep 6, 2018 · Updated 7 years ago
- Gradio chat interface for FastMLX ★12 · Sep 22, 2024 · Updated last year
- Flexible and transparent Python Boruta implementation ★15 · Jun 8, 2025 · Updated last year
- ★23 · Jul 10, 2023 · Updated 2 years ago
- Associated code for the Quickstart tutorial ★17 · Aug 18, 2023 · Updated 2 years ago
- Mixture of Expert (MoE) techniques for enhancing LLM performance through expert-driven prompt mapping and adapter combinations. ★12 · Feb 11, 2024 · Updated 2 years ago
- Huggingface Backup - Jupyter, Colab and Python Script ★10 · Jun 22, 2026 · Updated last week
- Renderize Menu from Json & Nested select from Json