Run fast LLM Inference using Llama.cpp in Python
☆19Jan 3, 2024Updated 2 years ago
Alternatives and similar repositories for llama-cpp-python-bindings
Users that are interested in llama-cpp-python-bindings are comparing it to the libraries listed below
Sorting:
- This project helps you to record your voice using iPhone.☆30Nov 19, 2012Updated 13 years ago
- Test-Time Memory Framework: Control Hallucinations in Foundation Models☆11Nov 4, 2025Updated 3 months ago
- Your Python AI Coder!☆36May 21, 2025Updated 9 months ago
- A Python3 implementation of OPC-XML-DA.☆10Nov 9, 2023Updated 2 years ago
- All the solutions for the Sliding window Algorithms☆12May 5, 2022Updated 3 years ago
- NDIToolbox is an open source extensible signal and image processing application under development by TRI/Austin designed to assist with t…☆10Aug 19, 2018Updated 7 years ago
- 👾 template repo for getting started with opengl together with imgui using cmake☆10Jul 20, 2024Updated last year
- Emotion based music recommender system☆11Mar 26, 2025Updated 11 months ago
- Templates for musical textual inversion for riffusion☆11Apr 14, 2023Updated 2 years ago
- Write your next novel faster and easier☆15Dec 7, 2025Updated 2 months ago
- This project aims to utilize Generative AI for the next marketing strategy in the case of e-commerce customer segmentation.☆12Mar 19, 2024Updated last year
- An MCP server implementation providing a standardized interface for LLMs to interact with the Atla API.☆17Jul 21, 2025Updated 7 months ago
- ComfyUI-Direct3D‑S2 is now available in ComfyUI, Direct3D‑S2 - Gigascale 3D Generation Made Easy with Spatial Sparse Attention. Direct3D‑…☆16Jun 10, 2025Updated 8 months ago
- Find similarity between two strings, based on Dice Similarity Coefficient DSC☆13Jan 23, 2023Updated 3 years ago
- Streamlines the creation of dataset to train a Large Language Model with triplets : instruction-input-output . The default configuration …☆13Apr 17, 2023Updated 2 years ago
- A knowledge graph based forward chain inferencing engine in typescript/node.☆11Jan 23, 2021Updated 5 years ago
- In-browser semantic search demo using EmbeddingGemma and Transformers.js. No server required.☆30Sep 7, 2025Updated 5 months ago
- ☆10Nov 17, 2024Updated last year
- FlaskRestful + Swagger UI + Docker Compose + Unit Test | How to organize Python Code for REST API☆14Jun 5, 2022Updated 3 years ago
- Gradio chat interface for FastMLX☆12Sep 22, 2024Updated last year
- Huggingface Backup - Jupyter, Colab and Python Script☆10Jan 20, 2026Updated last month
- Mixture of Expert (MoE) techniques for enhancing LLM performance through expert-driven prompt mapping and adapter combinations.☆12Feb 11, 2024Updated 2 years ago
- Flexible and transparent Python Boruta implementation☆15Jun 8, 2025Updated 8 months ago
- Examples in the MLX framework☆11Sep 23, 2024Updated last year
- Rivet plugin to access E2B goodies☆10Feb 6, 2025Updated last year
- ☆11Aug 26, 2024Updated last year
- A jekyll template for easy creation of course websites. Checkout the template here:☆11Aug 1, 2024Updated last year
- A Next.js chat app to use Llama 2 locally using node-llama-cpp☆12Oct 27, 2024Updated last year
- Code repository for TIDMAD: Time series Dataset for Discovering Dark Matter with AI Denoising.☆15Oct 23, 2025Updated 4 months ago
- ☆15May 20, 2017Updated 8 years ago
- Semi-supervised and unsupervised anomaly detection by mining numerical workflow relations from system logs (Accepted by Automated Softwar…☆10Sep 29, 2022Updated 3 years ago
- Various agents from all of the top agent frameworks to integrate into swarms! Langchain, Griptape, CrewAI, and more!☆18Dec 22, 2025Updated 2 months ago
- ☆12Feb 16, 2026Updated 2 weeks ago
- My Gen AI research☆11Jun 3, 2024Updated last year
- Central hub for demos, code snippets, and other assets for Azure Cosmos DB for AI apps.☆13Apr 9, 2025Updated 10 months ago
- This fully reconfigurable action, validates conformity with Azure Developer CLI template standards.☆20Oct 8, 2025Updated 4 months ago
- Automate the batch upload and parsing of documents into Dify's knowledge base, reducing manual intervention and wait time.☆15Aug 29, 2024Updated last year
- [Intelligenza Artificiale] The official repo for the paper: "CLAM: A Synergistic Deep Learning Model for Multi-Step Stock Price Trend For…☆13Mar 22, 2025Updated 11 months ago
- A tiny server to run local inference on MLX model in the style of OpenAI☆13Jan 31, 2024Updated 2 years ago