b0kch01 / llama-cpuLinks
๐ฆ Inference code for LLaMA models (modified for cpu)
โ12Updated 2 years ago
Alternatives and similar repositories for llama-cpu
Users that are interested in llama-cpu are comparing it to the libraries listed below
Sorting:
- Demo python script app to interact with llama.cpp server using whisper API, microphone and webcam devices.โ46Updated last year
- ONNX-compatible Fast SeamlessM4TโMassively Multilingual & Multimodal Machine Translationโ43Updated last year
- โ ChatGPT Plugin for performing basic arithmetic operationsโ18Updated 2 years ago
- Speech to text to speech using Elevenlabsโ28Updated 2 years ago
- โ158Updated 2 years ago
- Speaker prediction for captions on the Lex Fridman podcastโ28Updated last year
- A cog implementation of MosaicML's MPT-7B-StoryWriter-65k+ Large Language Modelโ57Updated 2 years ago
- Code for OpenAI Whisper Web App Demoโ93Updated 2 years ago
- โ175Updated last year
- HuggingChat like UI in Gradioโ71Updated 2 years ago
- This shows the results from using a second, filter LLM that analyses prompts before sending them to GPT-Chatโ113Updated 2 years ago
- A reverse engineered Python API wrapper for OpenPlayground (nat.dev)โ76Updated 2 years ago
- canvas-based talking head model using viseme dataโ32Updated last year
- Real-time processing and delivery of sentences from a continuous stream of characters or text chunks.โ68Updated last month
- Open TTS models, built for streaming on the edgeโ43Updated 4 months ago
- Example of Alpaca-LoRA with llama index.โ31Updated 2 years ago
- โ21Updated last year
- โ64Updated 2 years ago
- Port of Microsoft's BioGPT in C/C++ using ggmlโ87Updated last year
- LLaVA server (llama.cpp).โ181Updated last year
- whisper.cpp bindings for pythonโ100Updated last year
- An approach to creating the perfect prompt for any image generation task.โ29Updated 2 years ago
- Scripts to create your own moe models using mlxโ90Updated last year
- Thin wrapper around OpenAI Whisper API with streaming supportโ89Updated 6 months ago
- โ10Updated last year
- The code we currently use to fine-tune models.โ114Updated last year
- โ74Updated last year
- This public GitHub repository contains code for a fully self-hosted, on-premise transcription solution.โ53Updated 8 months ago
- GGUF Quantization of any LLM.โ40Updated last year
- Incredibly descriptive audiovisual summaries for videosโ41Updated last year