b0kch01 / llama-cpuLinks
๐ฆ Inference code for LLaMA models (modified for cpu)
โ12Updated 2 years ago
Alternatives and similar repositories for llama-cpu
Users that are interested in llama-cpu are comparing it to the libraries listed below
Sorting:
- Real-time processing and delivery of sentences from a continuous stream of characters or text chunks.โ74Updated 6 months ago
- Demo python script app to interact with llama.cpp server using whisper API, microphone and webcam devices.โ46Updated 2 years ago
- LLaVA server (llama.cpp).โ183Updated 2 years ago
- GGML implementation of BERT model with Python bindings and quantization.โ58Updated last year
- โ ChatGPT Plugin for performing basic arithmetic operationsโ18Updated 2 years ago
- Speaker prediction for captions on the Lex Fridman podcastโ27Updated last year
- โ158Updated 2 years ago
- HuggingChat like UI in Gradioโ70Updated 2 years ago
- an optimized, production-ready implementation of active speaker detectionโ80Updated last year
- A cog implementation of MosaicML's MPT-7B-StoryWriter-65k+ Large Language Modelโ58Updated 2 years ago
- Whisper realtime streaming for long speech-to-text transcription and translationโ121Updated 2 years ago
- ONNX-compatible Fast SeamlessM4TโMassively Multilingual & Multimodal Machine Translationโ43Updated 2 years ago
- Port of Microsoft's BioGPT in C/C++ using ggmlโ86Updated last year
- This public GitHub repository contains code for a fully self-hosted, on-premise transcription solution.โ53Updated last year
- Drop in replacement for OpenAI, but with Open models.โ156Updated 2 years ago
- Open TTS models, built for streaming on the edgeโ45Updated 10 months ago
- Python bindings for llama.cppโ198Updated 2 years ago
- OpenAI Whisper + davinci for podcast summarizationโ70Updated 2 years ago
- Scripts to create your own moe models using mlxโ90Updated last year
- Improving transcription performance of OpenAI Whisper for CPU based deploymentโ258Updated 3 years ago
- Speech recognition & diarisation solution with text alignment, deployed in AML pipelinesโ100Updated last year
- The code we currently use to fine-tune models.โ117Updated last year
- A lightweight end-to-end text-to-speech modelโ126Updated 11 months ago
- large language model for mastering data analysis using pandasโ48Updated 2 years ago
- Use OpenAI with HuggingChat by emulating the text_generation_inference_serverโ44Updated 2 years ago
- โ258Updated last year
- Speech to text to speech using Elevenlabsโ28Updated 2 years ago
- OVALChat is a customizable Web app aimed at conducting user studies with chatbotsโ29Updated 2 years ago
- โ10Updated 2 years ago
- Let's create synthetic textbooks together :)โ76Updated 2 years ago