b0kch01 / llama-cpu
🦙 Inference code for LLaMA models (modified for cpu)
☆12 · Updated 2 years ago
Alternatives and similar repositories for llama-cpu:
Users who are interested in llama-cpu are comparing it to the repositories listed below:
- Real-time processing and delivery of sentences from a continuous stream of characters or text chunks. ☆48 · Updated last week
- Demo python script app to interact with llama.cpp server using whisper API, microphone and webcam devices. ☆46 · Updated last year
- Fine-tuning Mistral 7B using Hugging Face, Weights and Biases, Choline, and Vast AI ☆38 · Updated last year
- A TextTiling-based algorithm for text segmentation (aka topic segmentation) that uses neural sentence encoders, as well as extractive sum… ☆46 · Updated 2 years ago
- Generate visual podcasts about novels using open source models ☆25 · Updated 2 years ago
- A cog implementation of MosaicML's MPT-7B-StoryWriter-65k+ Large Language Model ☆57 · Updated last year
- Using langchain, deeplake and openai to create a Q&A on the Mojo lang programming manual ☆22 · Updated last year
- ☆28 · Updated 2 years ago
- Command-line script for inferencing from models such as MPT-7B-Chat ☆101 · Updated last year
- ☆156 · Updated last year
- A bidirectional recurrent neural network model with attention mechanism for restoring missing punctuation in unsegmented text ☆36 · Updated 4 years ago
- The code we currently use to fine-tune models. ☆114 · Updated 11 months ago
- Open TTS models, built for streaming on the edge ☆39 · Updated last month
- ☆16 · Updated 3 years ago
- Powered by OpenAI Whisper & Gradio ☆30 · Updated 2 years ago
- Speech to text to speech using Elevenlabs ☆28 · Updated last year
- ☆74 · Updated last year
- Whisper Speaker Identification (WSI), a cutting-edge model for multilingual speaker identification. ☆14 · Updated last month
- Speaker Diarization with Transformers ☆64 · Updated 11 months ago
- faster-whisper as serverless endpoint ☆95 · Updated last week
- A lightweight Python library for running TTS models with a unified API. ☆17 · Updated 2 months ago
- GreenLIT: Using GPT-J with Multi-Task Learning to Create New Screenplays ☆17 · Updated 2 years ago
- ONNX-compatible Fast SeamlessM4T: Massively Multilingual & Multimodal Machine Translation ☆43 · Updated last year
- Joint speech-language model - respond directly to audio! ☆30 · Updated 11 months ago
- Fine-tuning 6-Billion GPT-J (& other models) with LoRA and 8-bit compression ☆66 · Updated 2 years ago
- Open-source Rewind.ai clone written in Rust and Vue running 100% locally with whisper.cpp ☆50 · Updated last year
- ChatGPT Plugin for performing basic arithmetic operations ☆18 · Updated last year
- Use OpenAI with HuggingChat by emulating the text_generation_inference_server ☆43 · Updated last year
- Tool to apply Legal Matter Specification Standard (LMSS) to documents ☆13 · Updated 8 months ago
- Simple, Fast, Parallel Hugging Face GGML model downloader written in Python ☆24 · Updated last year