b0kch01 / llama-cpu
🦙 Inference code for LLaMA models (modified for CPU)
☆12 Updated 2 years ago
Alternatives and similar repositories for llama-cpu:
Users who are interested in llama-cpu are comparing it to the libraries listed below.
- ChatGPT Plugin for performing basic arithmetic operations ☆18 Updated last year
- ☆19 Updated 2 years ago
- Nougat is Meta AI's OCR model designed to transcribe scientific PDFs into an easy-to-use Markdown format. ☆22 Updated last year
- Use OpenAI with HuggingChat by emulating the text_generation_inference_server ☆45 Updated last year
- Speech to text to speech using ElevenLabs ☆28 Updated last year
- Speaker prediction for captions on the Lex Fridman podcast ☆25 Updated last year
- Demo Python script app to interact with a llama.cpp server using the Whisper API, microphone, and webcam devices. ☆46 Updated last year
- Experimental sampler to make LLMs more creative ☆30 Updated last year
- GGML implementation of BERT model with Python bindings and quantization. ☆56 Updated last year
- A lightweight Python library for running TTS models with a unified API. ☆17 Updated last month
- ☆47 Updated last month
- Example of Alpaca-LoRA with llama index. ☆31 Updated 2 years ago
- ☆65 Updated 2 years ago
- The Benefits of a Concise Chain of Thought on Problem Solving in Large Language Models ☆21 Updated 4 months ago
- Embedding models from Jina AI ☆58 Updated last year
- A JS web client for connecting to Pipecat bots with voice and vision ☆43 Updated 3 months ago
- ONNX-compatible Fast SeamlessM4T: Massively Multilingual & Multimodal Machine Translation ☆43 Updated last year
- ☆13 Updated last year
- ☆32 Updated last year
- Real-time processing and delivery of sentences from a continuous stream of characters or text chunks. ☆46 Updated last month
- Conduct consumer interviews with synthetic focus groups using LLMs and LangChain ☆43 Updated last year
- ☆38 Updated last year
- Command-line script for inferencing from models such as MPT-7B-Chat ☆101 Updated last year
- A TextTiling-based algorithm for text segmentation (aka topic segmentation) that uses neural sentence encoders, as well as extractive sum… ☆45 Updated 2 years ago
- LLM finetuning ☆42 Updated last year
- QLoRA: Efficient Finetuning of Quantized LLMs ☆11 Updated last year
- ☆28 Updated 2 years ago
- ☆33 Updated last year
- Canvas-based talking head model using viseme data ☆30 Updated last year
- Using multiple LLMs for ensemble forecasting ☆16 Updated last year