openmarmot / aws-cft-llama-cpp
CloudFormation template to build a small llama.cpp server for trying large language models
☆11Updated 4 months ago
Alternatives and similar repositories for aws-cft-llama-cpp
Users who are interested in aws-cft-llama-cpp are comparing it to the repositories listed below
- Rust bindings for CTranslate2☆14Updated last year
- Monkey Island fine-tune of Stable Diffusion☆10Updated 2 years ago
- Apps that run on modal.com☆12Updated 11 months ago
- a version of baby agi using dspy and typed predictors☆17Updated last year
- This repository implements DSPy programs for tasks in Indian languages☆13Updated last year
- Run ONNX RWKV-v4 models with GPU acceleration using DirectML [Windows], or just on CPU [Windows AND Linux]; Limited to 430M model at this…☆20Updated 2 years ago
- Developer showcase of projects built on Cartesia☆17Updated 8 months ago
- 🐜🔧 A minimalistic tool to fine-tune your LLMs☆18Updated last year
- Code to evaluate performance for embeddings☆12Updated 7 months ago
- A novel approach for transformer model introspection that enables saving, compressing, and manipulating internal thought states for advan…☆19Updated last month
- A python library to find differences between audio and transcriptions☆20Updated last year
- A service which wraps and chains video and audio Hugging Face Spaces together☆14Updated 8 months ago
- Demonstration of gpt-2 model with flask+uwsgi+nginx in web environment containerized in docker for quick deployment.☆13Updated 2 years ago
- A CLI in Rust to generate synthetic data for MLX friendly training☆23Updated last year
- Figma Files Scraper for Research & Studies☆23Updated last year
- ☆28Updated last year
- A swarm of LLM agents that will help you test, document, and productionize your code!☆16Updated 3 weeks ago
- A clone of OpenAI's Tokenizer page for HuggingFace Models☆45Updated last year
- Speech to Speech conversation using the OpenAI RealTime API in Python 🐍☆26Updated 5 months ago
- Ask shortgpt for instant and concise answers☆13Updated 2 years ago
- LLMs sitting on a council together to decide, by consensus, who among them is the best.☆15Updated last week
- RootVC Dreambooth backend for TPUs☆13Updated 2 years ago
- Cog wrapper for collabora/WhisperSpeech☆25Updated last year
- ☆11Updated last year
- ☆31Updated last month
- Get up and running with Llama 2, Mistral, Gemma, and other large language models.☆16Updated last year
- Sentence Embedding as a Service☆15Updated last year
- ☆12Updated 2 weeks ago
- Dockerfile and web server for running GPT-J-6B on AWS GPU instances☆18Updated 3 years ago
- Proof of concept for running moshi/hibiki using webrtc☆18Updated 2 months ago