monatis / lmm.cpp
Inference of Large Multimodal Models in C/C++. LLaVA and others
☆46Updated 11 months ago
Related projects: ⓘ
- LLaVA server (llama.cpp).☆173Updated 10 months ago
- Local LLM inference & management server with built-in OpenAI API☆30Updated 4 months ago
- an implementation of Self-Extend, to expand the context window via grouped attention☆117Updated 8 months ago
- ☆28Updated this week
- ☆31Updated 8 months ago
- Video+code lecture on building nanoGPT from scratch☆64Updated 3 months ago
- After my server ui improvements were successfully merged, consider this repo a playground for experimenting, tinkering and hacking around…☆56Updated last month
- Python bindings for ggml☆125Updated 2 weeks ago
- A pipeline parallel training script for LLMs.☆79Updated 3 weeks ago
- Embeddings focused small version of Llama NLP model☆101Updated last year
- Scripts to create your own moe models using mlx☆86Updated 6 months ago
- Local ML voice chat using high-end models.☆138Updated last week
- Gradio based tool to run opensource LLM models directly from Huggingface☆84Updated 2 months ago
- ☆26Updated last year
- Port of Microsoft's BioGPT in C/C++ using ggml☆87Updated 6 months ago
- Steer LLM outputs towards a certain topic/subject and enhance response capabilities using activation engineering by adding steering vecto…☆40Updated 5 months ago
- inference code for mixtral-8x7b-32kseqlen☆97Updated 9 months ago
- GRDN.AI app for garden optimization☆68Updated 7 months ago
- Port of Suno AI's Bark in C/C++ for fast inference☆50Updated 5 months ago
- Deploy your GGML models to HuggingFace Spaces with Docker and gradio☆34Updated last year
- ☆37Updated 9 months ago
- GPT-4 Level Conversational QA Trained In a Few Hours☆53Updated 3 weeks ago
- Demo python script app to interact with llama.cpp server using whisper API, microphone and webcam devices.☆44Updated 10 months ago
- GPT-2 small trained on phi-like data☆65Updated 7 months ago
- Gradio UI for a Cog API☆62Updated 5 months ago
- Low-Rank adapter extraction for fine-tuned transformers model☆154Updated 4 months ago
- The simplest, fastest repository for training/finetuning medium-sized xLSTMs.☆38Updated 3 months ago
- Port of Suno's Bark TTS transformer in Apple's MLX Framework☆62Updated 7 months ago
- An OpenAI API compatible API for chat with image input and questions about the images. aka Multimodal.☆157Updated this week
- Command-line script for inferencing from models such as MPT-7B-Chat☆102Updated last year