camenduru / guanaco-lambda
☆2 Updated last year
Alternatives and similar repositories for guanaco-lambda:
Users who are interested in guanaco-lambda are comparing it to the libraries listed below.
- All the world is a play, we are but actors in it. ☆47 Updated this week
- Gradio UI for a Cog API ☆66 Updated 11 months ago
- ☆22 Updated last year
- ☆111 Updated 3 months ago
- ☆66 Updated last year
- BUD-E (Buddy) is an open-source voice assistant framework that facilitates seamless interaction with AI models and APIs, enabling the cre… ☆32 Updated 8 months ago
- Automated prompting and scoring framework to evaluate LLMs using updated human knowledge prompts ☆111 Updated last year
- ☆28 Updated last year
- ☆22 Updated last year
- A high-throughput and memory-efficient inference and serving engine for LLMs (Windows build & kernels) ☆18 Updated this week
- The GeoV model is a large language model designed by Georges Harik and uses Rotary Positional Embeddings with Relative distances (RoPER).… ☆121 Updated last year
- ☆51 Updated this week
- ☆78 Updated last year
- Updated last month
- ☆25 Updated last year
- ☆30 Updated last year
- A collection of notebooks for the Hugging Face blog series (https://huggingface.co/blog). ☆44 Updated 7 months ago
- Inference of Large Multimodal Models in C/C++. LLaVA and others ☆46 Updated last year
- ☆27 Updated last year
- An unsupervised model merging algorithm for Transformers-based language models. ☆104 Updated 11 months ago
- ☆25 Updated last year
- inference code for mixtral-8x7b-32kseqlen ☆99 Updated last year
- ☆55 Updated last year
- DeepFloyd IF web UI ☆29 Updated last year
- ☆18 Updated last year
- Simple setup to self-host LLaMA3-70B model with an OpenAI API ☆19 Updated 11 months ago
- Landmark Attention: Random-Access Infinite Context Length for Transformers QLoRA ☆123 Updated last year
- Command-line script for inferencing from models such as MPT-7B-Chat ☆101 Updated last year
- Scripts to create your own moe models using mlx ☆89 Updated last year
- Mistral7B playing DOOM ☆28 Updated last year