spv420 / fastLLaMA
get LLaMA fast -- sets up llama-dl and llama.cpp automatically.
☆18Updated 2 years ago
Alternatives and similar repositories for fastLLaMA:
Users that are interested in fastLLaMA are comparing it to the libraries listed below
- Easy to deploy your LLM(large language model) server with no public address GPU machine.☆14Updated last year
- A small standalone flask python server for llama.cpp that acts like a KoboldAI api.☆13Updated last year
- 4 bits quantization of SantaCoder using GPTQ☆51Updated last year
- A package to interact with poe.com☆12Updated 2 years ago
- Inference of Large Multimodal Models in C/C++. LLaVA and others☆46Updated last year
- Train llama with lora on one 4090 and merge weight of lora to work as stanford alpaca.☆51Updated last year
- Model REVOLVER, a human in the loop model mixing system.☆33Updated last year
- ChatGPT-like Web UI for RWKVstic☆100Updated 2 years ago
- Code for the paper "SparseGPT: Massive Language Models Can Be Accurately Pruned in One-Shot" with LLaMA implementation.☆71Updated 2 years ago
- DeepFloyd IF web UI☆30Updated 2 years ago
- The paddle implementation of meta's LLaMA.☆45Updated 2 years ago
- Where we keep our notes about model training runs.☆16Updated 2 years ago
- LLaMA implementation for HuggingFace Transformers☆38Updated 2 years ago
- ☆82Updated 2 years ago
- Gradio UI for RWKV LLM☆29Updated 2 years ago
- A Qt GUI for large language models☆42Updated last year
- Instruct-tune LLaMA on consumer hardware with shareGPT data☆126Updated 2 years ago
- Generates ChatGPT/BingChat & GPT-4 prompts using this model trained by Kaludi. Enter a role and a prompt will be generated based on it.☆27Updated 2 years ago
- ☆14Updated last year
- A desktop application written in PyQT5 (python). Has support for using openai chatGPT as well as using a locally running llama model. Loc…☆77Updated last year
- Python bindings for the C++ port of GPT4All-J model.☆38Updated last year
- A reverse engineered Python API wrapper for OpenPlayground (nat.dev)☆76Updated 2 years ago
- ☆54Updated 2 years ago
- Large-Language-Model to Machine Interface project.☆18Updated last year
- convert a saved pytorch model to gguf and generate as much corresponding ggml c code as possible☆14Updated last year
- A simple package for leveraging Falcon 180B and the HF ecosystem's tools, including training/inference scripts, safetensors, integrations…☆13Updated last year
- A more memory-efficient rewrite of the HF transformers implementation of Llama for use with quantized weights.☆64Updated last year
- ☆27Updated last year
- A multi-modal AI Model that can generate high quality novel videos with text, images, or video clips.☆65Updated last year
- Inference script for Meta's LLaMA models using Hugging Face wrapper☆110Updated 2 years ago