jorahn / llama-int8
Quantized inference code for LLaMA models
☆13 Updated 2 years ago
Alternatives and similar repositories for llama-int8
Users who are interested in llama-int8 are comparing it to the libraries listed below.
- A library for incremental loading of large PyTorch checkpoints ☆56 Updated 2 years ago
- An OpenAI API-compatible LLM inference server based on ExLlamaV2. ☆25 Updated last year
- Demonstration that fine-tuning a RoPE model on sequences longer than those used in pre-training adapts the model's context limit ☆63 Updated last year
- ☆22 Updated last year
- Trying to deconstruct RWKV in understandable terms ☆14 Updated 2 years ago
- A library for simplifying fine-tuning with multi-GPU setups in the Hugging Face ecosystem. ☆16 Updated 7 months ago
- Simple LLM inference server ☆20 Updated 11 months ago
- ☆27 Updated last year
- Assign color hues to a collection of text fragments based on embeddings ☆20 Updated 11 months ago
- Steer LLM outputs towards a certain topic/subject and enhance response capabilities using activation engineering by adding steering vecto… ☆43 Updated last year
- ☆14 Updated last year
- A playground to make it easy to try crazy things ☆33 Updated last month
- GGML implementation of the BERT model with Python bindings and quantization. ☆55 Updated last year
- Stable Diffusion Google Colab kernel ☆10 Updated 2 years ago
- LMQL implementation of tree of thoughts ☆34 Updated last year
- ☆19 Updated 2 years ago
- Port of Facebook's LLaMA model in C/C++ ☆21 Updated last year
- Image Generation API Server - Similar to https://text-generator.io but for images ☆50 Updated this week
- Experimental sampler to make LLMs more creative ☆31 Updated last year
- Preprint: Sheared LLaMA: Accelerating Language Model Pre-training via Structured Pruning ☆28 Updated last year
- ☆40 Updated 2 years ago
- Simple, fast, parallel Hugging Face GGML model downloader written in Python ☆24 Updated last year
- A repository containing datasets and tools to train a watermark classifier. ☆68 Updated 2 years ago
- Merge LLMs that are split into parts ☆26 Updated last year
- ☆32 Updated 2 years ago
- Use Datasette to explore LAION improved_aesthetics_6plus training data used by Stable Diffusion ☆58 Updated last year
- Training hybrid models for dummies. ☆21 Updated 4 months ago
- ☆28 Updated 9 months ago
- Zeus LLM Trainer is a rewrite of Stanford Alpaca aiming to be the trainer for all Large Language Models ☆69 Updated last year
- Controllable Language Model Interactions in TypeScript ☆9 Updated last year