catid / oaillama3Links
Simple setup to self-host LLaMA3-70B model with an OpenAI API
☆19Updated last year
Alternatives and similar repositories for oaillama3
Users that are interested in oaillama3 are comparing it to the libraries listed below
Sorting:
- ☆117Updated last year
- Full finetuning of large language models without large memory requirements☆94Updated 2 months ago
- An unsupervised model merging algorithm for Transformers-based language models.☆108Updated last year
- Gradio UI for a Cog API☆72Updated last year
- ☆55Updated 3 months ago
- ☆27Updated 2 years ago
- An OpenAI API compatible LLM inference server based on ExLlamaV2.☆25Updated last year
- 5X faster 60% less memory QLoRA finetuning☆21Updated last year
- ☆24Updated last year
- Experimental sampler to make LLMs more creative☆31Updated 2 years ago
- ☆68Updated last year
- Landmark Attention: Random-Access Infinite Context Length for Transformers QLoRA☆124Updated 2 years ago
- Web page with political compass quiz results for open LLMs☆37Updated last year
- A Javascript library (with Typescript types) to parse metadata of GGML based GGUF files.☆51Updated last year
- ☆24Updated last year
- Experimental LLM Inference UX to aid in creative writing☆127Updated last year
- implementation of https://arxiv.org/pdf/2312.09299☆21Updated last year
- inference code for mixtral-8x7b-32kseqlen☆104Updated 2 years ago
- Local LLM inference & management server with built-in OpenAI API☆31Updated last year
- The first AI artist☆32Updated 2 years ago
- A repository of prompts and Python scripts for intelligent transformation of raw text into diverse formats.☆30Updated 2 years ago
- Parameter-Efficient Sparsity Crafting From Dense to Mixture-of-Experts for Instruction Tuning on General Tasks☆31Updated last year
- Image Generation API Server - Similar to https://text-generator.io but for images☆52Updated 3 months ago
- ☆50Updated last year
- Simple LLM inference server☆20Updated last year
- Command-line script for inferencing from models such as falcon-7b-instruct☆75Updated 2 years ago
- An easy-to-understand framework for LLM samplers that rewind and revise generated tokens☆150Updated 10 months ago
- ☆63Updated last year
- Model REVOLVER, a human in the loop model mixing system.☆33Updated 2 years ago
- Genertaes control vectors for use with llama.cpp in GGUF format.☆35Updated 9 months ago