catid / oaillama3
Simple setup to self-host LLaMA3-70B model with an OpenAI API
☆20Updated 9 months ago
Alternatives and similar repositories for oaillama3:
Users that are interested in oaillama3 are comparing it to the libraries listed below
- ☆24Updated last year
- Easy to use, High Performant Knowledge Distillation for LLMs☆40Updated 3 weeks ago
- 5X faster 60% less memory QLoRA finetuning☆21Updated 8 months ago
- Cog wrapper for collabora/WhisperSpeech☆25Updated 10 months ago
- This benchmark tests how well LLMs incorporate a set of 10 mandatory story elements (characters, objects, core concepts, attributes, moti…☆38Updated last week
- Experimental sampler to make LLMs more creative☆30Updated last year
- Parameter-Efficient Sparsity Crafting From Dense to Mixture-of-Experts for Instruction Tuning on General Tasks☆31Updated 8 months ago
- ☆41Updated 9 months ago
- The simplest, fastest repository for training/finetuning medium-sized xLSTMs.☆38Updated 8 months ago
- ☆49Updated this week
- ☆27Updated last year
- ☆74Updated last year
- A memory manager essential for evolving AI to be more human-like, enabling dynamic, context-aware responses through structured memory han…☆27Updated 9 months ago
- entropix style sampling + GUI☆25Updated 3 months ago
- ☆109Updated last month
- A Javascript library (with Typescript types) to parse metadata of GGML based GGUF files.☆45Updated 6 months ago
- Image Generation API Server - Similar to https://text-generator.io but for images☆49Updated last month
- An unsupervised model merging algorithm for Transformers-based language models.☆102Updated 9 months ago
- The original BabyAGI, updated with LiteLLM and no vector database reliance (csv instead)☆21Updated 3 months ago
- ☆24Updated 9 months ago
- Scripts to create your own moe models using mlx☆86Updated 11 months ago
- Gradio UI for a Cog API☆65Updated 9 months ago
- ☆16Updated last year
- Large Model Proxy is designed to make it easy to run multiple resource-heavy Large Models (LM) on the same machine with limited amount of…☆49Updated this week
- ☆46Updated 9 months ago
- ☆65Updated 8 months ago
- klmbr - a prompt pre-processing technique to break through the barrier of entropy while generating text with LLMs☆67Updated 4 months ago
- A clone of OpenAI's Tokenizer page for HuggingFace Models☆44Updated last year
- Modified Stanford-Alpaca Trainer for Training Replit's Code Model☆40Updated last year