Finetune llama2-70b and codellama on MacBook Air without quantization
☆449Mar 28, 2024Updated 2 years ago
Alternatives and similar repositories for slowllama
Users that are interested in slowllama are comparing it to the libraries listed below.
- Llama 2 Everywhere (L2E)☆1,526Aug 27, 2025Updated 10 months ago
- Seamlessly integrate LLMs as Python functions☆2,410Mar 11, 2026Updated 3 months ago
- A fast inference library for running LLMs locally on modern consumer-class GPUs☆4,567Mar 4, 2026Updated 3 months ago
- A simple "Be My Eyes" web app with a llama.cpp/llava backend☆496Nov 28, 2023Updated 2 years ago
- Turn expensive prompts into cheap fine-tuned models☆2,814May 25, 2024Updated 2 years ago
- Toolkit for fine-tuning, ablating and unit-testing open-source LLMs.☆871May 4, 2026Updated last month
- A FastAPI service for semantic text search using precomputed embeddings and advanced similarity measures, with built-in support for vario…☆1,053Feb 27, 2025Updated last year
- An extensible, easy-to-use, and portable diffusion web UI 👨‍🎨☆1,669Aug 18, 2023Updated 2 years ago
- pykoi: Active learning in one unified interface☆411Sep 24, 2025Updated 9 months ago
- Simple UI for LLM Model Finetuning☆2,052Dec 21, 2023Updated 2 years ago
- Lightweight inference library for ONNX files, written in C++. It can run Stable Diffusion XL 1.0 on a RPI Zero 2 (or in 298MB of RAM) but…☆2,080Jun 18, 2026Updated last week
- Agents Capable of Self-Editing Their Prompts / Python Code☆819Mar 15, 2024Updated 2 years ago
- Neum AI is a best-in-class framework to manage the creation and synchronization of vector embeddings at large scale.☆865Jan 15, 2024Updated 2 years ago
- Locust on k8s example for scalable load tests☆14Apr 16, 2022Updated 4 years ago
- Structured Outputs☆14,273Updated this week
- Count Tokens of Code (forked from gocloc)☆45Aug 19, 2024Updated last year
- ☆3,357Feb 25, 2024Updated 2 years ago
- ☆25Sep 19, 2023Updated 2 years ago
- CodeTF: One-stop Transformer Library for State-of-the-art Code LLM☆1,479May 1, 2025Updated last year
- AutoChain: Build lightweight, extensible, and testable LLM Agents☆1,877Dec 16, 2025Updated 6 months ago
- Create and share easy-to-make, built-to-last, innovative, and customizable experiences☆34Feb 21, 2024Updated 2 years ago
- Flacuna was developed by fine-tuning Vicuna on Flan-mini, a comprehensive instruction collection encompassing various tasks. Vicuna is al…☆112Sep 10, 2023Updated 2 years ago
- ☆615Mar 4, 2024Updated 2 years ago
- Distribute and run LLMs with a single file.☆25,105Updated this week
- Fine-tune LLM agents with online reinforcement learning☆1,250Mar 19, 2024Updated 2 years ago
- Automated Identification of Redundant Layer Blocks for Pruning in Large Language Models☆267Apr 23, 2024Updated 2 years ago
- ☆1,275Oct 24, 2023Updated 2 years ago
- LLMs built upon Evol Instruct: WizardLM, WizardCoder, WizardMath☆9,485Jun 7, 2025Updated last year
- Examples in the MLX framework☆8,765Apr 6, 2026Updated 2 months ago
- 💭 Chat with AI via API☆33Oct 20, 2024Updated last year
- [ICLR 2024] Efficient Streaming Language Models with Attention Sinks☆7,233Jul 11, 2024Updated last year
- 🌸 Run LLMs at home, BitTorrent-style. Fine-tuning and inference up to 10x faster than offloading☆10,248Sep 7, 2024Updated last year
- An LLM-powered advanced RAG pipeline built from scratch☆859Jan 26, 2024Updated 2 years ago
- Distilled variant of Whisper for speech recognition. 6x faster, 50% smaller, within 1% word error rate.☆4,086Jan 8, 2025Updated last year
- Finetune an LLM to speak like you based on your WhatsApp Conversations☆380May 5, 2024Updated 2 years ago
- A more memory-efficient rewrite of the HF transformers implementation of Llama for use with quantized weights.☆2,924Sep 30, 2023Updated 2 years ago
- An LLM-based autonomous agent controlling real-world applications via RESTful APIs☆1,399Jun 7, 2024Updated 2 years ago
- DataDM is your private data assistant. Slide into your data's DMs☆386Oct 6, 2024Updated last year
- An Open Source text-to-speech system built by inverting Whisper.☆4,620Dec 14, 2025Updated 6 months ago