okuvshynov / slowllama
Finetune llama2-70b and codellama on MacBook Air without quantization
☆443Updated 5 months ago
Related projects: ⓘ
- Stateful load balancer custom-tailored for llama.cpp☆518Updated this week
- Visualize the intermediate output of Mistral 7B☆300Updated 7 months ago
- LLM Analytics☆593Updated last month
- Neum AI is a best-in-class framework to manage the creation and synchronization of vector embeddings at large scale.☆821Updated 8 months ago
- llama3.np is a pure NumPy implementation for Llama 3 model.☆955Updated 3 months ago
- JS tokenizer for LLaMA 1 and 2☆330Updated 2 months ago
- Fine-tune LLM agents with online reinforcement learning☆971Updated 6 months ago
- A FastAPI service for semantic text search using precomputed embeddings and advanced similarity measures, with built-in support for vario…☆920Updated this week
- The llama-cpp-agent framework is a tool designed for easy interaction with Large Language Models (LLMs). Allowing users to chat with LLM …☆467Updated last month
- An application for running LLMs locally on your device, with your documents, facilitating detailed citations in generated responses.☆459Updated this week
- Optimizing inference proxy for LLMs☆406Updated this week
- LLaMa retrieval plugin script using OpenAI's retrieval plugin☆326Updated last year
- ☆442Updated 3 weeks ago
- An implementation of bucketMul LLM inference☆212Updated 2 months ago
- Fine-tune mistral-7B on 3090s, a100s, h100s☆701Updated 11 months ago
- Prompt engineering for developers☆665Updated 7 months ago
- Code behind Arxiv Papers☆443Updated 5 months ago
- Multi-node production AI stack. Run the best of open source AI easily on your own servers. Create your own AI by fine-tuning open source …☆319Updated this week
- OpenAI-compatible Python client that can call any LLM☆364Updated last year
- A framework for building, experimenting, deploying, and continuously iterating on your LLM application☆290Updated this week
- C++ implementation for 💫StarCoder☆443Updated last year
- Bayesian Optimization as a Coverage Tool for Evaluating LLMs. Accurate evaluation (benchmarking) that's 10 times faster with just a few l…☆269Updated last month
- Serving multiple LoRA finetuned LLM as one☆946Updated 4 months ago
- Action library for AI Agent☆187Updated this week
- Customizable implementation of the self-instruct paper.☆1,004Updated 6 months ago
- An LLM-powered advanced RAG pipeline built from scratch☆785Updated 7 months ago
- [ICML'24 Spotlight] LLM Maybe LongLM: Self-Extend LLM Context Window Without Tuning☆594Updated 3 months ago
- Finetune a LLM to speak like you based on your WhatsApp Conversations☆339Updated 4 months ago
- Agents Capable of Self-Editing Their Prompts / Python Code☆732Updated 6 months ago
- Llama 2 Everywhere (L2E)☆1,510Updated last month