krychu / llamaLinks
Inference code for LLaMA models on CPU and Mac M1/M2 GPU
☆79Updated 2 years ago
Alternatives and similar repositories for llama
Users that are interested in llama are comparing it to the libraries listed below
Sorting:
- Finetune a LLM to speak like you based on your WhatsApp Conversations☆378Updated last year
- LLaMA Cog template☆303Updated last year
- Finetune llama2-70b and codellama on MacBook Air without quantization☆450Updated last year
- A simple "Be My Eyes" web app with a llama.cpp/llava backend☆493Updated 2 years ago
- Langchain realworld examples in JS☆185Updated 11 months ago
- Finetuning of Falcon-7B LLM using QLoRA on Mental Health Conversational Dataset☆262Updated 2 years ago
- OpenAI-compatible Python client that can call any LLM☆372Updated 2 years ago
- A OpenAI API compatible REST server for llama.☆208Updated 11 months ago
- Fine-tune mistral-7B on 3090s, a100s, h100s☆725Updated 2 years ago
- Wanderlust OpenAI example using Solara☆215Updated 2 years ago
- An mlx project to train a base model on your whatsapp chats using (Q)Lora finetuning☆173Updated 2 years ago
- llama.cpp with BakLLaVA model describes what does it see☆380Updated 2 years ago
- JS tokenizer for LLaMA 1 and 2☆363Updated last year
- A package for visualising Chroma vector collections in 3D☆110Updated 2 years ago
- A FastAPI service for semantic text search using precomputed embeddings and advanced similarity measures, with built-in support for vario…☆1,043Updated 11 months ago
- LLaMa retrieval plugin script using OpenAI's retrieval plugin☆323Updated 2 years ago
- Fine-tuning LLM on my Telegram chats☆217Updated 2 years ago
- Create repos and commits with AI.☆300Updated 2 years ago
- ☆135Updated 2 years ago
- ☆207Updated 2 years ago
- Train a language model to answer Slack messages as you.☆258Updated 10 months ago
- Neum AI is a best-in-class framework to manage the creation and synchronization of vector embeddings at large scale.☆867Updated 2 years ago
- Get 100% uptime, reliability from OpenAI. Handle Rate Limit, Timeout, API, Keys Errors☆695Updated 2 years ago
- A fully in-browser privacy solution to make Conversational AI privacy-friendly☆234Updated last year
- LLM plugin for running models using MLC☆192Updated last year
- LLM-based tool for parsing information and chatting with it☆215Updated 2 years ago
- Run inference on replit-3B code instruct model using CPU☆160Updated 2 years ago
- ☆115Updated last year
- An LLM-powered advanced RAG pipeline built from scratch☆856Updated 2 years ago
- Falcon LLM with Chat UI using LangChain and Chainlit☆172Updated 2 years ago