AI-Commandos / LLaMa2langLinks
Convenience scripts to finetune (chat-)LLaMa3 and other models for any language
☆315Updated last year
Alternatives and similar repositories for LLaMa2lang
Users that are interested in LLaMa2lang are comparing it to the libraries listed below
Sorting:
- A fast batching API to serve LLM models☆189Updated last year
- ☆206Updated last year
- Dataset Crafting w/ RAG/Wikipedia ground truth and Efficient Fine-Tuning Using MLX and Unsloth. Includes configurable dataset annotation …☆191Updated last year
- The llama-cpp-agent framework is a tool designed for easy interaction with Large Language Models (LLMs). Allowing users to chat with LLM …☆610Updated 10 months ago
- function calling-based LLM agents☆289Updated last year
- An AI assistant beyond the chat box.☆328Updated last year
- A multimodal, function calling powered LLM webui.☆217Updated last year
- Generate Synthetic Data Using OpenAI, MistralAI or AnthropicAI☆222Updated last year
- ☆168Updated 2 years ago
- An OpenAI API compatible API for chat with image input and questions about the images. aka Multimodal.☆267Updated 9 months ago
- An application for running LLMs locally on your device, with your documents, facilitating detailed citations in generated responses.☆622Updated last year
- ☆162Updated 10 months ago
- Comparison of the output quality of quantization methods, using Llama 3, transformers, GGUF, EXL2.☆165Updated last year
- This is our own implementation of 'Layer Selective Rank Reduction'☆240Updated last year
- An OpenAI-like LLaMA inference API☆113Updated 2 years ago
- llama.cpp with BakLLaVA model describes what does it see☆379Updated 2 years ago
- Falcon LLM ggml framework with CPU and GPU support☆248Updated last year
- TheBloke's Dockerfiles☆308Updated last year
- Convenient wrapper for fine-tuning and inference of Large Language Models (LLMs) with several quantization techniques (GTPQ, bitsandbytes…☆146Updated 2 years ago
- Low-Rank adapter extraction for fine-tuned transformers models☆180Updated last year
- An mlx project to train a base model on your whatsapp chats using (Q)Lora finetuning☆172Updated last year
- Automatically evaluate your LLMs in Google Colab☆677Updated last year
- Web UI for ExLlamaV2☆514Updated 10 months ago
- The RunPod worker template for serving our large language model endpoints. Powered by vLLM.☆388Updated this week
- Fully fine-tune large models like Mistral, Llama-2-13B, or Qwen-14B completely for free☆233Updated last year
- FineTune LLMs in few lines of code (Text2Text, Text2Speech, Speech2Text)☆246Updated last year
- 🚀 Retrieval Augmented Generation (RAG) with txtai. Combine search and LLMs to find insights with your own data.☆429Updated 3 weeks ago
- Joint speech-language model - respond directly to audio!☆372Updated last year
- Tune any FALCON in 4-bit☆465Updated 2 years ago
- Domain Adapted Language Modeling Toolkit - E2E RAG☆334Updated last year