AI-Commandos / LLaMa2lang
Convenience scripts to finetune (chat-)LLaMa3 and other models for any language
☆304Updated 10 months ago
Alternatives and similar repositories for LLaMa2lang:
Users that are interested in LLaMa2lang are comparing it to the libraries listed below
- ☆204Updated 10 months ago
- A fast batching API to serve LLM models☆182Updated 11 months ago
- Dataset Crafting w/ RAG/Wikipedia ground truth and Efficient Fine-Tuning Using MLX and Unsloth. Includes configurable dataset annotation …☆179Updated 9 months ago
- Generate Synthetic Data Using OpenAI, MistralAI or AnthropicAI☆223Updated 11 months ago
- The llama-cpp-agent framework is a tool designed for easy interaction with Large Language Models (LLMs). Allowing users to chat with LLM …☆551Updated 2 months ago
- Comparison of the output quality of quantization methods, using Llama 3, transformers, GGUF, EXL2.☆150Updated 11 months ago
- 🚀 Retrieval Augmented Generation (RAG) with txtai. Combine search and LLMs to find insights with your own data.☆354Updated 3 weeks ago
- Fully fine-tune large models like Mistral, Llama-2-13B, or Qwen-14B completely for free☆231Updated 5 months ago
- An OpenAI API compatible API for chat with image input and questions about the images. aka Multimodal.☆250Updated last month
- Large-scale LLM inference engine☆1,384Updated this week
- A multimodal, function calling powered LLM webui.☆214Updated 6 months ago
- ☆284Updated 2 weeks ago
- A simple Jupyter Notebook for learning MLX text-completion fine-tuning!☆117Updated 5 months ago
- Convenient wrapper for fine-tuning and inference of Large Language Models (LLMs) with several quantization techniques (GTPQ, bitsandbytes…☆147Updated last year
- function calling-based LLM agents☆285Updated 7 months ago
- Web UI for ExLlamaV2☆492Updated 2 months ago
- This is our own implementation of 'Layer Selective Rank Reduction'☆235Updated 10 months ago
- An application for running LLMs locally on your device, with your documents, facilitating detailed citations in generated responses.☆579Updated 5 months ago
- ☆153Updated 9 months ago
- Convert Compute And Books Into Instruct-Tuning Datasets! Makes: QA, RP, Classifiers.☆1,414Updated 2 months ago
- The RunPod worker template for serving our large language model endpoints. Powered by vLLM.☆307Updated this week
- Low-Rank adapter extraction for fine-tuned transformers models☆171Updated 11 months ago
- Automatically evaluate your LLMs in Google Colab☆615Updated 11 months ago
- Tune any FALCON in 4-bit☆466Updated last year
- A library for easily merging multiple LLM experts, and efficiently train the merged LLM.☆472Updated 7 months ago
- Joint speech-language model - respond directly to audio!☆369Updated 9 months ago
- llama.cpp with BakLLaVA model describes what does it see☆383Updated last year
- idea: https://github.com/nyxkrage/ebook-groupchat/☆86Updated 8 months ago
- An AI assistant beyond the chat box.☆325Updated last year
- A python package for developing AI applications with local LLMs.☆147Updated 3 months ago