AI-Commandos / LLaMa2lang
Convenience scripts to finetune (chat-)LLaMa3 and other models for any language
☆301Updated 8 months ago
Alternatives and similar repositories for LLaMa2lang:
Users that are interested in LLaMa2lang are comparing it to the libraries listed below
- ☆200Updated 9 months ago
- A fast batching API to serve LLM models☆181Updated 10 months ago
- Generate Synthetic Data Using OpenAI, MistralAI or AnthropicAI☆224Updated 10 months ago
- The llama-cpp-agent framework is a tool designed for easy interaction with Large Language Models (LLMs). Allowing users to chat with LLM …☆538Updated 3 weeks ago
- Dataset Crafting w/ RAG/Wikipedia ground truth and Efficient Fine-Tuning Using MLX and Unsloth. Includes configurable dataset annotation …☆175Updated 7 months ago
- 🚀 Retrieval Augmented Generation (RAG) with txtai. Combine search and LLMs to find insights with your own data.☆343Updated 2 months ago
- Comparison of the output quality of quantization methods, using Llama 3, transformers, GGUF, EXL2.☆146Updated 9 months ago
- ☆80Updated 2 months ago
- A bagel, with everything.☆317Updated 11 months ago
- A library for easily merging multiple LLM experts, and efficiently train the merged LLM.☆446Updated 6 months ago
- function calling-based LLM agents☆283Updated 5 months ago
- Fully fine-tune large models like Mistral, Llama-2-13B, or Qwen-14B completely for free☆230Updated 4 months ago
- One click templates for inferencing Language Models☆162Updated this week
- FastMLX is a high performance production ready API to host MLX models.☆268Updated this week
- A multimodal, function calling powered LLM webui.☆215Updated 5 months ago
- A simple Jupyter Notebook for learning MLX text-completion fine-tuning!☆111Updated 4 months ago
- The RunPod worker template for serving our large language model endpoints. Powered by vLLM.☆293Updated this week
- A compact LLM pretrained in 9 days by using high quality data☆298Updated 3 months ago
- Convenient wrapper for fine-tuning and inference of Large Language Models (LLMs) with several quantization techniques (GTPQ, bitsandbytes…☆147Updated last year
- Joint speech-language model - respond directly to audio!☆368Updated 8 months ago
- Start a server from the MLX library.☆179Updated 7 months ago
- Fine-tune mistral-7B on 3090s, a100s, h100s☆706Updated last year
- idea: https://github.com/nyxkrage/ebook-groupchat/☆86Updated 6 months ago
- Stateful control of Large Language Models☆114Updated this week
- An OpenAI-like LLaMA inference API☆113Updated last year
- llama.cpp with BakLLaVA model describes what does it see☆384Updated last year
- FineTune LLMs in few lines of code (Text2Text, Text2Speech, Speech2Text)☆236Updated last year
- ☆147Updated last month
- An application for running LLMs locally on your device, with your documents, facilitating detailed citations in generated responses.☆545Updated 4 months ago
- This is our own implementation of 'Layer Selective Rank Reduction'☆233Updated 9 months ago