moxin-org / Moxin-LLM
☆83Updated 3 weeks ago
Alternatives and similar repositories for Moxin-LLM:
Users that are interested in Moxin-LLM are comparing it to the libraries listed below
- Simple examples using Argilla tools to build AI☆53Updated 3 months ago
- ☆98Updated 5 months ago
- Easy to use, High Performant Knowledge Distillation for LLMs☆48Updated last month
- ☆65Updated 8 months ago
- ☆111Updated 2 months ago
- AnyModal is a Flexible Multimodal Language Model Framework for PyTorch☆82Updated last month
- ☆124Updated 2 weeks ago
- ☆109Updated 5 months ago
- An easy-to-understand framework for LLM samplers that rewind and revise generated tokens☆130Updated this week
- Distributed Inference for mlx LLm☆82Updated 6 months ago
- Embed anything.☆29Updated 8 months ago
- Gradio based tool to run opensource LLM models directly from Huggingface☆91Updated 7 months ago
- GPT-4 Level Conversational QA Trained In a Few Hours☆58Updated 6 months ago
- ☆152Updated 7 months ago
- A stable, fast and easy-to-use inference library with a focus on a sync-to-async API☆45Updated 4 months ago
- Official homepage for "Self-Harmonized Chain of Thought" (NAACL 2025)☆89Updated 3 weeks ago
- Run multiple resource-heavy Large Models (LM) on the same machine with limited amount of VRAM/other resources by exposing them on differe…☆53Updated this week
- A pipeline parallel training script for LLMs.☆124Updated 3 weeks ago
- Conduct in-depth research with AI-driven insights : DeepDive is a command-line tool that leverages web searches and AI models to generate…☆36Updated 5 months ago
- High level library for batched embeddings generation, blazingly-fast web-based RAG and quantized indexes processing ⚡☆64Updated 3 months ago
- Fast parallel LLM inference for MLX☆163Updated 7 months ago
- ☆53Updated 8 months ago
- automatically quant GGUF models☆154Updated this week
- Self-hosted LLM chatbot arena, with yourself as the only judge☆36Updated last year
- Low-Rank adapter extraction for fine-tuned transformers models☆169Updated 9 months ago
- The simplest, fastest repository for training/finetuning medium-sized xLSTMs.☆39Updated 8 months ago
- run ollama & gguf easily with a single command☆49Updated 9 months ago
- Client Code Examples, Use Cases and Benchmarks for Enterprise h2oGPTe RAG-Based GenAI Platform☆82Updated last week
- A quick and optimized solution to manage llama based gguf quantized models, download gguf files, retreive messege formatting, add more mo…☆12Updated last year
- This small API downloads and exposes access to NeuML's txtai-wikipedia and full wikipedia datasets, taking in a query and returning full …☆76Updated 3 weeks ago