🦖 X—LLM: Cutting Edge & Easy LLM Finetuning
☆408Jan 17, 2024Updated 2 years ago
Alternatives and similar repositories for xllm
Users that are interested in xllm are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Курс по глубокому обучению в обработке естественных языков для магистров компьютерной лингвистики Высшей Школы Экономики☆49Sep 5, 2022Updated 3 years ago
- Probing suite for evaluation of Russian embedding and language models☆33Oct 1, 2024Updated last year
- Pytorch library for end-to-end transformer models training, inference and serving☆70Apr 19, 2025Updated last year
- Tools for merging pretrained large language models.☆7,023Mar 15, 2026Updated last month
- Pipeline for fast building text classification TF-IDF + LogReg baselines.☆61Nov 6, 2021Updated 4 years ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- Multi-LoRA inference server that scales to 1000s of fine-tuned LLMs☆3,764May 21, 2025Updated 11 months ago
- Python Server for C3 AI app. A project that brings the power of Large Language Models (LLM) and Retrieval-Augmented Generation (RAG) with…☆24Jan 7, 2024Updated 2 years ago
- Robust recipes to align language models with human and AI preferences☆5,587Apr 8, 2026Updated 3 weeks ago
- Distilabel is a framework for synthetic data and AI feedback for engineers who need fast, reliable and scalable pipelines based on verifi…☆3,189Apr 20, 2026Updated last week
- Language modeling and instruction tuning for Russian☆463Aug 20, 2024Updated last year
- Automatically evaluate your LLMs in Google Colab☆688May 7, 2024Updated last year
- Go ahead and axolotl questions☆11,779Updated this week
- C++ inference wrappers for running blazing fast embedding services on your favourite serverless like AWS Lambda. By Prithivi Da, PRs welc…☆23Mar 4, 2024Updated 2 years ago
- REST API for sentence tokenization and embedding using Multilingual Universal Sentence Encoder.☆51Sep 5, 2021Updated 4 years ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- High level library for batched embeddings generation, blazingly-fast web-based RAG and quantized indexes processing ⚡☆70Nov 17, 2025Updated 5 months ago
- This repository contains a wrapper for keras models with tf-hub (elmo and bert), which allows you to use them as a keras layer, with the …☆20Jun 10, 2019Updated 6 years ago
- Efficient few-shot learning with Sentence Transformers☆2,720Apr 17, 2026Updated last week
- ☆375Dec 4, 2023Updated 2 years ago
- A project that brings the power of Large Language Models (LLM) and Retrieval-Augmented Generation (RAG) within reach of everyone, particu…☆38Jan 7, 2024Updated 2 years ago
- CraftML is a restful web service for easy pipeline creation without code.☆13Apr 18, 2021Updated 5 years ago
- BSNLP 2021☆33Nov 3, 2024Updated last year
- ☆13Dec 7, 2022Updated 3 years ago
- S-LoRA: Serving Thousands of Concurrent LoRA Adapters☆1,909Jan 21, 2024Updated 2 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- Easily use and train state of the art late-interaction retrieval methods (ColBERT) in any RAG pipeline. Designed for modularity and ease-…☆3,914May 17, 2025Updated 11 months ago
- The repository for the code of the UltraFastBERT paper☆518Mar 24, 2024Updated 2 years ago
- Convert ML Models to Flask + Gunicorn + Docker Service.☆12Oct 15, 2022Updated 3 years ago
- Planet: Understanding the Amazon from Space☆12Jul 23, 2017Updated 8 years ago
- Embedding Studio is a framework which allows you transform your Vector Database into a feature-rich Search Engine.☆384Apr 24, 2025Updated last year
- Low latency, High Accuracy, Custom Query routers for Humans and Agents. Built by Prithivi Da☆121Mar 31, 2025Updated last year
- ☆13Feb 26, 2023Updated 3 years ago
- Large Language Model Text Generation Inference☆10,843Mar 21, 2026Updated last month
- Extend existing LLMs way beyond the original training length with constant memory usage, without retraining☆736Apr 10, 2024Updated 2 years ago
- GPUs on demand by Runpod - Special Offer Available • AdRun AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
- LLM Finetuning with peft☆2,918Aug 1, 2025Updated 8 months ago
- ⚡ Build your chatbot within minutes on your favorite device; offer SOTA compression techniques for LLMs; run LLMs efficiently on Intel Pl…☆2,178Oct 8, 2024Updated last year
- [ICLR 2024] Efficient Streaming Language Models with Attention Sinks☆7,222Jul 11, 2024Updated last year
- Simple and efficient pytorch-native transformer text generation in <1000 LOC of python.☆6,198Aug 22, 2025Updated 8 months ago
- MII makes low-latency and high-throughput inference possible, powered by DeepSpeed.☆2,108Jun 30, 2025Updated 9 months ago
- Toolkit for attaching, training, saving and loading of new heads for transformer models☆296Feb 12, 2026Updated 2 months ago
- Enforce the output format (JSON Schema, Regex etc) of a language model☆2,011Apr 4, 2026Updated 3 weeks ago