🦖 X—LLM: Cutting Edge & Easy LLM Finetuning
☆408Jan 17, 2024Updated 2 years ago
Alternatives and similar repositories for xllm
Users that are interested in xllm are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Курс по глубокому обучению в обработке естественных языков для магистров компьютерной лингвистики Высшей Школы Экономики☆49Sep 5, 2022Updated 3 years ago
- Probing suite for evaluation of Russian embedding and language models☆33Oct 1, 2024Updated last year
- Pytorch library for end-to-end transformer models training, inference and serving☆70Apr 19, 2025Updated last year
- Tools for merging pretrained large language models.☆7,083May 6, 2026Updated 2 weeks ago
- Pipeline for fast building text classification TF-IDF + LogReg baselines.☆61Nov 6, 2021Updated 4 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- Multi-LoRA inference server that scales to 1000s of fine-tuned LLMs☆3,781May 21, 2025Updated 11 months ago
- Python Server for C3 AI app. A project that brings the power of Large Language Models (LLM) and Retrieval-Augmented Generation (RAG) with…☆24Jan 7, 2024Updated 2 years ago
- Robust recipes to align language models with human and AI preferences☆5,602Apr 8, 2026Updated last month
- Distilabel is a framework for synthetic data and AI feedback for engineers who need fast, reliable and scalable pipelines based on verifi…☆3,217Apr 27, 2026Updated 3 weeks ago
- Language modeling and instruction tuning for Russian☆462Aug 20, 2024Updated last year
- Automatically evaluate your LLMs in Google Colab☆688May 7, 2024Updated 2 years ago
- Go ahead and axolotl questions☆11,938Updated this week
- C++ inference wrappers for running blazing fast embedding services on your favourite serverless like AWS Lambda. By Prithivi Da, PRs welc…☆23Mar 4, 2024Updated 2 years ago
- REST API for sentence tokenization and embedding using Multilingual Universal Sentence Encoder.☆51Sep 5, 2021Updated 4 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- High level library for batched embeddings generation, blazingly-fast web-based RAG and quantized indexes processing ⚡☆70Nov 17, 2025Updated 6 months ago
- This repository contains a wrapper for keras models with tf-hub (elmo and bert), which allows you to use them as a keras layer, with the …☆20Jun 10, 2019Updated 6 years ago
- Efficient few-shot learning with Sentence Transformers☆2,735Apr 17, 2026Updated last month
- ☆375Dec 4, 2023Updated 2 years ago
- A project that brings the power of Large Language Models (LLM) and Retrieval-Augmented Generation (RAG) within reach of everyone, particu…☆37Jan 7, 2024Updated 2 years ago
- CraftML is a restful web service for easy pipeline creation without code.☆13Apr 18, 2021Updated 5 years ago
- BSNLP 2021☆33Nov 3, 2024Updated last year
- ☆13Dec 7, 2022Updated 3 years ago
- S-LoRA: Serving Thousands of Concurrent LoRA Adapters☆1,913Jan 21, 2024Updated 2 years ago
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- Easily use and train state of the art late-interaction retrieval methods (ColBERT) in any RAG pipeline. Designed for modularity and ease-…☆3,919May 17, 2025Updated last year
- The repository for the code of the UltraFastBERT paper☆518Mar 24, 2024Updated 2 years ago
- Convert ML Models to Flask + Gunicorn + Docker Service.☆12Oct 15, 2022Updated 3 years ago
- Planet: Understanding the Amazon from Space☆12Jul 23, 2017Updated 8 years ago
- Low latency, High Accuracy, Custom Query routers for Humans and Agents. Built by Prithivi Da☆121Mar 31, 2025Updated last year
- ☆13Feb 26, 2023Updated 3 years ago
- Large Language Model Text Generation Inference☆10,853Mar 21, 2026Updated last month
- Extend existing LLMs way beyond the original training length with constant memory usage, without retraining☆736Apr 10, 2024Updated 2 years ago
- LLM Finetuning with peft☆2,924Aug 1, 2025Updated 9 months ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- ⚡ Build your chatbot within minutes on your favorite device; offer SOTA compression techniques for LLMs; run LLMs efficiently on Intel Pl…☆2,179Oct 8, 2024Updated last year
- [ICLR 2024] Efficient Streaming Language Models with Attention Sinks☆7,229Jul 11, 2024Updated last year
- Simple and efficient pytorch-native transformer text generation in <1000 LOC of python.☆6,205Aug 22, 2025Updated 8 months ago
- MII makes low-latency and high-throughput inference possible, powered by DeepSpeed.☆2,110Jun 30, 2025Updated 10 months ago
- Toolkit for attaching, training, saving and loading of new heads for transformer models☆297Feb 12, 2026Updated 3 months ago
- Enforce the output format (JSON Schema, Regex etc) of a language model☆2,012Apr 4, 2026Updated last month
- Argilla is a collaboration tool for AI engineers and domain experts to build high-quality datasets☆4,975Apr 27, 2026Updated 3 weeks ago