🦖 X—LLM: Cutting Edge & Easy LLM Finetuning
☆407Jan 17, 2024Updated 2 years ago
Alternatives and similar repositories for xllm
Users that are interested in xllm are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Курс по глубокому обучению в обработке естественных языков для магистров компьютерной лингвистики Высшей Школы Экономики☆49Sep 5, 2022Updated 3 years ago
- Probing suite for evaluation of Russian embedding and language models☆33Oct 1, 2024Updated last year
- Pytorch library for end-to-end transformer models training, inference and serving☆70Apr 19, 2025Updated 11 months ago
- Tools for merging pretrained large language models.☆6,945Mar 15, 2026Updated 3 weeks ago
- Pipeline for fast building text classification TF-IDF + LogReg baselines.☆61Nov 6, 2021Updated 4 years ago
- Wordpress hosting with auto-scaling on Cloudways • AdFully Managed hosting built for WordPress-powered businesses that need reliable, auto-scalable hosting. Cloudways SafeUpdates now available.
- Multi-LoRA inference server that scales to 1000s of fine-tuned LLMs☆3,743May 21, 2025Updated 10 months ago
- Python Server for C3 AI app. A project that brings the power of Large Language Models (LLM) and Retrieval-Augmented Generation (RAG) with…☆24Jan 7, 2024Updated 2 years ago
- Robust recipes to align language models with human and AI preferences☆5,551Apr 2, 2026Updated last week
- Distilabel is a framework for synthetic data and AI feedback for engineers who need fast, reliable and scalable pipelines based on verifi…☆3,155Mar 30, 2026Updated last week
- Language modeling and instruction tuning for Russian☆463Aug 20, 2024Updated last year
- Automatically evaluate your LLMs in Google Colab☆687May 7, 2024Updated last year
- Go ahead and axolotl questions☆11,608Updated this week
- C++ inference wrappers for running blazing fast embedding services on your favourite serverless like AWS Lambda. By Prithivi Da, PRs welc…☆23Mar 4, 2024Updated 2 years ago
- REST API for sentence tokenization and embedding using Multilingual Universal Sentence Encoder.☆51Sep 5, 2021Updated 4 years ago
- NordVPN Threat Protection Pro™ • AdTake your cybersecurity to the next level. Block phishing, malware, trackers, and ads. Lightweight app that works with all browsers.
- High level library for batched embeddings generation, blazingly-fast web-based RAG and quantized indexes processing ⚡☆70Nov 17, 2025Updated 4 months ago
- This repository contains a wrapper for keras models with tf-hub (elmo and bert), which allows you to use them as a keras layer, with the …☆20Jun 10, 2019Updated 6 years ago
- Efficient few-shot learning with Sentence Transformers☆2,705Apr 2, 2026Updated last week
- ☆375Dec 4, 2023Updated 2 years ago
- A project that brings the power of Large Language Models (LLM) and Retrieval-Augmented Generation (RAG) within reach of everyone, particu…☆38Jan 7, 2024Updated 2 years ago
- CraftML is a restful web service for easy pipeline creation without code.☆13Apr 18, 2021Updated 4 years ago
- BSNLP 2021☆33Nov 3, 2024Updated last year
- ☆13Dec 7, 2022Updated 3 years ago
- S-LoRA: Serving Thousands of Concurrent LoRA Adapters☆1,905Jan 21, 2024Updated 2 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting with the flexibility to host WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Cloudways by DigitalOcean.
- Easily use and train state of the art late-interaction retrieval methods (ColBERT) in any RAG pipeline. Designed for modularity and ease-…☆3,897May 17, 2025Updated 10 months ago
- The repository for the code of the UltraFastBERT paper☆518Mar 24, 2024Updated 2 years ago
- Convert ML Models to Flask + Gunicorn + Docker Service.☆12Oct 15, 2022Updated 3 years ago
- Planet: Understanding the Amazon from Space☆12Jul 23, 2017Updated 8 years ago
- Embedding Studio is a framework which allows you transform your Vector Database into a feature-rich Search Engine.☆384Apr 24, 2025Updated 11 months ago
- Low latency, High Accuracy, Custom Query routers for Humans and Agents. Built by Prithivi Da☆120Mar 31, 2025Updated last year
- ☆13Feb 26, 2023Updated 3 years ago
- LLM Finetuning with peft☆2,896Aug 1, 2025Updated 8 months ago
- Large Language Model Text Generation Inference☆10,817Mar 21, 2026Updated 2 weeks ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click and start building anything your business needs.
- Extend existing LLMs way beyond the original training length with constant memory usage, without retraining☆736Apr 10, 2024Updated last year
- ⚡ Build your chatbot within minutes on your favorite device; offer SOTA compression techniques for LLMs; run LLMs efficiently on Intel Pl…☆2,178Oct 8, 2024Updated last year
- [ICLR 2024] Efficient Streaming Language Models with Attention Sinks☆7,208Jul 11, 2024Updated last year
- Simple and efficient pytorch-native transformer text generation in <1000 LOC of python.☆6,190Aug 22, 2025Updated 7 months ago
- Toolkit for attaching, training, saving and loading of new heads for transformer models☆295Feb 12, 2026Updated last month
- MII makes low-latency and high-throughput inference possible, powered by DeepSpeed.☆2,107Jun 30, 2025Updated 9 months ago
- Enforce the output format (JSON Schema, Regex etc) of a language model☆2,001Updated this week