migtissera/Sensei

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/migtissera/Sensei)

migtissera / Sensei

Generate Synthetic Data Using OpenAI, MistralAI or AnthropicAI

☆221

Alternatives and similar repositories for Sensei

Users that are interested in Sensei are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

sam-paech / antislop-sampler
View on GitHub
☆351Mar 5, 2026Updated 4 months ago
jondurbin / bagel
View on GitHub
A bagel, with everything.
☆326Apr 11, 2024Updated 2 years ago
databricks / lilac
View on GitHub
Curate better data for LLMs
☆1,072Mar 19, 2024Updated 2 years ago
Alignment-Lab-AI / datagen
View on GitHub
a pipeline for using api calls to agnostically convert unstructured data into structured training data
☆32Sep 22, 2024Updated last year
e-p-armstrong / augmentoolkit
View on GitHub
Create Custom LLMs
☆1,859Jun 27, 2026Updated last month
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
Itachi-Uchiha581 / Auto-Data
View on GitHub
Auto Data is a library designed for quick and effortless creation of datasets tailored for fine-tuning Large Language Models (LLMs).
☆108Oct 31, 2024Updated last year
lucidrains / self-rewarding-lm-pytorch
View on GitHub
Implementation of the training framework proposed in Self-Rewarding Language Model, from MetaAI
☆1,411Apr 11, 2024Updated 2 years ago
jmanhype / dspy-self-discover-framework
View on GitHub
Leveraging DSPy for AI-driven task understanding and solution generation, the Self-Discover Framework automates problem-solving through r…
☆74Nov 4, 2025Updated 8 months ago
argilla-io / distilabel
View on GitHub
Distilabel is a framework for synthetic data and AI feedback for engineers who need fast, reliable and scalable pipelines based on verifi…
☆3,346Updated this week
VikParuchuri / textbook_quality
View on GitHub
Generate textbook-quality synthetic LLM pretraining data
☆508Oct 19, 2023Updated 2 years ago
QuixiAI / laserRMT
View on GitHub
This is our own implementation of 'Layer Selective Rank Reduction'
☆240May 26, 2024Updated 2 years ago
teknium1 / ShareGPT-Builder
View on GitHub
☆126Dec 18, 2024Updated last year
Mihaiii / backtrack_sampler
View on GitHub
An easy-to-understand framework for LLM samplers that rewind and revise generated tokens
☆151Jan 7, 2026Updated 6 months ago
waefrebeorn / KAN-WuBu-Memory
View on GitHub
An AI character interaction system with emotional modeling and advanced memory management
☆17Oct 26, 2024Updated last year
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
geronimi73 / phi2-finetune
View on GitHub
☆85Feb 1, 2024Updated 2 years ago
Mihaiii / llm_steer
View on GitHub
Steer LLM outputs towards a certain topic/subject and enhance response capabilities using activation engineering by adding steering vecto…
☆280Jan 10, 2026Updated 6 months ago
davidkim205 / translation
View on GitHub
☆13Apr 17, 2024Updated 2 years ago
xfactlab / orpo
View on GitHub
Official repository for ORPO
☆480May 31, 2024Updated 2 years ago
arcee-ai / mergekit
View on GitHub
Tools for merging pretrained large language models.
☆7,266Jun 17, 2026Updated last month
EduardTalianu / EntropixLab
View on GitHub
entropix style sampling + GUI
☆27Oct 30, 2024Updated last year
mrconter1 / PullRequestBenchmark
View on GitHub
Evaluating LLMs performance in PR reviews as an indicator for their capability in creating PRs.
☆13Apr 10, 2024Updated 2 years ago
itsme2417 / PolyMind
View on GitHub
A multimodal, function calling powered LLM webui.
☆213Sep 23, 2024Updated last year
QuixiAI / SystemChat
View on GitHub
☆31Jul 5, 2024Updated 2 years ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
g588928812 / qlora
View on GitHub
QLoRA: Efficient Finetuning of Quantized LLMs
☆11Jul 22, 2023Updated 3 years ago
jondurbin / airoboros
View on GitHub
Customizable implementation of the self-instruct paper.
☆1,051Mar 7, 2024Updated 2 years ago
QuixiAI / grokadamw
View on GitHub
☆137Aug 19, 2024Updated last year
apoorvumang / prompt-lookup-decoding
View on GitHub
Simple speculative decoding technique, integrated in vLLM and transformers
☆611Aug 23, 2024Updated last year
pratyushasharma / laser
View on GitHub
The Truth Is In There: Improving Reasoning in Language Models with Layer-Selective Rank Reduction
☆397Jul 9, 2024Updated 2 years ago
JD-P / minihf
View on GitHub
MiniHF is an inference, human preference data collection, and fine-tuning tool for local language models. It is intended to help the user…
☆184Nov 6, 2025Updated 8 months ago
Ozennefr / GPPPT
View on GitHub
A simple one file python script that executes AI processes defined in YML.
☆14Mar 26, 2023Updated 3 years ago
Leeroo-AI / mergoo
View on GitHub
A library for easily merging multiple LLM experts, and efficiently train the merged LLM.
☆517Aug 26, 2024Updated last year
KyujinHan / Sakura-SOLAR-DPO
View on GitHub
Sakura-SOLAR-DPO: Merge, SFT, and DPO
☆116Dec 30, 2023Updated 2 years ago
AI Agents on DigitalOcean Gradient AI Platform • Ad
Build production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
axolotl-ai-cloud / axolotl
View on GitHub
Go ahead and axolotl questions
☆12,275Updated this week
AnswerDotAI / ModernBERT-Instruct-mini-cookbook
View on GitHub
☆53Feb 10, 2025Updated last year
austinsilveria / tricksy
View on GitHub
Fast approximate inference on a single GPU with sparsity aware offloading
☆38Jan 4, 2024Updated 2 years ago
SebastianBodza / EnsembleForecasting
View on GitHub
Using multiple LLMs for ensemble Forecasting
☆16Jan 17, 2024Updated 2 years ago
galatolofederico / microchain
View on GitHub
function calling-based LLM agents
☆292Sep 16, 2024Updated last year
NousResearch / Open-Reasoning-Tasks
View on GitHub
A comprehensive repository of reasoning tasks for LLMs (and beyond)
☆497Sep 27, 2024Updated last year
jmanhype / Storm
View on GitHub
☆13Mar 25, 2026Updated 4 months ago