Generate Synthetic Data Using OpenAI, MistralAI or AnthropicAI
☆222 · Apr 29, 2024 · Updated last year
Alternatives and similar repositories for Sensei
Users who are interested in Sensei compare it to the libraries listed below.
- Low-Rank adapter extraction for fine-tuned transformers models ☆180 · May 2, 2024 · Updated last year
- ☆337 · Updated this week
- auto fine tune of models with synthetic data ☆78 · Feb 14, 2024 · Updated 2 years ago
- A bagel, with everything. ☆326 · Apr 11, 2024 · Updated last year
- Steer LLM outputs towards a certain topic/subject and enhance response capabilities using activation engineering by adding steering vecto… ☆270 · Jan 10, 2026 · Updated last month
- Leveraging DSPy for AI-driven task understanding and solution generation, the Self-Discover Framework automates problem-solving through r… ☆74 · Nov 4, 2025 · Updated 4 months ago
- Create Custom LLMs ☆1,812 · Nov 8, 2025 · Updated 3 months ago
- a pipeline for using api calls to agnostically convert unstructured data into structured training data ☆32 · Sep 22, 2024 · Updated last year
- This is our own implementation of 'Layer Selective Rank Reduction' ☆240 · May 26, 2024 · Updated last year
- Generate textbook-quality synthetic LLM pretraining data ☆509 · Oct 19, 2023 · Updated 2 years ago
- Auto Data is a library designed for quick and effortless creation of datasets tailored for fine-tuning Large Language Models (LLMs). ☆105 · Oct 31, 2024 · Updated last year
- ☆119 · Dec 18, 2024 · Updated last year
- ☆86 · Feb 1, 2024 · Updated 2 years ago
- Distilabel is a framework for synthetic data and AI feedback for engineers who need fast, reliable and scalable pipelines based on verifi… ☆3,108 · Feb 23, 2026 · Updated last week
- smolLM with Entropix sampler on pytorch ☆149 · Oct 31, 2024 · Updated last year
- Modified Beam Search with periodical restart ☆12 · Sep 12, 2024 · Updated last year
- Demo of AI chatbot that predicts user message to generate response quickly. ☆105 · Feb 28, 2024 · Updated 2 years ago
- ☆11 · Aug 26, 2024 · Updated last year
- Merge Transformers language models by use of gradient parameters. ☆214 · Aug 8, 2024 · Updated last year
- Tools for merging pretrained large language models. ☆6,826 · Updated this week
- Synthify: Seamlessly generate ai datasets with a no-code UI | https://synthify.toolstack.run ☆48 · Feb 9, 2025 · Updated last year
- ☆49 · May 13, 2024 · Updated last year
- Customizable implementation of the self-instruct paper. ☆1,049 · Mar 7, 2024 · Updated last year
- An easy-to-understand framework for LLM samplers that rewind and revise generated tokens ☆150 · Jan 7, 2026 · Updated last month
- The official implementation of Self-Play Fine-Tuning (SPIN) ☆1,235 · May 8, 2024 · Updated last year
- THOUGHTSCULPT, a general reasoning and search method for complex tasks ☆13 · Dec 13, 2024 · Updated last year
- TaskWeaver Plugins ☆12 · Jan 28, 2024 · Updated 2 years ago
- Chat language model that can use tools and interpret the results ☆1,591 · Dec 3, 2025 · Updated 3 months ago
- MiniHF is an inference, human preference data collection, and fine-tuning tool for local language models. It is intended to help the user… ☆184 · Nov 6, 2025 · Updated 4 months ago
- Official repository for ORPO ☆472 · May 31, 2024 · Updated last year
- A multi-purpose LLM framework for RAG and data creation. ☆629 · Jan 13, 2024 · Updated 2 years ago
- Training code for Sparse Autoencoders on Embedding models ☆39 · Feb 27, 2025 · Updated last year
- Go ahead and axolotl questions ☆11,395 · Updated this week
- Freeing data processing from scripting madness by providing a set of platform-agnostic customizable pipeline processing blocks. ☆2,915 · Updated this week
- ☆12 · Apr 17, 2024 · Updated last year
- A sleek, customizable interface for managing LLMs with responsive design and easy agent personalization. ☆17 · Aug 30, 2024 · Updated last year
- A library for making RepE control vectors ☆691 · Sep 24, 2025 · Updated 5 months ago
- Stanford NLP Python library for Representation Finetuning (ReFT) ☆1,560 · Jan 14, 2026 · Updated last month
- converts url content into JSON with a simple prefix ☆73 · May 8, 2024 · Updated last year