SciPhi-AI / synthesizerView external linksLinks
A multi-purpose LLM framework for RAG and data creation.
☆629Jan 13, 2024Updated 2 years ago
Alternatives and similar repositories for synthesizer
Users that are interested in synthesizer are comparing it to the libraries listed below
Sorting:
- Generate textbook-quality synthetic LLM pretraining data☆509Oct 19, 2023Updated 2 years ago
- AgentSearch is a framework for powering search agents and enabling customizable local search.☆517Apr 22, 2024Updated last year
- Customizable implementation of the self-instruct paper.☆1,050Mar 7, 2024Updated last year
- ☆415Nov 2, 2023Updated 2 years ago
- Go ahead and axolotl questions☆11,289Updated this week
- Generate Synthetic Data Using OpenAI, MistralAI or AnthropicAI☆222Apr 29, 2024Updated last year
- data cleaning and curation for unstructured text☆328Aug 6, 2024Updated last year
- ☆63Sep 23, 2024Updated last year
- Let's create synthetic textbooks together :)☆76Jan 29, 2024Updated 2 years ago
- Distilabel is a framework for synthetic data and AI feedback for engineers who need fast, reliable and scalable pipelines based on verifi…☆3,084Jan 26, 2026Updated 2 weeks ago
- LLMs build upon Evol Insturct: WizardLM, WizardCoder, WizardMath☆9,477Jun 7, 2025Updated 8 months ago
- Easily use and train state of the art late-interaction retrieval methods (ColBERT) in any RAG pipeline. Designed for modularity and ease-…☆3,852May 17, 2025Updated 8 months ago
- ☆21Oct 6, 2023Updated 2 years ago
- Tools for merging pretrained large language models.☆6,783Jan 26, 2026Updated 2 weeks ago
- Merge Transformers language models by use of gradient parameters.☆214Aug 8, 2024Updated last year
- Small and Efficient Mathematical Reasoning LLMs☆73Jan 27, 2024Updated 2 years ago
- Fine-tune mistral-7B on 3090s, a100s, h100s☆725Oct 11, 2023Updated 2 years ago
- Low-Rank adapter extraction for fine-tuned transformers models☆180May 2, 2024Updated last year
- The official implementation of Self-Play Fine-Tuning (SPIN)☆1,234May 8, 2024Updated last year
- Full finetuning of large language models without large memory requirements☆94Sep 22, 2025Updated 4 months ago
- GPT-2 small trained on phi-like data☆68Feb 18, 2024Updated last year
- A collection of modular datasets generated by GPT-4, General-Instruct - Roleplay-Instruct - Code-Instruct - and Toolformer☆1,630Sep 15, 2023Updated 2 years ago
- Chat language model that can use tools and interpret the results☆1,590Dec 3, 2025Updated 2 months ago
- A bagel, with everything.☆326Apr 11, 2024Updated last year
- Robust recipes to align language models with human and AI preferences☆5,495Sep 8, 2025Updated 5 months ago
- Freeing data processing from scripting madness by providing a set of platform-agnostic customizable pipeline processing blocks.☆2,885Updated this week
- Demonstration that finetuning RoPE model on larger sequences than the pre-trained model adapts the model context limit☆63Jun 21, 2023Updated 2 years ago
- batched loras☆349Sep 6, 2023Updated 2 years ago
- ☆74Sep 5, 2023Updated 2 years ago
- ⚡ Build your chatbot within minutes on your favorite device; offer SOTA compression techniques for LLMs; run LLMs efficiently on Intel Pl…☆2,174Oct 8, 2024Updated last year
- Fast & more realistic evaluation of chat language models. Includes leaderboard.☆190Dec 23, 2023Updated 2 years ago
- An Autonomous LLM Agent that runs on Wizcoder-15B☆333Oct 21, 2024Updated last year
- QLoRA with Enhanced Multi GPU Support☆38Aug 8, 2023Updated 2 years ago
- A more memory-efficient rewrite of the HF transformers implementation of Llama for use with quantized weights.☆2,911Sep 30, 2023Updated 2 years ago
- ☆135Nov 24, 2023Updated 2 years ago
- OpenChat: Advancing Open-source Language Models with Imperfect Data☆5,472Sep 13, 2024Updated last year
- MiniHF is an inference, human preference data collection, and fine-tuning tool for local language models. It is intended to help the user…☆183Nov 6, 2025Updated 3 months ago
- Automatically evaluate your LLMs in Google Colab☆685May 7, 2024Updated last year
- Entropy Based Sampling and Parallel CoT Decoding☆3,432Nov 13, 2024Updated last year