Alignment-Lab-AI / Dataset-Conversion-ToolkitLinks

a set of scripts to easily convert all training data from huggingface into alpaca instruct or sharegpt format, which should allow for ease of use with any trainer

☆16

Alternatives and similar repositories for Dataset-Conversion-Toolkit

Users that are interested in Dataset-Conversion-Toolkit are comparing it to the libraries listed below

Sorting:

nyunAI / PruneGPT
☆51Updated last year
lightblue-tech / lb-reranker
☆23Updated 5 months ago
Xalp / ECHO
Official homepage for "Self-Harmonized Chain of Thought" (NAACL 2025)
☆91Updated 6 months ago
EduardTalianu / EntropixLab
entropix style sampling + GUI
☆26Updated 8 months ago
QuixiAI / kraken
☆66Updated last year
interstellarninja / function-calling-eval
A framework for evaluating function calls made by LLMs
☆37Updated last year
agokrani / distillKitPlus
Easy to use, High Performant Knowledge Distillation for LLMs
☆88Updated 2 months ago
matthewrenze / jhu-concise-cot
The Benefits of a Concise Chain of Thought on Problem Solving in Large Language Models
☆22Updated 8 months ago
arcee-ai / DAM
☆53Updated 8 months ago
teknium1 / ShareGPT-Builder
☆115Updated 7 months ago
AtakanTekparmak / agento
Very minimal (and stateless) agent framework
☆44Updated 6 months ago
thooton / muse
Let's create synthetic textbooks together :)
☆75Updated last year
otriscon / llm-structured-output
☆87Updated 6 months ago
serp-ai / Parameter-Efficient-MoE
Parameter-Efficient Sparsity Crafting From Dense to Mixture-of-Experts for Instruction Tuning on General Tasks
☆31Updated last year
uukuguy / speechless
LLM based agents with proactive interactions, long-term memory, external tool integration, and local deployment capabilities.
☆104Updated this week
Mihaiii / backtrack_sampler
An easy-to-understand framework for LLM samplers that rewind and revise generated tokens
☆140Updated 5 months ago
ritabratamaiti / AnyModal
AnyModal is a Flexible Multimodal Language Model Framework for PyTorch
☆101Updated 7 months ago
ComposioHQ / Composio-Function-Calling-Benchmark
Function Calling Benchmark & Testing
☆88Updated last year
QuixiAI / OpenChatML
☆157Updated last year
zarakiquemparte / zaraki-tools
☆27Updated last year
sammcj / moa
Mixture-of-Ollamas
☆30Updated 11 months ago
axolotl-ai-cloud / grpo_code
A fast, local, and secure approach for training LLMs for coding tasks using GRPO with WebAssembly and interpreter feedback.
☆32Updated 3 months ago
QuixiAI / extract-expert
Extract a single expert from a Mixture Of Experts model using slerp interpolation.
☆17Updated last year
Cerebras / DocChat
GPT-4 Level Conversational QA Trained In a Few Hours
☆63Updated 11 months ago
remichu-ai / gallamaUI
☆23Updated 9 months ago
monk1337 / auto-ollama
run ollama & gguf easily with a single command
☆52Updated last year
ElleLeonne / Lightning-ReLoRA
A public implementation of the ReLoRA pretraining method, built on Lightning-AI's Pytorch Lightning suite.
☆33Updated last year
thomasgauthier / LoRD
Low-Rank adapter extraction for fine-tuned transformers models
☆174Updated last year
ArturTanona / grpo_unsloth_docker
☆57Updated 5 months ago
s-smits / grpo-optuna
Optimizing Causal LMs through GRPO with weighted reward functions and automated hyperparameter tuning using Optuna
☆55Updated 5 months ago