Alignment-Lab-AI / Dataset-Conversion-ToolkitLinks
a set of scripts to easily convert all training data from huggingface into alpaca instruct or sharegpt format, which should allow for ease of use with any trainer
☆16Updated 4 months ago
Alternatives and similar repositories for Dataset-Conversion-Toolkit
Users that are interested in Dataset-Conversion-Toolkit are comparing it to the libraries listed below
Sorting:
- ☆51Updated last year
- ☆23Updated 5 months ago
- Official homepage for "Self-Harmonized Chain of Thought" (NAACL 2025)☆91Updated 6 months ago
- entropix style sampling + GUI☆26Updated 8 months ago
- ☆66Updated last year
- A framework for evaluating function calls made by LLMs☆37Updated last year
- Easy to use, High Performant Knowledge Distillation for LLMs☆88Updated 2 months ago
- The Benefits of a Concise Chain of Thought on Problem Solving in Large Language Models☆22Updated 8 months ago
- ☆53Updated 8 months ago
- ☆115Updated 7 months ago
- Very minimal (and stateless) agent framework☆44Updated 6 months ago
- Let's create synthetic textbooks together :)☆75Updated last year
- ☆87Updated 6 months ago
- Parameter-Efficient Sparsity Crafting From Dense to Mixture-of-Experts for Instruction Tuning on General Tasks☆31Updated last year
- LLM based agents with proactive interactions, long-term memory, external tool integration, and local deployment capabilities.☆104Updated this week
- An easy-to-understand framework for LLM samplers that rewind and revise generated tokens☆140Updated 5 months ago
- AnyModal is a Flexible Multimodal Language Model Framework for PyTorch☆101Updated 7 months ago
- Function Calling Benchmark & Testing☆88Updated last year
- ☆157Updated last year
- ☆27Updated last year
- Mixture-of-Ollamas☆30Updated 11 months ago
- A fast, local, and secure approach for training LLMs for coding tasks using GRPO with WebAssembly and interpreter feedback.☆32Updated 3 months ago
- Extract a single expert from a Mixture Of Experts model using slerp interpolation.☆17Updated last year
- GPT-4 Level Conversational QA Trained In a Few Hours☆63Updated 11 months ago
- ☆23Updated 9 months ago
- run ollama & gguf easily with a single command☆52Updated last year
- A public implementation of the ReLoRA pretraining method, built on Lightning-AI's Pytorch Lightning suite.☆33Updated last year
- Low-Rank adapter extraction for fine-tuned transformers models☆174Updated last year
- ☆57Updated 5 months ago
- Optimizing Causal LMs through GRPO with weighted reward functions and automated hyperparameter tuning using Optuna☆55Updated 5 months ago