oKatanaaa / kolibrifyLinks
Curriculum training of instruction-following LLMs with Unsloth
☆14Updated 3 months ago
Alternatives and similar repositories for kolibrify
Users that are interested in kolibrify are comparing it to the libraries listed below
Sorting:
- Optimizing Causal LMs through GRPO with weighted reward functions and automated hyperparameter tuning using Optuna☆53Updated 4 months ago
- Repository containing the SPIN experiments on the DIBT 10k ranked prompts☆24Updated last year
- Crispy reranking models by Mixedbread☆32Updated 3 weeks ago
- ☆124Updated 2 months ago
- Trully flash implementation of DeBERTa disentangled attention mechanism.☆59Updated last month
- QLoRA with Enhanced Multi GPU Support☆37Updated last year
- Train your own SOTA deductive reasoning model☆94Updated 3 months ago
- ☆47Updated 4 months ago
- ☆47Updated 10 months ago
- Using open source LLMs to build synthetic datasets for direct preference optimization☆64Updated last year
- ☆51Updated 7 months ago
- Pre-train Static Word Embeddings☆80Updated 3 weeks ago
- ☆35Updated 2 years ago
- ☆66Updated last year
- Simple GRPO scripts and configurations.☆58Updated 4 months ago
- This repository contains code for cleaning your training data of benchmark data to help combat data snooping.☆25Updated 2 years ago
- ☆47Updated last year
- Lightweight demos for finetuning LLMs. Powered by 🤗 transformers and open-source datasets.☆77Updated 8 months ago
- Lightweight toolkit package to train and fine-tune 1.58bit Language models☆80Updated last month
- High level library for batched embeddings generation, blazingly-fast web-based RAG and quantized indexes processing ⚡☆66Updated 7 months ago
- ☆76Updated last year
- ☆34Updated 3 months ago
- An introduction to LLM Sampling☆78Updated 6 months ago
- ☆61Updated last week
- an open source reproduction of NVIDIA's nGPT (Normalized Transformer with Representation Learning on the Hypersphere)☆101Updated 3 months ago
- Experiments for efforts to train a new and improved t5☆77Updated last year
- ☆17Updated last year
- ☆35Updated last year
- Just a bunch of benchmark logs for different LLMs☆119Updated 11 months ago
- ☆30Updated 7 months ago