Birch-san / booru-embed
[WIP] Transformer to embed Danbooru labelsets
☆13 Updated 7 months ago
Related projects
Alternatives and complementary repositories for booru-embed
- Repository containing the SPIN experiments on the DIBT 10k ranked prompts ☆23 Updated 8 months ago
- Demonstration that fine-tuning a RoPE model on longer sequences than the pre-trained model used adapts the model's context limit ☆63 Updated last year
- ☆49 Updated 8 months ago
- ☆22 Updated last year
- Collection of autoregressive model implementations ☆67 Updated this week
- QLoRA for Masked Language Modeling ☆20 Updated last year
- Training hybrid models for dummies. ☆15 Updated 3 weeks ago
- QLoRA with Enhanced Multi GPU Support ☆36 Updated last year
- GoldFinch and other hybrid transformer components ☆39 Updated 4 months ago
- ☆57 Updated 11 months ago
- ☆40 Updated 2 weeks ago
- Lightweight tools for quick and easy LLM demos ☆26 Updated last month
- Zeus LLM Trainer is a rewrite of Stanford Alpaca aiming to be the trainer for all Large Language Models ☆69 Updated last year
- ☆27 Updated 5 months ago
- QAmeleon introduces synthetic multilingual QA data using PaLM, a 540B large language model. This dataset was generated by prompt tuning P… ☆34 Updated last year
- This repository contains code for cleaning your training data of benchmark data to help combat data snooping. ☆25 Updated last year
- The official implementation of MARS: Unleashing the Power of Variance Reduction for Training Large Models ☆65 Updated this week
- ☆62 Updated last month
- Data preparation code for CrystalCoder 7B LLM ☆42 Updated 6 months ago
- Latent Large Language Models ☆16 Updated 2 months ago
- ☆24 Updated last year
- A public implementation of the ReLoRA pretraining method, built on Lightning-AI's PyTorch Lightning suite. ☆33 Updated 8 months ago
- Anchored Preference Optimization and Contrastive Revisions: Addressing Underspecification in Alignment ☆46 Updated 2 months ago
- Using multiple LLMs for ensemble Forecasting ☆16 Updated 10 months ago
- The Benefits of a Concise Chain of Thought on Problem Solving in Large Language Models ☆20 Updated 9 months ago
- ☆48 Updated last year
- A place to store reusable transformer components of my own creation or found on the interwebs ☆44 Updated 2 weeks ago
- Script for processing OpenAI's PRM800K process supervision dataset into an Alpaca-style instruction-response format ☆27 Updated last year