tcapelle / mixtral
Mixtral finetuning
☆19Updated last year
Alternatives and similar repositories for mixtral:
Users that are interested in mixtral are comparing it to the libraries listed below
- Repository containing the SPIN experiments on the DIBT 10k ranked prompts☆24Updated last year
- Genalog is an open source, cross-platform python package allowing generation of synthetic document images with custom degradations and te…☆42Updated last year
- A place to store reusable transformer components of my own creation or found on the interwebs☆48Updated last week
- ☆40Updated 2 months ago
- ☆17Updated this week
- ☆24Updated last year
- ☆22Updated last year
- QLoRA for Masked Language Modeling☆22Updated last year
- ☆48Updated 5 months ago
- QLoRA with Enhanced Multi GPU Support☆37Updated last year
- [WIP] Transformer to embed Danbooru labelsets☆13Updated last year
- Zeus LLM Trainer is a rewrite of Stanford Alpaca aiming to be the trainer for all Large Language Models☆69Updated last year
- A sample pattern for running CI tests on Modal☆17Updated this week
- Using multiple LLMs for ensemble Forecasting☆16Updated last year
- An introduction to LLM Sampling☆77Updated 3 months ago
- ☆28Updated 5 months ago
- Training code for Sparse Autoencoders on Embedding models☆38Updated last month
- NeurIPS 2023 - Cappy: Outperforming and Boosting Large Multi-Task LMs with a Small Scorer☆42Updated last year
- Code and data for "StructLM: Towards Building Generalist Models for Structured Knowledge Grounding" (COLM 2024)☆76Updated 5 months ago
- A fast, local, and secure approach for training LLMs for coding tasks using GRPO with WebAssembly and interpreter feedback.☆21Updated last week
- Code for NeurIPS LLM Efficiency Challenge☆57Updated last year
- This repository contains code for cleaning your training data of benchmark data to help combat data snooping.☆25Updated last year
- Trully flash implementation of DeBERTa disentangled attention mechanism.☆34Updated this week
- ☆77Updated 10 months ago
- ☆26Updated last month
- 🤗 HuggingFace Inference Toolkit for Google Cloud Vertex AI (similar to SageMaker's Inference Toolkit, but for Vertex AI and unofficial)☆17Updated last year
- ☆87Updated last year
- LLM training in simple, raw C/CUDA☆14Updated 4 months ago
- Codebase accompanying the Summary of a Haystack paper.☆77Updated 6 months ago
- Simple GRPO scripts and configurations.☆58Updated 2 months ago