tcapelle / mixtralLinks
Mixtral finetuning
☆19Updated last year
Alternatives and similar repositories for mixtral
Users that are interested in mixtral are comparing it to the libraries listed below
Sorting:
- QLoRA for Masked Language Modeling☆22Updated 2 years ago
- ☆88Updated last year
- Zeus LLM Trainer is a rewrite of Stanford Alpaca aiming to be the trainer for all Large Language Models☆69Updated 2 years ago
- Chat Markup Language conversation library☆55Updated last year
- Genalog is an open source, cross-platform python package allowing generation of synthetic document images with custom degradations and te…☆43Updated last year
- Code for NeurIPS LLM Efficiency Challenge☆59Updated last year
- Simple GRPO scripts and configurations.☆59Updated 8 months ago
- QLoRA with Enhanced Multi GPU Support☆37Updated 2 years ago
- ☆55Updated 11 months ago
- ☆50Updated 8 months ago
- PyLate efficient inference engine☆66Updated last month
- Repository containing the SPIN experiments on the DIBT 10k ranked prompts☆24Updated last year
- Supervised instruction finetuning for LLM with HF trainer and Deepspeed☆36Updated 2 years ago
- ☆46Updated 2 years ago
- A library for squeakily cleaning and filtering language datasets.☆47Updated 2 years ago
- ☆22Updated 2 years ago
- ☆25Updated 5 months ago
- Supercharge huggingface transformers with model parallelism.☆77Updated 3 months ago
- ☆69Updated last year
- ☆80Updated last year
- A sample pattern for running CI tests on Modal☆18Updated 6 months ago
- Training code for Sparse Autoencoders on Embedding models☆38Updated 8 months ago
- Trully flash implementation of DeBERTa disentangled attention mechanism.☆66Updated last month
- ☆23Updated 2 years ago
- Based on the tree of thoughts paper☆48Updated 2 years ago
- A repository of projects and datasets under active development by Alignment Lab AI☆22Updated last year
- Improving Text Embedding of Language Models Using Contrastive Fine-tuning☆65Updated last year
- ☆29Updated this week
- Repository containing awesome resources regarding Hugging Face tooling.☆48Updated last year
- Simple replication of [ColBERT-v1](https://arxiv.org/abs/2004.12832).☆79Updated last year