tcapelle / mixtral
Mixtral finetuning
☆19 · Updated 9 months ago
Related projects
Alternatives and complementary repositories for mixtral
- ☆24 · Updated last year
- ☆22 · Updated last year
- Chat Markup Language conversation library ☆54 · Updated 10 months ago
- QLoRA for Masked Language Modeling ☆20 · Updated last year
- Repository containing the SPIN experiments on the DIBT 10k ranked prompts ☆23 · Updated 8 months ago
- A place to store reusable transformer components of my own creation or found on the interwebs ☆44 · Updated 2 weeks ago
- ☆41 · Updated 2 weeks ago
- Minimal LLM scripts for 24GB VRAM GPUs. Training, inference, whatever ☆33 · Updated this week
- ☆48 · Updated last year
- Tools to make language models a bit easier to use ☆30 · Updated this week
- ☆75 · Updated 5 months ago
- Using multiple LLMs for ensemble forecasting ☆16 · Updated 10 months ago
- Minimal PyTorch implementation of BM25 (with sparse tensors) ☆90 · Updated 8 months ago
- LLM training in simple, raw C/CUDA ☆12 · Updated last month
- Genalog is an open source, cross-platform python package allowing generation of synthetic document images with custom degradations and te… ☆42 · Updated 10 months ago
- Code for NeurIPS LLM Efficiency Challenge ☆54 · Updated 7 months ago
- ☆18 · Updated this week
- PyTorch implementation for MRL ☆18 · Updated 8 months ago
- ☆46 · Updated 9 months ago
- Codebase accompanying the Summary of a Haystack paper. ☆72 · Updated 2 months ago
- ☆87 · Updated 9 months ago
- ☆27 · Updated last month
- A sample pattern for running CI tests on Modal ☆13 · Updated 2 months ago
- Training code for Sparse Autoencoders on Embedding models ☆33 · Updated 3 weeks ago
- QLoRA with Enhanced Multi GPU Support ☆36 · Updated last year
- ☆36 · Updated 3 months ago
- Using open source LLMs to build synthetic datasets for direct preference optimization ☆40 · Updated 8 months ago
- An introduction to LLM Sampling ☆64 · Updated last week
- Training and Inference Notebooks for the RedPajama (OpenLlama) models ☆18 · Updated last year
- A DSPy-based implementation of the tree of thoughts method (Yao et al., 2023) for generating persuasive arguments ☆63 · Updated last month