mistralai-sf24 / hackathonLinks
☆447Updated last year
Alternatives and similar repositories for hackathon
Users that are interested in hackathon are comparing it to the libraries listed below
Sorting:
- Accelerate your Hugging Face Transformers 7.6-9x. Native to Hugging Face and PyTorch.☆682Updated 9 months ago
- Automatically evaluate your LLMs in Google Colab☆629Updated last year
- Extend existing LLMs way beyond the original training length with constant memory usage, without retraining☆697Updated last year
- Reaching LLaMA2 Performance with 0.1M Dollars☆979Updated 10 months ago
- ☆971Updated 3 months ago
- Banishing LLM Hallucinations Requires Rethinking Generalization☆276Updated 10 months ago
- A benchmark to evaluate language models on questions I've previously asked them to solve.☆1,011Updated last month
- Fine-tune mistral-7B on 3090s, a100s, h100s☆711Updated last year
- ☆412Updated last year
- Fully fine-tune large models like Mistral, Llama-2-13B, or Qwen-14B completely for free☆230Updated 7 months ago
- Training LLMs with QLoRA + FSDP☆1,479Updated 6 months ago
- Inference code for Persimmon-8B☆415Updated last year
- ☆722Updated last week
- Train Models Contrastively in Pytorch☆713Updated 2 months ago
- Toolkit for attaching, training, saving and loading of new heads for transformer models☆277Updated 2 months ago
- Implementation of the training framework proposed in Self-Rewarding Language Model, from MetaAI☆1,384Updated last year
- ☆536Updated 9 months ago
- ☆459Updated last year
- Website for hosting the Open Foundation Models Cheat Sheet.☆267Updated 3 weeks ago
- A comprehensive deep dive into the world of tokens☆223Updated 11 months ago
- Start a server from the MLX library.☆187Updated 10 months ago
- This is our own implementation of 'Layer Selective Rank Reduction'☆238Updated last year
- ☆204Updated last year
- Let's build better datasets, together!☆258Updated 5 months ago
- ☆517Updated 6 months ago
- Manage scalable open LLM inference endpoints in Slurm clusters☆257Updated 10 months ago
- [ICLR 2025] Samba: Simple Hybrid State Space Models for Efficient Unlimited Context Language Modeling☆876Updated last month
- run paligemma in real time☆131Updated last year
- Generate textbook-quality synthetic LLM pretraining data☆498Updated last year
- ☆863Updated last year