mistralai-sf24 / hackathonLinks

☆447

Alternatives and similar repositories for hackathon

Users that are interested in hackathon are comparing it to the libraries listed below

Sorting:

MDK8888 / GPTFast
Accelerate your Hugging Face Transformers 7.6-9x. Native to Hugging Face and PyTorch.
☆685Updated 11 months ago
mlabonne / llm-autoeval
Automatically evaluate your LLMs in Google Colab
☆649Updated last year
mistralai / megablocks-public
☆864Updated last year
Locutusque / TPU-Alignment
Fully fine-tune large models like Mistral, Llama-2-13B, or Qwen-14B completely for free
☆232Updated 9 months ago
abacaj / fine-tune-mistral
Fine-tune mistral-7B on 3090s, a100s, h100s
☆717Updated last year
SkunkworksAI / hydra-moe
☆416Updated last year
migtissera / Sensei
Generate Synthetic Data Using OpenAI, MistralAI or AnthropicAI
☆222Updated last year
lamini-ai / Lamini-Memory-Tuning
Banishing LLM Hallucinations Requires Rethinking Generalization
☆276Updated last year
mistralai / mistral-common
Official inference library for pre-processing of Mistral models
☆771Updated last week
myshell-ai / JetMoE
Reaching LLaMA2 Performance with 0.1M Dollars
☆983Updated last year
allenai / fm-cheatsheet
Website for hosting the Open Foundation Models Cheat Sheet.
☆267Updated 2 months ago
center-for-humans-and-machines / transformer-heads
Toolkit for attaching, training, saving and loading of new heads for transformer models
☆284Updated 4 months ago
QuixiAI / laserRMT
This is our own implementation of 'Layer Selective Rank Reduction'
☆239Updated last year
jondurbin / bagel
A bagel, with everything.
☆323Updated last year
nomic-ai / contrastors
Train Models Contrastively in Pytorch
☆731Updated 4 months ago
tomaarsen / attention_sinks
Extend existing LLMs way beyond the original training length with constant memory usage, without retraining
☆702Updated last year
deep-diver / llamaduo
[ACL'25] Official Code for LlamaDuo: LLMOps Pipeline for Seamless Migration from Service LLMs to Small-Scale Local LLMs
☆313Updated 3 weeks ago
VikParuchuri / textbook_quality
Generate textbook-quality synthetic LLM pretraining data
☆501Updated last year
persimmon-ai-labs / adept-inference
Inference code for Persimmon-8B
☆415Updated last year
lucidrains / self-rewarding-lm-pytorch
Implementation of the training framework proposed in Self-Rewarding Language Model, from MetaAI
☆1,394Updated last year
microsoft / Samba
[ICLR 2025] Samba: Simple Hybrid State Space Models for Efficient Unlimited Context Language Modeling
☆899Updated 3 months ago
apoorvumang / prompt-lookup-decoding
☆556Updated 11 months ago
AnswerDotAI / fsdp_qlora
Training LLMs with QLoRA + FSDP
☆1,524Updated 8 months ago
pbelcak / UltraFastBERT
The repository for the code of the UltraFastBERT paper
☆516Updated last year
huggingface / optimum-nvidia
☆988Updated 5 months ago
teknium1 / Prompt-Engineering-Toolkit
☆412Updated 11 months ago
Cerebras / gigaGPT
a small code base for training large models
☆307Updated 3 months ago
valine / NeuralFlow
Visualize the intermediate output of Mistral 7B
☆367Updated 6 months ago
taylorai / galactic
data cleaning and curation for unstructured text
☆328Updated 11 months ago
modal-labs / llm-finetuning
Guide for fine-tuning Llama/Mistral/CodeLlama models and more
☆613Updated 2 months ago