zjohn77 / lightning-mlflow-hf
Use QLoRA to fine-tune an LLM in PyTorch Lightning with Hugging Face + MLflow
★64 · Updated last year
Alternatives and similar repositories for lightning-mlflow-hf
Users who are interested in lightning-mlflow-hf are comparing it to the libraries listed below.
- Lightweight demos for finetuning LLMs. Powered by 🤗 transformers and open-source datasets. ★78 · Updated 11 months ago
- Repository containing awesome resources regarding Hugging Face tooling. ★48 · Updated last year
- A Python wrapper around HuggingFace's TGI (text-generation-inference) and TEI (text-embedding-inference) servers. ★33 · Updated this week
- Code for NeurIPS LLM Efficiency Challenge ★59 · Updated last year
- Hugging Face Inference Toolkit used to serve transformers, sentence-transformers, and diffusers models. ★87 · Updated this week
- ★49 · Updated 7 months ago
- High level library for batched embeddings generation, blazingly-fast web-based RAG and quantized indexes processing ⚡ ★67 · Updated 10 months ago
- ★124 · Updated 10 months ago
- minimal pytorch implementation of bm25 (with sparse tensors) ★104 · Updated last year
- experiments with inference on llama ★104 · Updated last year
- Let's build better datasets, together! ★263 · Updated 9 months ago
- Supercharge huggingface transformers with model parallelism. ★77 · Updated last month
- Official repo for the paper PHUDGE: Phi-3 as Scalable Judge. Evaluate your LLMs with or without custom rubric, reference answer, absolute… ★49 · Updated last year
- ★37 · Updated 2 years ago
- Sakura-SOLAR-DPO: Merge, SFT, and DPO ★116 · Updated last year
- Truly flash implementation of DeBERTa disentangled attention mechanism. ★63 · Updated 2 weeks ago
- Notebooks for training universal 0-shot classifiers on many different tasks ★137 · Updated 8 months ago
- minimal scripts for 24GB VRAM GPUs. training, inference, whatever ★42 · Updated this week
- ★88 · Updated last year
- A fast implementation of T5/UL2 in PyTorch using Flash Attention ★107 · Updated 6 months ago
- LoRA and DoRA from Scratch Implementations ★212 · Updated last year
- Using open source LLMs to build synthetic datasets for direct preference optimization ★65 · Updated last year
- Datamodels for hugging face tokenizers ★71 · Updated this week
- Small and Efficient Mathematical Reasoning LLMs ★72 · Updated last year
- Fine-tune Mistral 7B to generate fashion style suggestions ★34 · Updated last year
- ★77 · Updated last year
- QLoRA with Enhanced Multi GPU Support ★37 · Updated 2 years ago
- A comprehensive deep dive into the world of tokens ★226 · Updated last year
- Pre-train Static Word Embeddings ★85 · Updated 2 weeks ago
- 🤝 Trade any tensors over the network ★30 · Updated last year