zjohn77 / lightning-mlflow-hfLinks
Use QLoRA to tune LLM in PyTorch-Lightning w/ Huggingface + MLflow
☆64Updated 2 years ago
Alternatives and similar repositories for lightning-mlflow-hf
Users that are interested in lightning-mlflow-hf are comparing it to the libraries listed below
Sorting:
- Repository containing awesome resources regarding Hugging Face tooling.☆48Updated last year
- minimal pytorch implementation of bm25 (with sparse tensors)☆104Updated last month
- Hugging Face Inference Toolkit used to serve transformers, sentence-transformers, and diffusers models.☆88Updated last week
- Lightweight demos for finetuning LLMs. Powered by 🤗 transformers and open-source datasets.☆78Updated last year
- Code for NeurIPS LLM Efficiency Challenge☆59Updated last year
- ☆92Updated 5 months ago
- experiments with inference on llama☆103Updated last year
- Let's build better datasets, together!☆265Updated 11 months ago
- A Python wrapper around HuggingFace's TGI (text-generation-inference) and TEI (text-embedding-inference) servers.☆33Updated 2 months ago
- ☆124Updated last year
- High level library for batched embeddings generation, blazingly-fast web-based RAG and quantized indexes processing ⚡☆68Updated last week
- QLoRA with Enhanced Multi GPU Support☆37Updated 2 years ago
- ☆51Updated 9 months ago
- Google TPU optimizations for transformers models☆122Updated 10 months ago
- ☆86Updated 4 months ago
- 🤝 Trade any tensors over the network☆30Updated 2 years ago
- Supercharge huggingface transformers with model parallelism.☆77Updated 4 months ago
- ML/DL Math and Method notes☆64Updated last year
- Datamodels for hugging face tokenizers☆86Updated this week
- LoRA and DoRA from Scratch Implementations☆215Updated last year
- **ARCHIVED** Filesystem interface to 🤗 Hub☆58Updated 2 years ago
- Improving Text Embedding of Language Models Using Contrastive Fine-tuning☆65Updated last year
- Trully flash implementation of DeBERTa disentangled attention mechanism.☆67Updated 2 months ago
- A fast implementation of T5/UL2 in PyTorch using Flash Attention☆112Updated 3 weeks ago
- ☆37Updated 2 years ago
- Minimal sharded dataset loaders, decoders, and utils for multi-modal document, image, and text datasets.☆159Updated last year
- Developing tools to automatically analyze datasets☆75Updated last year
- Code for Zero-Shot Tokenizer Transfer☆142Updated 10 months ago
- Use OpenAI with HuggingChat by emulating the text_generation_inference_server☆45Updated 2 years ago
- Small and Efficient Mathematical Reasoning LLMs☆72Updated last year