neuralwork / instruct-finetune-mistral
Fine-tune Mistral 7B to generate fashion style suggestions
β34Updated last year
Alternatives and similar repositories for instruct-finetune-mistral:
Users that are interested in instruct-finetune-mistral are comparing it to the libraries listed below
- Explore the use of DSPy for extracting features from PDFs πβ38Updated last year
- Set of scripts to finetune LLMsβ36Updated 11 months ago
- Using open source LLMs to build synthetic datasets for direct preference optimizationβ58Updated last year
- High level library for batched embeddings generation, blazingly-fast web-based RAG and quantized indexes processing β‘β67Updated 4 months ago
- Use QLoRA to tune LLM in PyTorch-Lightning w/ Huggingface + MLflowβ58Updated last year
- β31Updated last year
- Pre-train Static Word Embeddingsβ48Updated last week
- β18Updated 5 months ago
- A clone of OpenAI's Tokenizer page for HuggingFace Modelsβ45Updated last year
- β40Updated 10 months ago
- π§° The AutoTokenizer that TikToken always needed -- Load any tokenizer with TikToken now! β¨β39Updated 2 months ago
- β20Updated last year
- Supervised instruction finetuning for LLM with HF trainer and Deepspeedβ34Updated last year
- Official repo for the paper PHUDGE: Phi-3 as Scalable Judge. Evaluate your LLMs with or without custom rubric, reference answer, absoluteβ¦β49Updated 8 months ago
- Using modal.com to process FineWeb-edu dataβ20Updated last week
- A library for squeakily cleaning and filtering language datasets.β46Updated last year
- Tools for merging pretrained large language models.β19Updated 9 months ago
- A Python wrapper around HuggingFace's TGI (text-generation-inference) and TEI (text-embedding-inference) servers.β34Updated 2 months ago
- Code and data for "StructLM: Towards Building Generalist Models for Structured Knowledge Grounding" (COLM 2024)β76Updated 4 months ago
- Writing Blog Posts with Generative Feedback Loops!β47Updated 11 months ago
- DSPy program/pipeline inspector widget for Jupyter/VSCode Notebooks.β32Updated last year
- β24Updated last year
- Generalist and Lightweight Model for Text Classificationβ90Updated last week
- π€ Trade any tensors over the networkβ30Updated last year
- minimal LLM scripts for 24GB VRAM GPUs. training, inference, whateverβ37Updated this week
- Run LLMs on Replicate with vLLMβ16Updated 5 months ago
- Lightweight demos for finetuning LLMs. Powered by π€ transformers and open-source datasets.β73Updated 4 months ago
- Dataset Viber is your chill repo for data collection, annotation and vibe checks.β47Updated 6 months ago