☆80Jun 5, 2024Updated last year
Alternatives and similar repositories for data-for-fine-tuning-llms
Users that are interested in data-for-fine-tuning-llms are comparing it to the libraries listed below
Sorting:
- ☆171Jun 3, 2024Updated last year
- ☆79May 27, 2024Updated last year
- awesome synthetic (text) datasets☆325Jan 8, 2026Updated last month
- ☆21Oct 14, 2024Updated last year
- A webhook that integrates the W&B model registry with Modal Labs☆15Dec 24, 2023Updated 2 years ago
- a collection of resources around LLMs, aggregated for the workshop "Mastering LLMs: End-to-End Fine-Tuning and Deployment" by Dan Becker …☆110May 31, 2024Updated last year
- ☆48Aug 29, 2024Updated last year
- Fast LLM Training CodeBase With dynamic strategy choosing [Deepspeed+Megatron+FlashAttention+CudaFusionKernel+Compiler];☆40Jan 4, 2024Updated 2 years ago
- This repository is designed for deploying and managing server processes that handle embeddings using the Infinity Embedding model or Larg…☆27Mar 6, 2025Updated last year
- msglm makes it a little easier to create messages for language models like Claude and OpenAI GPTs.☆14Jan 29, 2026Updated last month
- Various test models in WNNX format. It can view with `pip install wnetron && wnetron`☆12Jun 22, 2022Updated 3 years ago
- Code for evaluating with Flow-Judge-v0.1 - an open-source, lightweight (3.8B) language model optimized for LLM system evaluations. Crafte…☆84Oct 29, 2024Updated last year
- Code for the examples presented in the talk "Training a Llama in your backyard: fine-tuning very large models on consumer hardware" given…☆15Oct 16, 2023Updated 2 years ago
- [WIP] ONNX parts yard. The various operations described in Operator Schemas are converted in advance into OP stand-alone ONNX files.☆11Mar 30, 2025Updated 11 months ago
- Yans2019 Annotation hackathon☆14May 22, 2023Updated 2 years ago
- Colab Notebook for SeamlessM4T model by Meta☆10Aug 23, 2023Updated 2 years ago
- An easy-to-use ML pipeline package for Python inspired by scikit-learn pipeline and PyTorch layers.☆12Aug 27, 2023Updated 2 years ago
- Full text search that feels like a numpy array☆303Feb 1, 2026Updated last month
- Widgets to make it easy to add labels☆48Nov 18, 2025Updated 3 months ago
- ☆18Feb 7, 2024Updated 2 years ago
- Baguetter is a flexible, efficient, and hackable search engine library implemented in Python. It's designed for quickly benchmarking, imp…☆206Aug 31, 2024Updated last year
- ☆67Mar 4, 2024Updated 2 years ago
- Build fast gradio demos of fastai learners☆35Sep 23, 2021Updated 4 years ago
- 🛠 Self-hosted, fast, and consistent remote configuration for apps.☆17Nov 7, 2022Updated 3 years ago
- ☆162Dec 2, 2024Updated last year
- An in-cell AI assistant for JupyterLab notebooks☆38Sep 17, 2025Updated 5 months ago
- Doing simple retrieval from LLM models at various context lengths to measure accuracy☆109Sep 19, 2025Updated 5 months ago
- Extract a single expert from a Mixture Of Experts model using slerp interpolation.☆19May 26, 2024Updated last year
- This repository helps you evaluate your models on the FreshStack benchmark!☆33Dec 9, 2025Updated 2 months ago
- Convenient access to `pynvml` (the library behind `nvidia-smi`)☆23Oct 18, 2024Updated last year
- Using modal.com to process FineWeb-edu data☆20Apr 5, 2025Updated 11 months ago
- Late Interaction Models Training & Retrieval☆732Feb 27, 2026Updated last week
- ☆44Jul 23, 2024Updated last year
- Open deep learning compiler stack for cpu, gpu and specialized accelerators☆19Feb 24, 2026Updated last week
- A dictionary, but it shows you position in embedding space relative to some synonyms/antonyms instead of a definition.☆74Nov 25, 2025Updated 3 months ago
- Repositorio general para Bootcamps de Data Science en Coding Dojo☆11Nov 13, 2025Updated 3 months ago
- ShellSage saves sysadmins’ sanity by solving shell script snafus super swiftly☆394Feb 9, 2026Updated 3 weeks ago
- A Lightweight Library for AI Observability☆255Feb 20, 2025Updated last year
- auto fine tune of models with synthetic data☆78Feb 14, 2024Updated 2 years ago