davanstrien / data-for-fine-tuning-llmsView external linksLinks
☆80Jun 5, 2024Updated last year
Alternatives and similar repositories for data-for-fine-tuning-llms
Users that are interested in data-for-fine-tuning-llms are comparing it to the libraries listed below
Sorting:
- ☆171Jun 3, 2024Updated last year
- awesome synthetic (text) datasets☆323Jan 8, 2026Updated last month
- ☆21Oct 14, 2024Updated last year
- A webhook that integrates the W&B model registry with Modal Labs☆15Dec 24, 2023Updated 2 years ago
- ☆54May 28, 2024Updated last year
- Fast LLM Training CodeBase With dynamic strategy choosing [Deepspeed+Megatron+FlashAttention+CudaFusionKernel+Compiler];☆40Jan 4, 2024Updated 2 years ago
- msglm makes it a little easier to create messages for language models like Claude and OpenAI GPTs.☆14Jan 29, 2026Updated 2 weeks ago
- Various test models in WNNX format. It can view with `pip install wnetron && wnetron`☆12Jun 22, 2022Updated 3 years ago
- Code for evaluating with Flow-Judge-v0.1 - an open-source, lightweight (3.8B) language model optimized for LLM system evaluations. Crafte…☆84Oct 29, 2024Updated last year
- Code for the examples presented in the talk "Training a Llama in your backyard: fine-tuning very large models on consumer hardware" given…☆15Oct 16, 2023Updated 2 years ago
- [WIP] ONNX parts yard. The various operations described in Operator Schemas are converted in advance into OP stand-alone ONNX files.☆11Mar 30, 2025Updated 10 months ago
- Yans2019 Annotation hackathon☆14May 22, 2023Updated 2 years ago
- This repo contains code and data of our contribution to the 2024 LLM Hackathon, materials' property prediction from textual descriptions …☆12May 9, 2024Updated last year
- Colab Notebook for SeamlessM4T model by Meta☆10Aug 23, 2023Updated 2 years ago
- Full text search that feels like a numpy array☆301Feb 1, 2026Updated last week
- Trace LLM calls (and others) and visualize them in WandB, as interactive SVG or using a streaming local webapp☆14Feb 18, 2025Updated 11 months ago
- ☆18Feb 7, 2024Updated 2 years ago
- ☆198May 5, 2024Updated last year
- ☆67Mar 4, 2024Updated last year
- Build fast gradio demos of fastai learners☆35Sep 23, 2021Updated 4 years ago
- ☆28Sep 11, 2025Updated 5 months ago
- An overview of GRPO & DeepSeek-R1 Training with Open Source GRPO Model Fine Tuning☆37May 18, 2025Updated 8 months ago
- ☆67Aug 5, 2025Updated 6 months ago
- ☆162Dec 2, 2024Updated last year
- Doing simple retrieval from LLM models at various context lengths to measure accuracy☆108Sep 19, 2025Updated 4 months ago
- Late Interaction Models Training & Retrieval☆701Updated this week
- Using modal.com to process FineWeb-edu data☆20Apr 5, 2025Updated 10 months ago
- Extract a single expert from a Mixture Of Experts model using slerp interpolation.☆19May 26, 2024Updated last year
- This repository helps you evaluate your models on the FreshStack benchmark!☆31Dec 9, 2025Updated 2 months ago
- Repositorio general para Bootcamps de Data Science en Coding Dojo☆11Nov 13, 2025Updated 3 months ago
- A Fast, Simplified Model for Molecular Generation with Improved Physical Quality☆25Oct 1, 2025Updated 4 months ago
- ☆42Jul 23, 2024Updated last year
- ☆24Feb 4, 2026Updated last week
- ShellSage saves sysadmins’ sanity by solving shell script snafus super swiftly☆391Feb 4, 2026Updated last week
- Kura is a simple reproduction of the CLIO paper which uses language models to label user behaviour before clustering them based on embedd…☆421Sep 10, 2025Updated 5 months ago
- A Lightweight Library for AI Observability☆255Feb 20, 2025Updated 11 months ago
- Sharing both practical insights and theoretical knowledge about LLM evaluation that we gathered while managing the Open LLM Leaderboard a…☆2,054Dec 3, 2025Updated 2 months ago
- Hugging Face Inference Toolkit used to serve transformers, sentence-transformers, and diffusers models.☆90Jan 9, 2026Updated last month
- Starter template for python projects☆18Feb 15, 2024Updated last year