yigitkonur / llm-dataset-prepView external linksLinks
Python toolkit for preparing LLM fine-tuning datasets. Features category weighting, reservoir sampling, JSONL processing, and statistical analysis.
☆15Jan 1, 2024Updated 2 years ago
Alternatives and similar repositories for llm-dataset-prep
Users that are interested in llm-dataset-prep are comparing it to the libraries listed below
Sorting:
- Higher level abstraction for franz-go.☆22Aug 22, 2022Updated 3 years ago
- Go-based task queue with MongoDB storage for distributed app tasks☆15Mar 30, 2024Updated last year
- High-performance Rust CLI and library achieving 10K+ req/s for LLM APIs. Features weighted load-balancing, HTTP/2 pooling, and real-time …☆17Nov 29, 2025Updated 2 months ago
- Aşağılayıcı Söylemlerin Doğal Dil İşleme İle Tespiti☆22Dec 25, 2023Updated 2 years ago
- It serves to visualize the muninn configs you have prepared.☆19Nov 22, 2021Updated 4 years ago
- Wrapper and parser modules for Shopify's API.☆10Dec 26, 2022Updated 3 years ago
- ☆28Feb 3, 2026Updated last week
- Mailtracker is Email Sandbox to inspect and debug emails in staging, dev, and QA environments before sending them to recipients in produc…☆11Jul 3, 2024Updated last year
- The Treblle SDK the Django framework☆11Sep 10, 2025Updated 5 months ago
- ☆11Dec 12, 2025Updated 2 months ago
- Trains small LMs. Designed for training on SimpleStories☆12Sep 15, 2025Updated 5 months ago
- JavaScript Client Library for DeployR.☆14Jun 14, 2023Updated 2 years ago
- Extract addresses and intents from tweet texts☆38Feb 17, 2023Updated 2 years ago
- It is a command line tool that will simplify Nginx operations.☆33Apr 8, 2021Updated 4 years ago
- ☆39Apr 26, 2024Updated last year
- Build TypeScript functions that are durable by default; no PhD required.☆15Apr 3, 2025Updated 10 months ago
- Text Classification Dataset for Turkish Language☆10Nov 16, 2021Updated 4 years ago
- Idiomatic way to fill structs with options logic☆13Feb 13, 2022Updated 4 years ago
- ☆11May 21, 2023Updated 2 years ago
- ☆12Feb 23, 2024Updated last year
- Covert.io blog☆12Feb 3, 2024Updated 2 years ago
- Demo application for the sse-eventbus library☆12Dec 7, 2025Updated 2 months ago
- Radiantloom Email Assist 7B is an email-assistant large language model fine-tuned from Zephyr-7B-Beta, over a custom-curated dataset of 1…☆14Jan 19, 2024Updated 2 years ago
- Easy way to look the corona stats in your country.☆11May 31, 2020Updated 5 years ago
- Official Code for What Makes and Breaks Safety Fine-tuning? A Mechanistic Study (NeurIPS 2024)☆12Oct 31, 2024Updated last year
- Library for calculating sun/moon positions and phases☆10Updated this week
- A collection of packages for chess analysis grouped under a @chess-tools scope.☆10Nov 4, 2018Updated 7 years ago
- ☆14Jan 8, 2020Updated 6 years ago
- A library to create lore plots (logistic regression of the prevalence of a categorical variable in function of a continuous feature)☆16Feb 1, 2026Updated 2 weeks ago
- John Langford's original release of Vowpal Wabbit -- a fast online learning algorithm☆16Jul 25, 2017Updated 8 years ago
- ☆20Jul 18, 2025Updated 6 months ago
- The sample Messenger Bot app for the Masterclass for Developers training class☆11Jul 12, 2018Updated 7 years ago
- lab2023's official web page☆20Feb 4, 2026Updated last week
- PESLA - TORCS Deep Reinforcement Learning Agent☆10Oct 20, 2019Updated 6 years ago
- For good words and good views.☆11Jan 5, 2023Updated 3 years ago
- 🐣🕐📅 A simple utility to draft scheduling emails.☆12Sep 13, 2023Updated 2 years ago
- Example of clingo usage on website as a client-side JS program☆14Aug 27, 2019Updated 6 years ago
- Türkçe Saldırgan İçerik Sınıflandırma Modeli☆10Apr 3, 2025Updated 10 months ago
- Veri Bilimi Yaz Okulu☆45Jul 1, 2021Updated 4 years ago