yigitkonur / data-preparation-for-fine-tuningLinks
A Python project for preparing and analyzing datasets from JSONL files. It includes tools for shuffling, categorizing, and generating reports on dataset content.
☆16Updated last year
Alternatives and similar repositories for data-preparation-for-fine-tuning
Users that are interested in data-preparation-for-fine-tuning are comparing it to the libraries listed below
Sorting:
- Python script for automated clustering of embeddings with DBSCAN, using cosine similarity for flexible, size-agnostic grouping (I hate K-…☆13Updated last year
- A repository trying to translate subtitles with GPT 3.5 Turbo without losing context (using the dynamic window context method).☆30Updated last year
- Serverless API Gateway☆70Updated this week
- Aşağılayıcı Söylemlerin Doğal Dil İşleme İle Tespiti☆22Updated last year
- deduplication☆14Updated 2 years ago
- Minimalist search engine for job applications (CVs)☆60Updated 7 months ago
- A high-performance Rust tool for sending API requests (to LLMs in my case) with built-in weighted load balancing, retry mechanisms, and r…☆16Updated last year
- Kubernetes logs to MongoDB☆16Updated 3 years ago
- Türkiye Teknoloji Takımı Vakfı - Yapay Zeka Usta Eğitimleri Serisi - Makine Öğreniminde Regresyon ve Sınıflandırma☆17Updated 4 years ago
- A Python toolkit for image clustering using deep learning, PCA, and K-means, with support for GPU and CPU processing. Simplify your image…☆37Updated last year
- Turkish LM Tuner☆84Updated 8 months ago
- Server load testing CLI tool 🏋️☆10Updated last year
- Türkiye Açık Kaynak Platformunun organizasyonluğunda düzenlenen Açık Seminer (https://www.acikseminer.com/) serisinin doğal dil işleme ha…☆22Updated 5 years ago
- Self-Driven Autonomous Python Libraries☆94Updated 7 months ago
- Turkish-Reading-Comprehension-Question-Answering-Dataset☆82Updated 3 years ago
- a lightweight and simple cli package☆13Updated 3 years ago
- Summarize webpages from specified URLs using the LangChain framework and the ChatOllama model☆118Updated 9 months ago
- Turkish News Category Classification Tutorial☆31Updated 3 years ago
- ☆12Updated last year
- ☆36Updated 2 years ago
- Calculates the calories of food.☆29Updated 4 years ago
- It serves to visualize the muninn configs you have prepared.☆20Updated 3 years ago
- Very early version of the TurkishNLP. For now it has basically 5 main functions; Detecting Turkish Language, syllabicating words, vowel h…☆150Updated last year
- ☆27Updated 2 years ago
- Muninn is a fast and flexible HTML parsing tool that simplifies the process of extracting data from HTMLs.☆144Updated 4 months ago
- A webcam application for YouTube☆34Updated last year
- Mintlemon, Türkçe Doğal Dil İşleme Kütüphanesi, Teknofest Türkçe Doğal Dil İşleme Yarışması kapsamında geliştirildi. Nane&Limon Takımı ad…☆41Updated last year
- Spotify-to-Instagram Notes a.k.a MSN What I'm Listening This tool syncs the song you listen to on Spotify with Instagram Notes☆8Updated last year
- In this repository you can find natural language processing stuff☆21Updated 4 years ago
- ☆13Updated 7 years ago