yigitkonur / data-preparation-for-fine-tuningLinks
A Python project for preparing and analyzing datasets from JSONL files. It includes tools for shuffling, categorizing, and generating reports on dataset content.
☆16Updated last year
Alternatives and similar repositories for data-preparation-for-fine-tuning
Users that are interested in data-preparation-for-fine-tuning are comparing it to the libraries listed below
Sorting:
- Python script for automated clustering of embeddings with DBSCAN, using cosine similarity for flexible, size-agnostic grouping (I hate K-…☆15Updated last year
 - Serverless API Gateway☆70Updated last week
 - Summarize webpages from specified URLs using the LangChain framework and the ChatOllama model☆121Updated last year
 - deduplication☆15Updated 2 years ago
 - A repository trying to translate subtitles with GPT 3.5 Turbo without losing context (using the dynamic window context method).☆31Updated 2 years ago
 - Minimalist search engine for job applications (CVs)☆61Updated 11 months ago
 - A script that uses Telnyx API to make bulk calls to a given list of numbers and analyzes the audio recordings with Whisper. This way, we …☆135Updated 2 years ago
 - ☆13Updated 7 years ago
 - Turkish LM Tuner☆85Updated 11 months ago
 - Turkish-Reading-Comprehension-Question-Answering-Dataset☆84Updated 3 years ago
 - A high-performance Rust tool for sending API requests (to LLMs in my case) with built-in weighted load balancing, retry mechanisms, and r…☆17Updated last year
 - Tiny AI is a platform to create/modify AI powered chatbots. This repository contains ChatGPT plugin and API for talk, create and modify T…☆34Updated last year
 - A curated list of awesome Turkish language processing libraries, models, resources and datasets. The main focus is on open source tools, …☆41Updated 5 years ago
 - Türkiye Teknoloji Takımı Vakfı - Yapay Zeka Usta Eğitimleri Serisi - Makine Öğreniminde Regresyon ve Sınıflandırma☆17Updated 5 years ago
 - Jumpstart Your Cursor AI Projects☆174Updated 8 months ago
 - Turkish WordNet KeNet☆39Updated 7 months ago
 - Muninn is a fast and flexible HTML parsing tool that simplifies the process of extracting data from HTMLs.☆144Updated 8 months ago
 - Repository for "Turkish Wikipedia Based Knowledge Graph (Vikipedi Tabanlı Türkçe Bilgi Çizgesi)" of inzva AI Projects #6☆27Updated 4 years ago
 - This sentiment analysis project determines whether the tweets posted in the Turkish language on Twitter are positive or negative.☆61Updated 2 years ago
 - ☆176Updated 3 months ago
 - Gen AI based travel assistant for Turkish Airlines customers☆11Updated last year
 - Aşağılayıcı Söylemlerin Doğal Dil İşleme İle Tespiti☆22Updated last year
 - Turkish Spell Checker Library☆55Updated last year
 - Python implementation of Zemberek☆128Updated 4 months ago
 - ☆12Updated last year
 - Very early version of the TurkishNLP. For now it has basically 5 main functions; Detecting Turkish Language, syllabicating words, vowel h…☆149Updated last year
 - ☆72Updated 4 months ago
 - TRScraper, doğal dil işleme uygulamalarında kullanılmak amacıyla geliştirilmiş, Türkçe içerik girilen büyük platformlarda metin madencili…☆75Updated 4 years ago
 - Turkish News Category Classification Tutorial☆32Updated 3 years ago
 - Server load testing CLI tool 🏋️☆11Updated 2 years ago