yigitkonur / data-preparation-for-fine-tuning
A Python project for preparing and analyzing datasets from JSONL files. It includes tools for shuffling, categorizing, and generating reports on dataset content.
☆16Updated last year
Alternatives and similar repositories for data-preparation-for-fine-tuning
Users that are interested in data-preparation-for-fine-tuning are comparing it to the libraries listed below
- Python script for automated clustering of embeddings with DBSCAN, using cosine similarity for flexible, size-agnostic grouping (I hate K-…☆12Updated last year
- A repository trying to translate subtitles with GPT 3.5 Turbo without losing context (using the dynamic window context method).☆30Updated last year
- A high-performance Rust tool for sending API requests (to LLMs in my case) with built-in weighted load balancing, retry mechanisms, and r…☆16Updated last year
- It serves to visualize the muninn configs you have prepared.☆20Updated 3 years ago
- Serverless API Gateway☆70Updated this week
- Minimalist search engine for job applications (CVs)☆60Updated 6 months ago
- Kubernetes logs to MongoDB☆16Updated 3 years ago
- deduplication☆14Updated 2 years ago
- a lightweight and simple cli package☆13Updated 3 years ago
- 📮 Lightweight, high performance and configurable API rate limiter.☆15Updated 4 years ago
- Summarize webpages from specified URLs using the LangChain framework and the ChatOllama model☆117Updated 8 months ago
- Serverless Server Side Rendering with AWS CDK and Next.js☆24Updated 3 years ago
- Spotify Web API wrapper for Cloudflare Workers☆19Updated last year
- Tiny AI is a platform to create/modify AI powered chatbots. This repository contains ChatGPT plugin and API for talk, create and modify T…☆34Updated last year
- Detection of Derogatory Discourse with Natural Language Processing☆22Updated last year
- ☆31Updated last month
- Summarizes podcasts from RSS feed URL using whisperX and GPT3.5 APIs from OpenAI☆16Updated last year
- Server load testing CLI tool 🏋️☆10Updated last year
- Turkish News Category Classification Tutorial☆30Updated 3 years ago
- Ekşi Sözlük API☆9Updated 3 years ago
- A fuzzy key-value store based on semantic similarity rather than lexical equality. (python version)☆13Updated 10 months ago
- Turkish LM Tuner☆84Updated 7 months ago
- OpenAI APIs for Earthquake Related Tasks☆13Updated 2 years ago
- A database designer tool that persists its data in LocalStorage☆14Updated 6 years ago
- From the Açık Seminer series (https://www.acikseminer.com/), organized by the Türkiye Açık Kaynak Platformu: the natural language processing…☆22Updated 5 years ago
- A Turkish Text-to-SQL Dataset☆9Updated 4 months ago
- A tool that finds related SERP results for the given input.☆11Updated 4 years ago
- Binalyze logger is an easily customizable wrapper for logrus with log rotation☆28Updated 3 years ago
- Turkish social media analysis with BERTurk-Social.☆15Updated 2 years ago
- The repository where the Turkish translation of Learn Go with test-driven development is in progress.☆12Updated last year