yigitkonur / data-preparation-for-fine-tuning
A Python project for preparing and analyzing datasets from JSONL files. It includes tools for shuffling, categorizing, and generating reports on dataset content.
☆15Updated last year
Alternatives and similar repositories for data-preparation-for-fine-tuning:
Users that are interested in data-preparation-for-fine-tuning are comparing it to the libraries listed below
- Python script for automated clustering of embeddings with DBSCAN, using cosine similarity for flexible, size-agnostic grouping (I hate K-…☆8Updated 11 months ago
- A repository trying to translate subtitles with GPT 3.5 Turbo without losing context (using the dynamic window context method).☆29Updated last year
- A high-performance Rust tool for sending API requests (to LLMs in my case) with built-in weighted load balancing, retry mechanisms, and r…☆14Updated 7 months ago
- Serverless API Gateway☆65Updated this week
- Minimalist search engine for job applications (CVs)☆56Updated last month
- A curated list of awesome Turkish language processing libraries, models, resources and datasets. The main focus is on open source tools, …☆37Updated 4 years ago
- It serves to visualize the muninn configs you have prepared.☆20Updated 3 years ago
- A tool that finds related serp results for the given input.☆11Updated 4 years ago
- Server load testing CLI tool 🏋️☆10Updated last year
- Aşağılayıcı Söylemlerin Doğal Dil İşleme İle Tespiti☆22Updated last year
- deduplication☆13Updated last year
- Spotify Web API wrapper for Cloudflare Workers☆18Updated last year
- a lightweight and simple cli package☆13Updated 3 years ago
- Code and slides for Kodla 2022☆23Updated 2 years ago
- Kubernetes logs to MongoDB☆16Updated 3 years ago
- Serverless Server Side Rendering with AWS CDK and Next.js☆24Updated 3 years ago
- Making homepage clone example of wope.com site with Angular framework.☆10Updated last year
- Binalyze logger is an easily customizable wrapper for logrus with log rotation☆28Updated 3 years ago
- Ekşi Sözlük API☆9Updated 3 years ago
- Developer's Helper to Docker, Kubernetes, and Terraform. Fully automatic, without any config or question 🙌☆80Updated 3 years ago
- Go-based task queue with MongoDB storage for distributed app tasks☆14Updated 9 months ago
- Summarize webpages from specified URLs using the LangChain framework and the ChatOllama model☆108Updated 3 months ago
- 📮 Lightweight, high performance and configurable API rate limiter.☆14Updated 3 years ago
- Turkish LM Tuner☆83Updated 2 months ago
- Tiny AI is a platform to create/modify AI powered chatbots. This repository contains ChatGPT plugin and API for talk, create and modify T…☆34Updated 7 months ago
- ☆12Updated 6 months ago
- A Command Line Interface that is designed for technical SEOs☆12Updated 3 years ago
- Calculates the calories of food.☆29Updated 4 years ago
- This is the reporsitory of Turkish fake news dataset which consists of Zaytung posts and Hurriyet news articles.☆14Updated 5 years ago