synthetic dataset generation workflow using local file resources for finetuning llms.
β82Oct 9, 2025Updated 6 months ago
Alternatives and similar repositories for local-datagen-cli
Users that are interested in local-datagen-cli are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ππ LLM-powered PDF Table Extractorβ19Jun 26, 2025Updated 9 months ago
- β19Jul 4, 2025Updated 9 months ago
- β17Jul 10, 2025Updated 9 months ago
- world's stupidest moe llm in 103M parametersβ20Jul 18, 2025Updated 9 months ago
- A framework for creating message-driven training systems with PyTorchβ21Oct 7, 2025Updated 6 months ago
- Managed hosting for WordPress and PHP on Cloudways β’ AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- β29Mar 15, 2026Updated last month
- Like system requirements lab but for LLMsβ31Jun 10, 2023Updated 2 years ago
- Authenticated Knowledge & Trust Architecture for AI Agentsβ32Dec 17, 2025Updated 4 months ago
- β17Dec 16, 2024Updated last year
- SwiftUI menu bar app for monitoring application bandwidth useβ74Mar 27, 2026Updated 3 weeks ago
- llmBench is a high-depth benchmarking tool designed to measure the raw performance of local LLM runtimes (Ollama, llama.cpp) while providβ¦β45Mar 15, 2026Updated last month
- Deploying full-stack on-prem deep research agent that can be run entirely on a local machine for $0!β34Nov 8, 2025Updated 5 months ago
- Text Match Cut Video Generator Web Appβ36Feb 19, 2026Updated 2 months ago
- The Oracle β The World's First AI Agent That Sees The Web, Right From Your Terminal. It searches the web, sees images & charts, and citesβ¦β25Jul 7, 2025Updated 9 months ago
- Managed hosting for WordPress and PHP on Cloudways β’ AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- β27Mar 2, 2026Updated last month
- PVMSS is a lightweight, self-service web portal for Proxmox Virtual Environment. It allows users to create and manage virtual machines wiβ¦β41Updated this week
- the composable multi-agent shellβ319Updated this week
- Efficient non-uniform quantization with GPTQ for GGUFβ63Sep 17, 2025Updated 7 months ago
- Professional desktop app for converting text to audiobooks with local TTSβ32Oct 6, 2025Updated 6 months ago
- Search, monitor, and nuke processes with ease, with system resource trackingβ60Feb 21, 2026Updated last month
- Extract data from websites in LLM ready JSON or CSV format. Crawl or Scrape entire website with Website Crawlerβ74Feb 19, 2026Updated 2 months ago
- Schedule and manage local tasks on macOS. Features a native SwiftUI interface, live log streaming, and natural language scheduling.β57Apr 3, 2026Updated 2 weeks ago
- Just a series of workflows Iβ34Sep 10, 2025Updated 7 months ago
- GPUs on demand by Runpod - Special Offer Available β’ AdRun AI, ML, and HPC workloads on powerful cloud GPUsβwithout limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
- Iβm trying to create something similar to Grammarly. Hail to open source!β15Jun 5, 2025Updated 10 months ago
- Qwen LLM in the mac menu bar <3β27Mar 12, 2025Updated last year
- An AI Vision Language Model System for extracting structured knowledge graph information(JSON) from images of process diagramsβ42Apr 5, 2025Updated last year
- Demo of fine-tuning QA models for answering FAQ of cloud providers documentationβ11Mar 7, 2023Updated 3 years ago
- Easy to use, High Performant Knowledge Distillation for LLMsβ98May 5, 2025Updated 11 months ago
- πΈπ¬ - a deep learning toolkit for Text-to-Speech, battle-tested in research and productionβ16Aug 16, 2024Updated last year
- interactive semantic search demo using Qwen3-0.6B-Embedding in your browserβ58Feb 25, 2026Updated last month
- An agentic runtime that enables secure, extensible and configurable AI automation from any modelβ18Updated this week
- Precision Knowledge Editing (PKE): A novel method to reduce toxicity in LLMs while preserving performance, with robust evaluations and haβ¦β11Nov 26, 2024Updated last year
- Virtual machines for every use case on DigitalOcean β’ AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- β24Aug 26, 2025Updated 7 months ago
- Recursive Self-Aggregation evals on ARC-AGIβ31Jan 26, 2026Updated 2 months ago
- Talk to your shell in natural language. Locally.β54Feb 15, 2026Updated 2 months ago
- π‘οΈ AI-powered system security auditor. A cross-platform TUI/CLI tool to analyze processes, network, and packages on Linux & Windows usinβ¦β41Dec 9, 2025Updated 4 months ago
- CIKM 2022: CorpusBrain: Pre-train a Generative Retrieval Model for Knowledge-Intensive Language Tasksβ34Aug 31, 2022Updated 3 years ago
- Implementation of a fast semantic chunker in C++, installable in python 3.7+ projects.β22Sep 20, 2025Updated 6 months ago
- Natural language β shell command, just press TABβ34Mar 6, 2026Updated last month