Synthetic dataset generation workflow using local file resources for finetuning LLMs.
☆83Oct 9, 2025Updated 5 months ago
Alternatives and similar repositories for local-datagen-cli
Users who are interested in local-datagen-cli are comparing it to the libraries listed below.
- The rag pipeline for optimizing dynamic data editing.☆20Oct 30, 2025Updated 4 months ago
- Using deep research workflow to generate datasets for finetuning LLMs.☆39Oct 9, 2025Updated 5 months ago
- Text Match Cut Video Generator Web App☆36Feb 19, 2026Updated 2 weeks ago
- world's stupidest moe llm in 103M parameters☆20Jul 18, 2025Updated 7 months ago
- FastAPI + MLX offline-first voice agent with <1s latency. Minimal UI☆47Oct 21, 2025Updated 4 months ago
- A framework for creating message-driven training systems with PyTorch☆21Oct 7, 2025Updated 5 months ago
- ☆17Jul 10, 2025Updated 7 months ago
- 🔍📃 LLM-powered PDF Table Extractor☆19Jun 26, 2025Updated 8 months ago
- Efficient non-uniform quantization with GPTQ for GGUF☆61Sep 17, 2025Updated 5 months ago
- ☆24Aug 26, 2025Updated 6 months ago
- Dynamic Graphviz graphs with collapsible/expandable clusters and edge highlighting through user clicks. Uses either csv files or …☆34Aug 13, 2025Updated 6 months ago
- HippocampAI — Autonomous Memory Engine for LLM Agents☆60Feb 13, 2026Updated 3 weeks ago
- An AI Vision Language Model System for extracting structured knowledge graph information (JSON) from images of process diagrams☆40Apr 5, 2025Updated 11 months ago
- LinkedIn Lead Scraper - Automated Profile Discovery & Lead Generation Tool☆27Jan 21, 2026Updated last month
- ☆37Sep 21, 2025Updated 5 months ago
- Search, monitor, and nuke processes with ease, with system resource tracking☆58Feb 21, 2026Updated 2 weeks ago
- Generates control vectors for use with llama.cpp in GGUF format.☆38Mar 19, 2025Updated 11 months ago
- interactive semantic search demo using Qwen3-0.6B-Embedding in your browser☆56Feb 25, 2026Updated last week
- ComfyUI custom node to extend Wan videos in loops with overlap consistency, per loop prompts, and optional LoRA control.☆25Nov 29, 2025Updated 3 months ago
- Software package for holographic optical trapping (HOT) released under GPL and LGPL. Requires a CUDA compatible GPU and LabVIEW☆15Jul 2, 2022Updated 3 years ago
- An even smaller speech recognizer / force aligner☆37Dec 16, 2024Updated last year
- Spellbound - your multilingual AI-powered writing assistant☆12May 12, 2025Updated 9 months ago
- vTPM with SGX protection☆11May 30, 2019Updated 6 years ago
- Creates a CMM script that can be directly executed on Kaggle from an easy merge script☆14Jan 12, 2026Updated last month
- Natural language control for Python CLI tools using locally-trained SLMs (CPU inference)☆30Feb 21, 2026Updated 2 weeks ago
- LexiCrawler is a powerful Go-based web crawling API meticulously designed to extract, clean, and transform web page content into a pristi…☆48Feb 27, 2025Updated last year
- Easy-to-use, high-performance knowledge distillation for LLMs☆96May 5, 2025Updated 10 months ago
- Efforts toward giving Qwen 3 Coder 30B A3B proper agentic tool calling capabilities at or near 100% reliability.☆65Aug 10, 2025Updated 6 months ago
- Implementation of a fast semantic chunker in C++, installable in python 3.7+ projects.☆22Sep 20, 2025Updated 5 months ago
- Local LLM Testing & Benchmarking for Apple Silicon☆56Feb 26, 2026Updated last week
- The official repo for "CodeScaler: Scaling Code LLM Training and Test-Time Inference via Execution-Free Reward Models"☆30Updated this week
- Source code to accompany research paper on training multi token prediction language models using self-distillation.☆24Feb 21, 2026Updated 2 weeks ago
- A Model Agnostic function to directly remove specified layers from the LLM☆10May 23, 2024Updated last year
- Strapi CMS upload provider using AWS S3 with cloudfront cdn option☆10Jan 24, 2024Updated 2 years ago
- Visualise your Google Tag Manager container's contents.☆10Sep 12, 2015Updated 10 years ago
- Koel Labs innovates open-source speech research, inclusive speech technologies, and real-time pronunciation feedback for language learner…☆18Feb 25, 2026Updated last week
- ☆64Jun 24, 2025Updated 8 months ago
- Manage a Google Drive Service Account visually☆12Oct 17, 2024Updated last year
- Fixed it, so that years actually make sense, instead of AD and BC nonsense☆14Mar 21, 2025Updated 11 months ago