I got tired of manually creating training datasets, so I built this. Transform your PDFs/docs into fine-tuning data automatically.
☆30Sep 2, 2025Updated 7 months ago
Alternatives and similar repositories for ai-dataset-generator
Users who are interested in ai-dataset-generator are comparing it to the libraries listed below.
- 🌍 The open-source Wikipedia of AI — 2M+ apps, agents, LLMs & datasets. Updated daily with tools, tutorials & news.☆50Updated this week
- An implementation of an iterative knowledge base search agent using Agno agent framework, inspired by Ashpreet DeepKnowledge concept usin…☆15Feb 6, 2025Updated last year
- ☆12Jul 25, 2024Updated last year
- ☆27May 19, 2025Updated 11 months ago
- Sparse Embedding Compression for Scalable Retrieval in Recommender Systems☆35Nov 21, 2025Updated 4 months ago
- ☆36May 21, 2025Updated 11 months ago
- A TypeScript-based MCP-server tool enabling concurrent chains of thought with real-time reinforcement learning. Seamlessly integrates wit…☆20Mar 17, 2025Updated last year
- Multimodal AI workloads: batch inference, model training and online serving.☆107Aug 22, 2025Updated 7 months ago
- ☆39Jul 14, 2024Updated last year
- Turn your Readwise library into a blazing-fast, self-hosted semantic search engine – complete with nightly syncs, vector search API, Prom…☆24Jul 28, 2025Updated 8 months ago
- ☆14Oct 11, 2024Updated last year
- Python FastAPI "Circuit Breaker" implementation☆13Mar 14, 2025Updated last year
- CopilotKit utilizing LangGraph with Python☆24Mar 12, 2026Updated last month
- How far can we go with an LLM for a classification problem☆24Nov 24, 2024Updated last year
- Local UI for AI task management.☆37Jul 1, 2025Updated 9 months ago
- Service for detecting hallucinations in AI generated text responses. Originally created for the AI for Media Network hackathon 2024.☆18Oct 11, 2024Updated last year
- Guides for Model Context Protocol (MCP) and Agent Communication Protocol (ACP)☆21Jan 26, 2026Updated 2 months ago
- ☆11Oct 13, 2020Updated 5 years ago
- This repository contains a wide variety of technical exercises written in Python☆12Feb 16, 2025Updated last year
- Turn natural language into Grafana dashboards. Powered by AI, VizGenie auto-generates PromQL queries, builds visualizations, and deploys …☆14Feb 19, 2026Updated 2 months ago
- Personal Python notes enhanced with AI. Lessons, exercises, and documentation for learning as a community. #opensource☆21Aug 2, 2025Updated 8 months ago
- ☆14May 25, 2024Updated last year
- Multi-Agent Blog Generator based on Agno framework. Supports leading LLM providers like OpenAI, Gemini, Claude, and Grok.☆74Jan 6, 2026Updated 3 months ago
- ☆10Feb 27, 2025Updated last year
- Transform unstructured documents into validated, rich and queryable knowledge graphs.☆134Updated this week
- Google Cloud Certified Cloud Digital Leader - Foundational☆10May 26, 2025Updated 10 months ago
- File generator for creating custom Docker environments☆18Mar 15, 2025Updated last year
- High-performance EtherNet/IP driver for Allen-Bradley PLCs, written in Rust.☆32Updated this week
- Building Modern Data Applications Using Databricks Lakehouse, published by Packt☆24Nov 13, 2024Updated last year
- Testing DeepSpeed integration in 🤗 Accelerate☆11Jun 28, 2022Updated 3 years ago
- Bootstrap project to start your own local AI lab☆21Dec 27, 2025Updated 3 months ago
- CyberPreacher cloud project collection☆16Dec 21, 2025Updated 4 months ago
- Serverless ML Course for building AI-enabled Prediction Services from models and features☆15Oct 19, 2022Updated 3 years ago
- ☆27Apr 16, 2024Updated 2 years ago
- ☆13Jan 12, 2025Updated last year
- Portable, fully offline Markdown editor for Windows. No install, no internet, no Electron.☆37Mar 18, 2026Updated last month
- SIGnature is a Python package that empowers researchers to rapidly query gene sets across diverse single-cell RNA sequencing (scRNA-seq) …☆24Updated this week
- Repository containing information about the cloud study group at UdeA☆19Sep 22, 2025Updated 6 months ago
- Fine-tune copilot based on your codebase☆12Mar 26, 2024Updated 2 years ago