This project generates a high-quality Alpaca-style dataset from input text files, PDFs, and Word documents.
☆57Apr 20, 2025Updated 10 months ago
Alternatives and similar repositories for alpaca-dataset-generator
Users that are interested in alpaca-dataset-generator are comparing it to the libraries listed below
- ☆15Apr 9, 2025Updated 10 months ago
- Create text chunks which end at natural stopping points without using a tokenizer☆26Nov 26, 2025Updated 3 months ago
- A proxy that hosts multiple single-model runners such as llama.cpp and vLLM☆12May 30, 2025Updated 9 months ago
- Simple node proxy for llama-server that enables MCP use☆17May 10, 2025Updated 9 months ago
- A browser GUI for nvidia-smi☆20Mar 17, 2025Updated 11 months ago
- ☆19Jul 4, 2025Updated 8 months ago
- ☆17Dec 16, 2024Updated last year
- The PyTorch Library for LLM Applications.☆17Jul 16, 2024Updated last year
- A forward proxy to turn network traffic into personal memory for AI agents☆36Feb 23, 2026Updated last week
- Chat from Python with local Ollama models, Anthropic, and OpenAI☆24Apr 13, 2025Updated 10 months ago
- A list of language models with permissive licenses such as MIT or Apache 2.0☆24Feb 28, 2025Updated last year
- Super simple python connectors for llama.cpp, including vision models (Gemma 3, Qwen2-VL). Compile llama.cpp and run!☆29Dec 11, 2025Updated 2 months ago
- ☆35Nov 18, 2025Updated 3 months ago
- An AI Vision Language Model System for extracting structured knowledge graph information (JSON) from images of process diagrams☆40Apr 5, 2025Updated 11 months ago
- ☆75Jan 10, 2026Updated last month
- A unified library for interacting with various AI APIs through a standardized interface.☆35Mar 13, 2025Updated 11 months ago
- A Windows tool to query various LLM AIs. Supports branched conversations, history and summaries among others.☆35Feb 11, 2026Updated 3 weeks ago
- Generates control vectors for use with llama.cpp in GGUF format.☆38Mar 19, 2025Updated 11 months ago
- Text to audio with TikTok Voices☆13Apr 6, 2023Updated 2 years ago
- A blueprint for next-gen AI. Project Infinity uses a token-efficient, Codified Agent Protocol to create specialized, secure, and imaginat…☆25Oct 2, 2025Updated 5 months ago
- Using Random Forest algorithm to detect automated accounts on Twitter and Instagram☆11Jun 21, 2024Updated last year
- LexiCrawler is a powerful Go-based web crawling API meticulously designed to extract, clean, and transform web page content into a pristi…☆48Feb 27, 2025Updated last year
- ☆10Sep 29, 2024Updated last year
- A file-based CLI notetaker☆23Feb 13, 2026Updated 3 weeks ago
- Cherry Flowers everywhere☆11Jul 19, 2024Updated last year
- An example of searching and sorting a list of data with Ionic 3 and Angular 5☆10Dec 18, 2017Updated 8 years ago
- Demonstrate using MCP with Pydantic AI framework☆14Mar 14, 2025Updated 11 months ago
- DSPYT Website - Fostering Innovation and Education in Web3 Technologies☆12Feb 15, 2026Updated 2 weeks ago
- Home server set up☆13Oct 5, 2025Updated 5 months ago
- LinkedInLearning / OpenAI-API-Building-Front-End-Voice-Apps-with-the-Realtime-API-and-WebRTC-2027322: This is a repository for the LinkedIn Learning course OpenAI API: Building Front-End Voice Apps with the Realtime API and WebRTC☆16Nov 19, 2025Updated 3 months ago
- Convert Confluence MIME exports (.doc) to clean Markdown☆34Jan 13, 2026Updated last month
- Simple and powerful extension for searching web and viewing website content.☆11Apr 11, 2025Updated 10 months ago
- A modern-style viewer for Danbooru or other Booru API-based sites.☆13May 23, 2025Updated 9 months ago
- A convenient primitive for creating, structuring and throwing errors☆13Oct 26, 2025Updated 4 months ago
- Scraping Douban Books with Scrapy☆10Aug 19, 2016Updated 9 years ago
- Life's Work - TheWorldSystem☆37Nov 5, 2025Updated 4 months ago
- Cleanai (https://github.com/willmil11/cleanai) except I'm making it in c now. Fast and clean from the start this time :)☆17Feb 28, 2026Updated last week
- Copilot with deepseek and more...☆13Mar 7, 2025Updated last year
- MCP server for GNU Radio☆31Jan 5, 2026Updated 2 months ago