A tool for creating pre-training datasets for language models, supporting one-click batch processing for both text and image datasets.
☆43Dec 18, 2024Updated last year
Alternatives and similar repositories for Pretuning
Users that are interested in Pretuning are comparing it to the libraries listed below
- A learning project for building local knowledge bases from PDFs using LangChain, supporting multiple LLMs (DeepSeek, OpenAI). Features in…☆226Jan 30, 2025Updated last year
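- Builds high-quality training datasets for large language models from video platforms such as YouTube and bilibili and from web pages, using 01.AI (零一万物) models or small local Ollama models (customizable output formats for the training data are planned)☆19May 2, 2024Updated last year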
- CAD - Memory Efficient Convolutional Adapter for Segment Anything☆12Oct 4, 2024Updated last year
- Modified Beam Search with periodical restart☆12Sep 12, 2024Updated last year
- MCP for Security: A collection of Model Context Protocol servers for popular security tools like SQLMap, FFUF, NMAP, Masscan and more. In…☆20Apr 25, 2025Updated 10 months ago
- Yet another frontend for LLM, written using .NET and WinUI 3☆10Sep 14, 2025Updated 6 months ago
- Python library for solving reinforcement learning (RL) problems using generative models.☆11Feb 18, 2025Updated last year
- Model Based Testing of the App Based On The Description from Constructing the User Interface with Statecharts Book of Ian Horrocks using …☆13Feb 20, 2024Updated 2 years ago
- ☆10May 24, 2024Updated last year
- AI-driven GEO optimization tool - get your content cited first by AI engines such as ChatGPT and Perplexity☆50Nov 3, 2025Updated 4 months ago
- Recurrent AMP: Adversarial Motion Priors for Stylized Physics-Based Character Control in Isaac Gym 4☆10Jan 27, 2024Updated 2 years ago
- Advanced Video Graph RAG using SAM2, CLIP, BLIP, Qwen2-VL, YOLO-World, Neo4j, WebGPU, local LLM☆14Nov 25, 2024Updated last year
- Annotations for the Mistake Detection benchmark of Assembly101☆10Aug 3, 2023Updated 2 years ago
- Prompt-based pipeline for extracting procedural knowledge graphs from text with LLMs☆15Feb 17, 2026Updated last month
- A Simple Game Using Unity ML-Agents☆10Nov 20, 2020Updated 5 years ago
- Implementation of CrossLoco, currently lite version☆14May 12, 2024Updated last year
- A semi-automated system based on LLMs to generate ontologies from datasets☆24Oct 29, 2024Updated last year
- AbationGraph® is a time-series knowledge graph database for real-time data analysis☆23Mar 12, 2026Updated last week
- ☆39Feb 16, 2024Updated 2 years ago
- Code for AISTATS 2023 paper - Estimating Total Correlation with Mutual Information Estimators☆17Dec 15, 2023Updated 2 years ago
- Let's raise an AI cat with its own dedicated memory!☆10Oct 25, 2024Updated last year
- An agent that can run everywhere - even in your watch!☆30Mar 5, 2026Updated 2 weeks ago
- Authenticated Knowledge & Trust Architecture for AI Agents☆30Dec 17, 2025Updated 3 months ago
- RWKV-7 mini☆12Mar 29, 2025Updated 11 months ago
- ☆17Feb 20, 2025Updated last year
- ☆10Jun 1, 2014Updated 11 years ago
- An Introductory Jupyter Notebook to Manipulate Ontologies with Owlready2☆11Jan 10, 2020Updated 6 years ago
- A unified tool to generate fine-tuning datasets for LLMs, including questions, answers, and dialogues. ✨🤖📚💬☆62Mar 14, 2025Updated last year
- Huarong Dao (Klotski), a single-player puzzle game☆10Feb 12, 2019Updated 7 years ago
- ☆11Feb 20, 2025Updated last year
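- A tool for collecting information leaked on GitHub. An upgraded version of GSIL that drops the email notification method and saves results locally☆13Mar 20, 2021Updated 5 years ago
- No environment setup needed; unzip and run; works with 4 GB of VRAM☆12Mar 1, 2025Updated last year
- Java server-side implementation of the Trojan protocol☆10Feb 15, 2023Updated 3 years ago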
- ☆12Jul 14, 2024Updated last year
- A web application written in Rust that, modeled on Chat2DB, uses ChatGPT to translate natural language and then operate on a database.☆10Jan 13, 2025Updated last year
- ☆13Apr 15, 2024Updated last year
- The real GPT-4 with image access (You probably don't have access)☆12Mar 17, 2023Updated 3 years ago
- Papers about hypergraph, their applications, and even similar ideas.☆20Nov 15, 2021Updated 4 years ago
- Grasp Generation models on OakInk-Shape dataset☆17Apr 4, 2024Updated last year