A tool for creating pre-training datasets for language models, supporting one-click batch processing for both text and image datasets. 一个专为语言模型预训练设计的数据集制作工具,支持文本和图像数据集的一键式批量处理。
☆43Dec 18, 2024Updated last year
Alternatives and similar repositories for Pretuning
Users that are interested in Pretuning are comparing it to the libraries listed below
Sorting:
- 基于youtube、bilibili等视频平台、webpage网页等,利用零一万物大模型或ollama本地小模型构建大语言模型高质量训练数据集(计划支持可自定义输出的训练数据格式)☆19May 2, 2024Updated last year
- 训练自己的中文 Embedding 模型☆28Jan 6, 2025Updated last year
- github信息泄露搜集工具。GSIL升级版,去除发邮件方式,将结果保存在本地☆13Mar 20, 2021Updated 4 years ago
- 不用搭建环境,解压即用,4G显存可用☆12Mar 1, 2025Updated last year
- [KGC '24] This application is for visualisation of Knowledge Graphs. We employe a novel technique which uses LLM based agent for triple e…☆11Apr 17, 2024Updated last year
- ☆39Feb 16, 2024Updated 2 years ago
- ☆24Nov 21, 2025Updated 3 months ago
- 参考 Chat2DB 的效果,使用 chatgpt 进行自然语言翻译,然后对数据库进行操作,使用 rust 语言实现的 web 应用。☆10Jan 13, 2025Updated last year
- Fine Tune DeepSeek☆44Feb 4, 2025Updated last year
- 专注于vite + vue3 的项目开发脚手架☆10May 31, 2025Updated 9 months ago
- AbationGraph® is a time-series knowledge graph database for real-time data analysis☆16Nov 23, 2023Updated 2 years ago
- a.k.a autoMBW-V2☆10Sep 6, 2024Updated last year
- ☆11Oct 11, 2023Updated 2 years ago
- Python library for solving reinforcement learning (RL) problems using generative models.☆11Feb 18, 2025Updated last year
- 图片压缩工具 类似于 tinify☆14Oct 5, 2024Updated last year
- A web framework for go like nestjs☆12Feb 26, 2025Updated last year
- 多级缓存架构项目,kafka读取缓存更新请求,从数据库获取缓存,写入ehcache和redis☆11Jul 6, 2018Updated 7 years ago
- Trojan 协议的 java 服务端实现☆10Feb 15, 2023Updated 3 years ago
- mybatis reactive based on r2dbc. 响应式、非阻塞 mybatis 实现☆10Aug 24, 2021Updated 4 years ago
- The real GPT-4 with image access (You probably don't have access)☆12Mar 17, 2023Updated 2 years ago
- Simple Zeroconf/mDNS scanner written in Go with no external dependencies☆12Apr 20, 2021Updated 4 years ago
- An Introductory Jupyter Notebook to Manipulate Ontologies with Owlready2☆11Jan 10, 2020Updated 6 years ago
- Small snippets of code we often find useful☆11Nov 9, 2019Updated 6 years ago
- ☆10Nov 25, 2022Updated 3 years ago
- 臸娥粂陆亩竟☆10May 11, 2024Updated last year
- ToolBar☆12Apr 3, 2018Updated 7 years ago
- Simple cache implementation on java☆11Jun 17, 2024Updated last year
- spring 各种组件大合集☆12Sep 1, 2022Updated 3 years ago
- EContract是一个基于PKI和二维码实现的身份验证和多点登录的电子合同系统。☆10Jul 12, 2025Updated 7 months ago
- SelfDrive_RCCar☆11Sep 9, 2024Updated last year
- Midjourney X Instant Collage -- Collage Template + Grid + Quality Style☆12May 25, 2025Updated 9 months ago
- 💻NUAA 2018 操作系统小作业-模拟内存分配程序(BF算法)☆13Jul 2, 2018Updated 7 years ago
- ☆12Jan 8, 2023Updated 3 years ago
- A semi-automated system based on LLM's to generate ontologies from datasets☆21Oct 29, 2024Updated last year
- 【自用】2024 计算机考研复习文档☆11Oct 18, 2023Updated 2 years ago
- Website vuln example.☆11Sep 26, 2025Updated 5 months ago
- The simple utils facade for javascript/typescript,the only utils you need for frontEnd application☆16Jun 20, 2025Updated 8 months ago
- A Simple Game Using Unity ML-Agents☆10Nov 20, 2020Updated 5 years ago
- A sample project that uses GPT4ALL Java bindings☆11Aug 24, 2023Updated 2 years ago