TXT文本语料数据清洗(Text corpus data cleaning):1> 合并TXT文件;2> 过滤干扰字符串;3> 对人名、地名、组织机构进行遮码处理;4> 将其他编码格式统一转换为UTF-8
☆19Oct 14, 2022Updated 3 years ago
Alternatives and similar repositories for txtfilemerge
Users that are interested in txtfilemerge are comparing it to the libraries listed below
Sorting:
- A Python toolkit for file processing, text cleaning and data splitting. 文件处理,文本清洗和数据划分的python工具包。☆35Oct 18, 2022Updated 3 years ago
- Demo for DART, Audio Imagination workshop submission in NeurIPS 2024☆12Apr 15, 2025Updated 10 months ago
- Guides to hopefully simplify the process of using ROCm.☆12Sep 26, 2024Updated last year
- ☆10Aug 30, 2025Updated 6 months ago
- ☆10May 21, 2021Updated 4 years ago
- Vin Carnival is a non-profit open source virtual reality game made with Unity 3D game engine and GoogleVR.☆10Oct 3, 2018Updated 7 years ago
- Node and Browser env supported WebAssembly version of fastText: Library for efficient text classification and representation learning.☆12Sep 17, 2024Updated last year
- This is not remotely close to a finished product, and does not intend to nor does this claim to be working fine-tuning code for MaskGCT. …☆13Dec 4, 2024Updated last year
- URP14 Shadow Sample☆13Nov 27, 2023Updated 2 years ago
- Signed Distance Field Map Generator☆10Jun 19, 2023Updated 2 years ago
- A lightweight muji-moe chatbot created by Reecho.ai.☆13Oct 1, 2024Updated last year
- A thin cython wrapper around llama.cpp, whisper.cpp and stable-diffusion.cpp☆16Feb 10, 2026Updated 3 weeks ago
- Unityでデスクトップマスコットを作る際のフレームワーク、軽量にしたつもり☆11Sep 9, 2020Updated 5 years ago
- a unity-package allows to make annotations on arbitrary Unity-scenes of architectural sites☆15Dec 11, 2017Updated 8 years ago
- Chinese Prosodic Structure Prediction☆10May 18, 2019Updated 6 years ago
- ☆11Nov 30, 2020Updated 5 years ago
- MMD Camera Path Vmd's File For Unity3D☆12Sep 19, 2017Updated 8 years ago
- lightsmile个人的用于爬取网络公开语料数据的mini通用爬虫框架。☆13Sep 30, 2020Updated 5 years ago
- Play and solve a 3x3x3 Rubik's cube with Thistlethwaite's algorithm in Unity C#. (42 Silicon Valley)☆12Apr 20, 2019Updated 6 years ago
- A Stable Diffusion desktop frontend with inpainting, img2img and more!☆10Oct 8, 2022Updated 3 years ago
- ☆15Feb 6, 2026Updated last month
- Official code repository for AAAI2021 paper Finding Sparse Structures for Domain Specific Neural Machine Translation☆11Apr 1, 2021Updated 4 years ago
- A blazing fast CLIP gRPC service in rust.☆16Aug 9, 2023Updated 2 years ago
- 史藏☆12Sep 23, 2016Updated 9 years ago
- A demonstration of how to train a custom tokenizer similar to TikToken.☆15Jan 6, 2025Updated last year
- UnitychanToonShaderを使用しているマテリアルのパラメータを、特定のシェーダ/マテリアルに変換するエディタ拡張☆12Mar 17, 2020Updated 5 years ago
- 诈骗脚本语料库数据集☆12Apr 20, 2022Updated 3 years ago
- bumble bee transformer☆14Apr 19, 2021Updated 4 years ago
- 传送门☆13Dec 13, 2018Updated 7 years ago
- A high-performance script to extract the vast majority of artists on Spotify in a rapid, asynchronous, and multiprocessed manner that can…☆13Jun 3, 2024Updated last year
- 4-layer(RGBA) parallax hair shader☆15May 8, 2018Updated 7 years ago
- Hackintosh EFI for MSI Pro B760M-A WIFI DDR4 Ⅱ(2 Gen) + i5-12600kf + Gigabyte Radeon RX 6600 XT Gaming OC 8G | 黑苹果 macOS & Windows 双系统配置 …☆16Oct 16, 2024Updated last year
- This project allow to construct and edit a node graph in a Unity game in playmode (at runtime)☆15Nov 6, 2022Updated 3 years ago
- Beautify github user activity display☆18Dec 9, 2024Updated last year
- VMCProtocolの受信内容をVMC互換で表示するソフトウェア。 バーチャルモーションキャプチャーとほぼ同等の表示を実現しようとします。☆13Jul 26, 2020Updated 5 years ago
- VRM importer extension for Unity URP☆17Apr 7, 2021Updated 4 years ago
- Convert claude to chatgpt form api through Slack☆15Jun 7, 2023Updated 2 years ago
- Node graph editor framework focused on data processing using Unity UIElements and C# 4.6☆14Aug 13, 2024Updated last year
- Digital Image Watermarking use matlab(DWT,DCT), GUI use python☆13Oct 8, 2019Updated 6 years ago