TXT文本语料数据清洗(Text corpus data cleaning):1> 合并TXT文件;2> 过滤干扰字符串;3> 对人名、地名、组织机构进行遮码处理;4> 将其他编码格式统一转换为UTF-8
☆19Oct 14, 2022Updated 3 years ago
Alternatives and similar repositories for txtfilemerge
Users that are interested in txtfilemerge are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Demo for DART, Audio Imagination workshop submission in NeurIPS 2024☆13Apr 15, 2025Updated last year
- This is not remotely close to a finished product, and does not intend to nor does this claim to be working fine-tuning code for MaskGCT. …☆13Dec 4, 2024Updated last year
- Chinese Prosodic Structure Prediction☆10May 18, 2019Updated 6 years ago
- 中文大语言模型评测2024高考数学专题☆19Jun 14, 2024Updated last year
- Tensorflow Implementation of "Theory and Experiments on Vector Quantized Autoencoders"☆15Feb 27, 2019Updated 7 years ago
- Bare Metal GPUs on DigitalOcean Gradient AI • AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- Official code repository for AAAI2021 paper Finding Sparse Structures for Domain Specific Neural Machine Translation☆11Apr 1, 2021Updated 5 years ago
- ChatGPT中文 学习和实践资料汇总——LLaMA、ChatGLM等大模型的Finetune☆14Apr 17, 2023Updated 3 years ago
- A Large-Scale Dataset for Long Text and Multi-Table Summarization☆18Feb 21, 2024Updated 2 years ago
- source code of EfficientTTS 2☆20Feb 18, 2024Updated 2 years ago
- speech-aligner,是一个从“人声语音”及其“语言文本”,产生音素级别时间对齐标注的工具。speech-aligner, is a tool that generate phoneme-level alignment between human speech an…☆15Dec 19, 2018Updated 7 years ago
- Usings LLM chat with knowledges☆21Aug 12, 2023Updated 2 years ago
- Layer normalization in PyTorch☆20Jun 6, 2020Updated 5 years ago
- A lightweight audio codec based on a single quantizer☆69Aug 15, 2025Updated 8 months ago
- Pre-trained grapheme-to-phoneme (G2P) models☆26Jul 27, 2021Updated 4 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- A simple sync tool to sync task from Workflowy to Teambition☆32Oct 4, 2017Updated 8 years ago
- vLLM for embedding tasks using Original LLMs (Qwen2, LLaMA)☆29Sep 9, 2024Updated last year
- a unity-package allows to make annotations on arbitrary Unity-scenes of architectural sites☆15Dec 11, 2017Updated 8 years ago
- [ACL 2026 Main] Open-Ended Speaking Style Modeling via Fine-Grained and Multi-Granular Contrastive Language-Speech Pre-training☆71Apr 6, 2026Updated last week
- Play and solve a 3x3x3 Rubik's cube with Thistlethwaite's algorithm in Unity C#. (42 Silicon Valley)☆12Apr 20, 2019Updated 6 years ago
- ☆10May 21, 2021Updated 4 years ago
- The open source code of ALMTokenizer2: Towards Low bit-rate and Semantic-rich Audio Tokenizer with Flow-based Scalar Diffusion Transforme…☆45Sep 5, 2025Updated 7 months ago
- MMD Camera Path Vmd's File For Unity3D☆12Sep 19, 2017Updated 8 years ago
- ☆15Feb 6, 2026Updated 2 months ago
- Bare Metal GPUs on DigitalOcean Gradient AI • AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- [INTERSPEECH 2025 Oral]Official code for "Accelerating Diffusion-based Text-to-Speech Model Training with Dual Modality Alignment"☆64Jun 16, 2025Updated 10 months ago
- 4-layer(RGBA) parallax hair shader☆15May 8, 2018Updated 7 years ago
- URP14 Shadow Sample☆13Nov 27, 2023Updated 2 years ago
- Official implementation of paper: Frame-Wise Breath Detection with Self-Training: An Exploration of Enhancing Breath Naturalness in Text-…☆41Sep 18, 2024Updated last year
- Convert claude to chatgpt form api through Slack☆15Jun 7, 2023Updated 2 years ago
- Guides to hopefully simplify the process of using ROCm.☆12Sep 26, 2024Updated last year
- A LOFTER third-party APP developed based on Flutter.基于Flutter开发的LOFTER第三方APP☆81Oct 27, 2025Updated 5 months ago
- 基于随机森林和条件 随机场的中文韵律预测模型☆28Jul 25, 2024Updated last year
- Simple static blog written in Go, packaged in one binary.☆22Oct 26, 2022Updated 3 years ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- Vin Carnival is a non-profit open source virtual reality game made with Unity 3D game engine and GoogleVR.☆10Oct 3, 2018Updated 7 years ago
- #WORK IN PROGRESS PyTorch Implementation of Supervised and Deep Q-Learning EWC(Elastic Weight Consolidation), introduced in "Overcoming C…☆30May 25, 2018Updated 7 years ago
- ☆14Jul 30, 2019Updated 6 years ago
- The evaluation code for MultiIF multi-turn and multi-lingual instruction following☆63Oct 29, 2024Updated last year
- This project allow to construct and edit a node graph in a Unity game in playmode (at runtime)☆15Nov 6, 2022Updated 3 years ago
- 传送门☆13Dec 13, 2018Updated 7 years ago
- Node graph editor framework focused on data processing using Unity UIElements and C# 4.6☆14Aug 13, 2024Updated last year