Chinese dialog data cleaning
☆32Nov 8, 2022Updated 3 years ago
Alternatives and similar repositories for dial-clean
Users that are interested in dial-clean are comparing it to the libraries listed below.
- A framework for cleaning Chinese dialog data☆273May 14, 2021Updated 4 years ago
- Official code repository for PromptMix: A Class Boundary Augmentation Method for Large Language Model Distillation, EMNLP 2023☆12Dec 13, 2023Updated 2 years ago
- Code release for "TinySpeech: Attention Condensers for Deep Speech Recognition Neural Networks on Edge Devices"☆22Jun 7, 2025Updated 10 months ago
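- Inspired by self-instruct: beyond general-purpose LLMs, small vertical-domain LLMs can also be built for customized results, using GPT to obtain questions and answers as training data☆18May 12, 2023Updated 2 years ago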
- PyTorch classification networks: Python training, testing, and model conversion, plus Windows LibTorch C++ deployment☆19Sep 16, 2021Updated 4 years ago
- Official implementation of BandPO: Bridging Trust Regions and Ratio Clipping via Probability-Aware Bounds for LLM Reinforcement Learning.…☆48Apr 8, 2026Updated 3 weeks ago
- An implementation of Jasper, QuartzNet, Citrinet and pipeline for training CTC-based ASR models☆12Nov 13, 2021Updated 4 years ago
- Baidu dataset for Chinese entity recognition and entity disambiguation; competition URL☆24Mar 16, 2020Updated 6 years ago
- Code for ACL22 short Paper "Hierarchical Curriculum Learning for AMR Parsing"☆13Jun 1, 2022Updated 3 years ago
- WeChat official account☆10Jul 24, 2023Updated 2 years ago
- ArtSpeech: Adaptive Text-to-Speech Synthesis with Articulatory Representations☆21Sep 21, 2025Updated 7 months ago
- ☆13Jun 3, 2023Updated 2 years ago
- An extension of thu-spmi/CAT which contains a full-fledged implementation of CTC-CRF for Tensorflow.☆12Jul 5, 2021Updated 4 years ago
- CTC decoder with hotwords for ASR.☆35Apr 13, 2025Updated last year
- KnowLA: Enhancing Parameter-efficient Finetuning with Knowledgeable Adaptation, NAACL 2024☆16Jul 29, 2024Updated last year
- Chinese Characters Visualization & Chinese Text Augmentation.☆17Sep 19, 2022Updated 3 years ago
- Synth-Empathy: Towards High-Quality Synthetic Empathy Data☆18Feb 28, 2025Updated last year
- The system of SUDA-HUAWEI submitted at CAMR2022.☆12Nov 22, 2022Updated 3 years ago
- ☆13Apr 3, 2026Updated last month
- CCL2024 Chinese Essay Rhetoric Recognition and Understanding☆17Oct 1, 2024Updated last year
- [NeurIPS 2022] "Adversarial Training with Complementary Labels: On the Benefit of Gradually Informative Attacks"☆13Nov 11, 2022Updated 3 years ago
- Towards Real-World Writing Assistance: A Chinese Character Checking Benchmark with Faked and Misspelled Characters☆16May 30, 2024Updated last year
- Runner-up solution for the 2021 iFLYTEK Exam Question Label Prediction Challenge☆13Dec 4, 2021Updated 4 years ago
- The official GitHub repository for AC-EVAL, an ancient Chinese evaluation suite for large language models (LLMs)☆16Nov 12, 2024Updated last year
- ☆25Oct 15, 2025Updated 6 months ago
- ☆12Feb 16, 2024Updated 2 years ago
- Chinese corpus cleaning and quality evaluation for large-model pre-training☆81Jul 25, 2024Updated last year
- This repo records the evolution of LM-based dialogue systems. More details can be found in our original survey paper: A Survey…☆63Apr 11, 2025Updated last year
- Laravel collection of web APIs for maps of China☆13Apr 27, 2023Updated 3 years ago
- Instruction fine-tuning of large models with a single codebase☆39Aug 1, 2023Updated 2 years ago
- Evaluating GPT-OSS on BrowseComp-Plus with Native Browsing Tools☆20Oct 17, 2025Updated 6 months ago
- Tokenizer POS-tagger and Dependency-parser for Classical Chinese☆19Apr 13, 2026Updated 2 weeks ago
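- AI writing tool approach: two agents collaborate to write genuinely usable, richly illustrated posts (WeChat official accounts, Xiaohongshu, blogs): (1) a writing agent, (2) a knowledge-base agent.☆21Jun 8, 2025Updated 10 months ago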
- End-to-end real-world polyphonic piano audio-to-score transcription with hierarchical decoding (IJCAI 2024)☆41Sep 17, 2024Updated last year
- QA system based on Medical knowledge graph☆15Jun 26, 2019Updated 6 years ago
- ☆14Mar 11, 2024Updated 2 years ago
- Implementation of the paper 'Improve Discourse Dependency Parsing with Contextualized Representations', Findings of NAACL 2022☆14Jul 15, 2022Updated 3 years ago
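- Large-model application development: hands-on AI Agent building, covering GPT large language model applications, intelligent agents, and LangChain development in practice☆38Nov 16, 2024Updated last year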
- [npj Digital Medicine] An In-Depth Evaluation of Federated Learning on Biomedical Natural Language Processing for Information Extraction☆12May 1, 2024Updated 2 years ago