中文对话数据清洗
☆32Nov 8, 2022Updated 3 years ago
Alternatives and similar repositories for dial-clean
Users that are interested in dial-clean are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- A framework for cleaning Chinese dialog data☆273May 14, 2021Updated 4 years ago
- Offical code repository for PromptMix: A Class Boundary Augmentation Method for Large Language Model Distillation, EMNLP 2023☆12Dec 13, 2023Updated 2 years ago
- Official implementation of BandPO: Bridging Trust Regions and Ratio Clipping via Probability-Aware Bounds for LLM Reinforcement Learning.…☆44Mar 9, 2026Updated 2 weeks ago
- Code release for "TinySpeech: Attention Condensers for Deep Speech Recognition Neural Networks on Edge Devices"☆21Jun 7, 2025Updated 9 months ago
- 受到self-instruct启发,除了通用LLM还能做垂直领域的小LLM实现定制效果,通过GPT获得question和answer来作为训练数据☆18May 12, 2023Updated 2 years ago
- Learn how to create impactful AI Agents using Agno AI Python Package☆13Jul 31, 2025Updated 7 months ago
- ☆13Feb 17, 2025Updated last year
- Code for ACL22 short Paper "Hierarchical Curriculum Learning for AMR Parsing"☆13Jun 1, 2022Updated 3 years ago
- This repo is the artifact of FUEL☆13Dec 2, 2025Updated 3 months ago
- ☆18Sep 8, 2021Updated 4 years ago
- Magic Sky Replace Software Implemented by Pytorch.☆10Apr 16, 2021Updated 4 years ago
- ArtSpeech: Adaptive Text-to-Speech Synthesis with Articulatory Representations☆21Sep 21, 2025Updated 6 months ago
- The official repos of "Rethinking Multi-view Representation Learning via Distilled Disentangling"☆12Apr 3, 2024Updated last year
- [ACL 2023] This is the code repo for our ACL'23 paper "Augmentation-Adapted Retriever Improves Generalization of Language Models as Gener…☆60Jul 12, 2024Updated last year
- ☆10Jan 28, 2024Updated 2 years ago
- ☆10Mar 8, 2026Updated 2 weeks ago
- CTC decoder with hotwords for ASR.☆35Apr 13, 2025Updated 11 months ago
- [ASRU 2025] Omni-R1: Do You Really Need Audio to Fine-Tune Your Audio LLM?☆44Nov 21, 2025Updated 4 months ago
- Creating the DeepSeek V3 model from scratch☆26Mar 28, 2025Updated 11 months ago
- Age and gender classification is a dual-task of identifying the age and gender of a person from an image or video.☆12Apr 16, 2019Updated 6 years ago
- The system of SUDA-HUAWEI submitted at CAMR2022.☆11Nov 22, 2022Updated 3 years ago
- ☆12Jan 19, 2026Updated 2 months ago
- Incremental Disentanglement for Environment-Aware Zero-Shot Text-to-Speech Synthesis☆27Mar 21, 2025Updated last year
- CCL2024 Chinese Essay Rhetoric Recognition and Understanding☆17Oct 1, 2024Updated last year
- Using PCA, Autoencoder and Fisher linear discriminant to extract the effective representations from the face images. Do the reconstructio…☆12Apr 23, 2019Updated 6 years ago
- Webinar content for Empower LLMs with Knowledge Graphs☆12May 17, 2024Updated last year
- Towards Real-World Writing Assistance: A Chinese Character Checking Benchmark with Faked and Misspelled Characters☆16May 30, 2024Updated last year
- 2021科大讯飞试题标签预测挑战赛亚军方案☆12Dec 4, 2021Updated 4 years ago
- Code for paper "Open-Domain Hierarchical Event Schema Induction by Incremental Prompting and Verification"☆16Jul 4, 2023Updated 2 years ago
- This is the repo which record the evolution of LM-based dialogue system. More details can be found in our original survey paper: A Survey…☆63Apr 11, 2025Updated 11 months ago
- simplest online-softmax notebook for explain Flash Attention☆16Jan 27, 2026Updated last month
- 一套代码指令微调大模型☆39Aug 1, 2023Updated 2 years ago
- Evaluating GPT-OSS on BrowseComp-Plus with Native Browsering Tools☆18Oct 17, 2025Updated 5 months ago
- 只要给物体画上一个方框,就可以在视频中去除这个物体并修复视频☆11Apr 5, 2022Updated 3 years ago
- SEED: Self-supervised Distillation for Visual Representation☆16Jul 20, 2022Updated 3 years ago
- Tokenizer POS-tagger and Dependency-parser for Classical Chinese☆19Feb 28, 2026Updated 3 weeks ago
- End-to-end real-world polyphonic piano audio-to-score transcription with hierarchical decoding (IJCAI 2024)☆41Sep 17, 2024Updated last year
- AI写作小工具方案:让2个智能体合作写出真正可用的图文并茂的帖子(微信公众号,小红书,博客)。1,写作智能体,2,知识库智能体。☆21Jun 8, 2025Updated 9 months ago
- Correcting Chinese Spelling Errors with Phonetic Pre-training 非官方实现☆39Feb 11, 2022Updated 4 years ago