KMnO4-zx/extract-dialogue

KMnO4-zx / extract-dialogue

Extract dialogue datasets from novels

☆359

Alternatives and similar repositories for extract-dialogue

Users that are interested in extract-dialogue are comparing it to the libraries listed below.

KMnO4-zx / huanhuan-chat
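Chat-甄嬛 is a chat language model that mimics Zhen Huan's manner of speaking, obtained by LoRA fine-tuning ChatGLM2 on all of Zhen Huan's lines from the script of 《甄嬛传》 (Empresses in the Palace).
☆826 · May 21, 2025 · Updated last year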
KMnO4-zx / xlab-huanhuan
☆73 · Mar 12, 2024 · Updated 2 years ago
xiaoqidaov2 / chatglm_dialogues
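Dialogue Set Extractor is a ChatGLM-based tool for extracting dialogue sets from text. It helps users automatically extract dialogue from novels, scripts, and other texts for analysis, annotation, or other applications.
☆12 · Nov 22, 2024 · Updated last year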
FormoJ / LLM-for-Users
☆10 · Jan 6, 2025 · Updated last year
datawhalechina / self-llm
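《开源大模型食用指南》 (A Guide to Consuming Open-Source Large Models): a tutorial tailored for Chinese beginners on quickly fine-tuning (full-parameter/LoRA) and deploying Chinese and international open-source large language models (LLMs) and multimodal large models (MLLMs) in a Linux environment
☆31,367 · Updated this week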
JimmyMa99 / Roleplay-with-XiYou
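A multi-LLM role-playing chat room fine-tuned from InternLM2, built on the original text of 《西游记》 (Journey to the West), its vernacular rendering, and ChatGPT-generated data. This project covers everything about role-playing LLMs, from data acquisition and data processing, to fine-tuning with XTuner and deploying to OpenXLab, to deploying with LMDeploy, with op…
☆109 · Mar 31, 2024 · Updated 2 years ago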
KMnO4-zx / paper-agent
something for paper agent
☆11 · Dec 18, 2024 · Updated last year
datawhalechina / tiny-universe
《大模型白盒子构建指南》 (A Guide to Building White-Box Large Models): a fully hand-built Tiny-Universe
☆4,973 · Feb 12, 2026 · Updated 5 months ago
lucadellalib / discrete-wavlm-codec
A neural speech codec based on discrete WavLM representations
☆26 · Aug 28, 2024 · Updated last year
JimmyMa99 / BaJie-Chat
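八戒-Chat is a chat language model that mimics Zhu Bajie's manner of speaking, obtained by QLoRA fine-tuning InternLM on all of Zhu Bajie's lines from the script of 《西游记》 (Journey to the West), together with related question results generated by Chat-GPT-3.5.
☆27 · Jul 30, 2025 · Updated 11 months ago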
Joanna0123 / character_profiling
Code and Data for the paper "Evaluating Character Understanding of Large Language Models via Character Profiling from Fictional Works".
☆22 · Jul 24, 2024 · Updated last year
cpii-cai / PunCantonese
A Benchmark Corpus for Low-Resource Cantonese Punctuation Restoration from Speech Transcripts
☆15 · Dec 3, 2024 · Updated last year
shengcanxu / canoSpeech
text to speech
☆10 · Mar 19, 2024 · Updated 2 years ago
xiaomi-research / acavcaps
☆31 · Mar 27, 2026 · Updated 3 months ago
yongaifadian1 / MNV-17
Qwen2.5-Omni fine-tuned on MNV-17 dataset for nonverbal vocalization recognition
☆31 · Nov 13, 2025 · Updated 8 months ago
limafang / agent-arxiv-daily
🎓 Automatically updates agent papers daily using GitHub Actions (updates every 12 hours). Includes Chinese translations of the abstracts.
☆37 · Updated this week
InternLM / Tutorial
LLM&VLM Tutorial
☆1,968 · Apr 22, 2026 · Updated 2 months ago
SmartFlowAI / EmoLLM
Mental health large language model (LLM x Mental Health), Pre & Post-training & Dataset & Evaluation & Deploy & RAG, with InternLM / Qwen / Baichuan / DeepSeek / M…
☆1,762 · Jun 18, 2026 · Updated last month
AXYZdong / AMchat
AM (Advanced Mathematics) Chat is a large language model that integrates advanced mathematical knowledge, exercises in higher mathematics…
☆233 · Aug 10, 2024 · Updated last year
innnky / audio-preprocessing-scripts
Scripts for automated dataset creation
☆71 · Mar 26, 2023 · Updated 3 years ago
GuoYiFantastic / IMelodist
Music large model based on InternLM2-chat.
☆22 · Dec 21, 2024 · Updated last year
LC1332 / Chat-Haruhi-Suzumiya
Chat凉宫春日 (Chat-Haruhi-Suzumiya), an open-source role-playing chatbot by Cheng Li, Ziang Leng, and others.
☆2,098 · Aug 13, 2024 · Updated last year
morning-hao / domain-self-instruct
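Inspired by self-instruct: beyond general-purpose LLMs, small domain-specific LLMs can also be built for customized results, using GPT to obtain questions and answers as training data
☆18 · May 12, 2023 · Updated 3 years ago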
a-persimmons / mcp-client-for-weather-example
An MCP client exercise: a quick example of an LLM calling a weather MCP server to look up the weather
☆15 · Apr 2, 2025 · Updated last year
NARUTO-2024 / WavBench
WavBench: Benchmarking Reasoning, Colloquialism, and Paralinguistics for End-to-End Spoken Dialogue Models
☆34 · Feb 13, 2026 · Updated 5 months ago
alibaba / vstyle
☆34 · Sep 15, 2025 · Updated 10 months ago
ZhongshuHou / MHA-DPCRN
We design a spectral compression mapping (SCM) for full-band speech enhancement, and propose a two-stage stream named MHA-DPCRN
☆24 · Jul 4, 2022 · Updated 4 years ago
dr-pato / SSGD
Code of the paper "Low-Latency Speech Separation Guided Diarization for Telephone Conversations"
☆15 · Dec 22, 2022 · Updated 3 years ago
sanbuphy / WhisperTranslator
A free tool that helps you transcribe, translate, and summarize videos in any language.
☆18 · Feb 27, 2024 · Updated 2 years ago
morecry / CharacterEval
☆301 · May 27, 2025 · Updated last year
exercise-book-yq / FreeCodec
FreeCodec: A Disentangled Neural Speech Codec with Fewer Tokens
☆24 · Sep 9, 2024 · Updated last year
ASLP-lab / M7-TTS
M7-TTS: A Mini-Scale Multilingual and Multi-Dialect Text-to-Speech Language Model with Mimi codec and Multi Token Prediction
☆20 · Mar 19, 2026 · Updated 4 months ago
limafang / tiny-graphrag
☆44 · May 9, 2025 · Updated last year
Soul-AILab / SoulX-Duplug
Plug-and-play streaming semantic VAD for real-time full-duplex spoken dialogue systems.
☆273 · Updated this week
LAZERFiring / screnc-project-backend-flask
☆16 · Oct 9, 2025 · Updated 9 months ago
reppy4620 / convnext_tts
Unofficial implementation of ConvNeXt-TTS powered by lightning
☆18 · Oct 20, 2024 · Updated last year
SocialAI-tianji / Tianji
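Building a large language model that understands social etiquette and interpersonal savvy | Covers tutorials on prompt engineering, RAG, Agents, and LLM fine-tuning
☆1,803 · Apr 29, 2025 · Updated last year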
YangXusheng-yxs / CodecFormer_5Hz
☆35 · Oct 23, 2025 · Updated 8 months ago
wux-labs / OpenXLab-IntelligentSalesAssistant
☆19 · Jun 21, 2024 · Updated 2 years ago