pinyintokenizer: a pinyin tokenizer that splits a continuous pinyin string into a list of single-character pinyin syllables.
☆31 Feb 5, 2025 Updated last year
Alternatives and similar repositories for pinyin-tokenizer
Users that are interested in pinyin-tokenizer are comparing it to the libraries listed below.
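- 🌈 NERpy: Implementation of Named Entity Recognition using Python. A named entity recognition tool supporting models such as BertSoftmax and BertSpan, ready to use out of the box. ☆118 Feb 19, 2024 Updated 2 years ago
- ☆11 Oct 9, 2022 Updated 3 years ago
- Chinese Sentiment Classification Tool. Sentiment polarity classification based on the HowNet, Tsinghua, and BosonNLP sentiment lexicons; easily extensible, a baseline method, ready to use out of the box. ☆104 Aug 22, 2023 Updated 2 years ago
- PyTorch implementation of BERT for seq2seq tasks, using the UniLM approach. ☆10 Apr 1, 2020 Updated 6 years ago
- labelit, label tool with active learning, for classification task. Automatic labeling based on active learning: it learns while you label, reducing the amount of manual annotation. ☆31 Dec 9, 2022 Updated 3 years ago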
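- ☆23 Nov 15, 2024 Updated last year
- Repository for initial POC NLP based SQL adapter using LLM. ☆10 May 6, 2025 Updated last year
- pke_zh, python keyphrase extraction for chinese(zh). A Chinese keyword and key-sentence extraction tool implementing KeyBert, PositionRank, TopicRank, TextRank, and other algorithms, ready to use out of the box. ☆215 Mar 27, 2024 Updated 2 years ago
- multilabel categorical crossentropy ☆15 Apr 26, 2020 Updated 6 years ago
- auto push daily news with ai ☆13 Updated this week
- UCAS 2020 "Natural Language Processing" programming assignment: event extraction ☆16 Dec 17, 2020 Updated 5 years ago
- 🤖 Open-source LLM server (OpenAI, Ollama, Groq, Anthropic) with support for HTTP, Streaming, Agents, RAG (Deprecated check out Orchestra… ☆33 Jun 10, 2025 Updated 10 months ago
- A reasoning model for the network security penetration-testing domain, built by black-box distillation from DeepSeek-R1. It can efficiently handle cybersecurity competitions held without internet access. The introduction is now complete; if images fail to load, check that your proxy is working. English dataset updated 2025.5.14. ☆74 May 21, 2025 Updated 11 months ago
- ☆15 Nov 19, 2018 Updated 7 years ago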
- 🗲 A high-performance on-disk dictionary. ☆29 Dec 4, 2025 Updated 5 months ago
- Must-memorize: 8000 spoken English sentences ☆12 Jul 21, 2022 Updated 3 years ago
- Agentica: Lightweight async-first Python framework for AI agents, supporting tool calling, RAG, multi-agent setups, and MCP. ☆291 Apr 29, 2026 Updated last week
- A tiny script to convert your mdx dictionary file to CSV ☆11 Dec 22, 2018 Updated 7 years ago
- Service for Bert model to Vector. An efficient text-to-vector service supporting multiple GPUs, multiple workers, and multi-client calls, ready to use out of the box. ☆13 May 24, 2022 Updated 3 years ago
- Chinese Machine Reading: 2021 Haihua AI Challenge, Chinese reading comprehension, technical track, third place ☆20 May 27, 2021 Updated 4 years ago
- ☆11 Sep 1, 2024 Updated last year
- This is the public repository of AAAI 2024 paper "Is a Large Language Model a Good Annotator for Event Extraction" ☆10 Feb 16, 2024 Updated 2 years ago
- Companion code for Awe the Audience: How the Narrative Trajectories Affect Audience Perception in Public Speaking ☆14 Jan 6, 2018 Updated 8 years ago
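- Here is a demo for PDF parser (Including OCR, object detection tools) ☆36 Oct 14, 2024 Updated last year
- ACE2005 Chinese dataset processing (Chinese information extraction task) ☆21 Jul 11, 2021 Updated 4 years ago
- Fast instruction tuning with Llama2 ☆11 Apr 8, 2024 Updated 2 years ago
- Early computers used 7-bit ASCII encoding. To handle Chinese characters, programmers designed GB2312 for Simplified Chinese and Big5 for Traditional Chinese. GB2312 (1980) contains 7445 characters in total, including 6763 Chinese characters and 682 other symbols. In the Chinese character area, the internal code ranges from B0 to F7 for the high byte and from A1 to FE for the low byte, occupying the code… ☆10 Sep 10, 2017 Updated 8 years ago
- ☆11 Sep 8, 2017 Updated 8 years ago
- Official source code for the paper "Tailored Design of Audio-Visual Speech Recognition Models using Branchformers" ☆14 Feb 24, 2025 Updated last year
- Adds synonym support on top of the IK Chinese word segmenter ☆13 Feb 24, 2018 Updated 8 years ago
- A searchable Chinese / English dictionary with helpful utilities. ☆12 Feb 24, 2024 Updated 2 years ago
- A bash interface to get English<->Chinese translation based on www.iciba.com (a simple command-line tool for directly looking up translations of Chinese and English words or phrases) ☆19 May 13, 2017 Updated 8 years ago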
- Source code for ACL 2021 paper "CLEVE: Contrastive Pre-training for Event Extraction" ☆83 Nov 24, 2022 Updated 3 years ago
- Random program generator for Python ☆10 Jun 20, 2013 Updated 12 years ago
- Popularity analysis of blog posts on the CNBlog homepage ☆10 May 10, 2016 Updated 9 years ago
- A small utility whose current features include: ① stitching subtitle screenshots; ② horizontal image stitching; ③ batch decompression; ④ converting images to PDF files. ☆16 Dec 21, 2021 Updated 4 years ago
- [Toutiao] Text author identification competition ☆10 Aug 20, 2018 Updated 7 years ago
- Precision Knowledge Editing (PKE): A novel method to reduce toxicity in LLMs while preserving performance, with robust evaluations and ha… ☆11 Nov 26, 2024 Updated last year
- Extraction and Interface for Oxford English Dictionary (OED) 1st Edition ☆22 Aug 29, 2013 Updated 12 years ago