pinyintokenizer, 拼音分词器,将连续的拼音切分为单字拼音列表。
☆31Feb 5, 2025Updated last year
Alternatives and similar repositories for pinyin-tokenizer
Users that are interested in pinyin-tokenizer are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- 🌈 NERpy: Implementation of Named Entity Recognition using Python. 命名实体识别工具,支持BertSoftmax、BertSpan等模型,开箱即用。☆118Feb 19, 2024Updated 2 years ago
- Chinese Sentiment Classification Tool. 情感极性分类,基于知网、清华、BosonNLP情感词典,易扩展,基准方法,开箱即用。☆104Aug 22, 2023Updated 2 years ago
- pytorch实现bert做seq2seq任务,使用unilm方案。☆10Apr 1, 2020Updated 6 years ago
- labelit, label tool with active learning, for classification task. 自动标注,基于主动学习,边标注边学习,减少人工标注量。☆31Dec 9, 2022Updated 3 years ago
- ☆23Nov 15, 2024Updated last year
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- SearchGPT: Building a quick conversation-based search engine with LLMs.☆46Jan 5, 2025Updated last year
- pke_zh, python keyphrase extraction for chinese(zh). 中文关键词或关键句提取工具,实现了KeyBert、PositionRank、TopicRank、TextRank等算法,开箱即用。☆215Mar 27, 2024Updated 2 years ago
- UCAS2020《自然语言处理》编程作业-事件抽取☆16Dec 17, 2020Updated 5 years ago
- 采用LTP(分词、词性标注、句法依存、角色标注)抽取事件三元组☆15Jul 14, 2021Updated 4 years ago
- 🤖 Open-source LLM server (OpenAI, Ollama, Groq, Anthropic) with support for HTTP, Streaming, Agents, RAG (Deprecated check out Orchestra…☆33Jun 10, 2025Updated 11 months ago
- The GitHub repository for the paper "Reinforcement Learning-based Dialogue Guided Event Extraction to Exploit Argument Relations"☆23Oct 31, 2022Updated 3 years ago
- Agentica: Lightweight async-first Python framework for AI agents. 轻量级异步优先的AI Agent框架,支持工具调用、RAG、多智能体和MCP。☆306Updated this week
- A simple traditional Chinese to simplified Chinese converter☆14Apr 6, 2020Updated 6 years ago
- NLP之病历分析:从病历文本之中提取关键信息,便于后续分析处理。☆22Feb 12, 2017Updated 9 years ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- 粤语双拼输入法 Input method for typing Chinese using Cantonese pronunciations with 2-3 keys per character, based on RIME☆11Jul 25, 2021Updated 4 years ago
- Chinese Machine Reading 2021海华AI挑战赛·中文阅读理解·技术组·第三名☆20May 27, 2021Updated 5 years ago
- ☆11Sep 1, 2024Updated last year
- This is the public repository of AAAI 2024 paper "Is a Large Language Model a Good Annotator for Event Extraction"☆10Feb 16, 2024Updated 2 years ago
- Companion code for Awe the Audience: How the Narrative Trajectories Affect Audience Perception in Public Speaking☆14Jan 6, 2018Updated 8 years ago
- Here is a demo for PDF parser (Including OCR, object detection tools)☆36Oct 14, 2024Updated last year
- Fast instruction tuning with Llama2☆11Apr 8, 2024Updated 2 years ago
- An English dictionary.☆10May 31, 2016Updated 9 years ago
- 早期的计算机使用7位的ASCII编码,为了处理汉字,程序员设计了用于简体中文的GB2312和用于繁体中文的big5。 GB2312(1980年)一共收录了7445个字符,包括6763个汉字和682个其它符号。汉字区的内码范围高字节从B0-F7,低字节从A1-FE,占用的码…☆10Sep 10, 2017Updated 8 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Official source code for the paper "Tailored Design of Audio-Visual Speech Recognition Models using Branchformers"☆14Feb 24, 2025Updated last year
- 基于IK中文分词器,添加同义词功能☆13Feb 24, 2018Updated 8 years ago
- A searchable Chinese / English dictionary with helpful utilities.☆12Feb 24, 2024Updated 2 years ago
- An unofficial (PyTorch) implementation for the paper Deep Lip Reading: A comparison of models and an online application.☆10May 13, 2020Updated 6 years ago
- 四级、六级、考研、雅思考试词频统计程序☆12Dec 22, 2024Updated last year
- "DeepResearch-Eval: An End-to-End Evaluation Framework for DeepResearch Systems"☆45Oct 16, 2025Updated 7 months ago
- 给朗读课文添加拼音☆14Apr 9, 2018Updated 8 years ago
- Source code for ACL 2021 paper "CLEVE: Contrastive Pre-training for Event Extraction"☆83Nov 24, 2022Updated 3 years ago
- 一个小工具,目前实现的功能包括:①台词截图拼接;②横向图片拼接;③批量解压缩;④图片转pdf文件。☆16Dec 21, 2021Updated 4 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- 【今日头条】文本作者身份识别比赛☆10Aug 20, 2018Updated 7 years ago
- Precision Knowledge Editing (PKE): A novel method to reduce toxicity in LLMs while preserving performance, with robust evaluations and ha…☆11Nov 26, 2024Updated last year
- A Pytorch (support batch and channel) implementation of GoogleBrain's SpecAugment: A Simple Data Augmentation Method for Automatic Speech…☆12Jul 24, 2024Updated last year
- Deep Variational Information Bottleneck (DVIB) in PyTorch.☆10Apr 25, 2020Updated 6 years ago
- An implementation of MSSRM method☆10Mar 23, 2023Updated 3 years ago
- ☆25Dec 6, 2022Updated 3 years ago
- 这是我的深度强化学习的学习笔记与总结☆75Mar 18, 2026Updated 2 months ago