A Python toolkit for file processing, text cleaning and data splitting. 文件处理,文本清洗和数据划分的python工具包。
☆35Oct 18, 2022Updated 3 years ago
Alternatives and similar repositories for Takin
Users that are interested in Takin are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- This repository contains datasets (including testing set) for EMNLP-IJCNLP 2019 paper "BiPaR: A Bilingual Parallel Dataset for Multilingu…☆23Jul 13, 2021Updated 4 years ago
- Simple Transformers四种任务(分类、命名实体识别、机器阅读理解、语言模型微调)的代码样例,可以切换多种预训练模型。☆23Jun 7, 2022Updated 3 years ago
- Source code and dataset for the paper "GECOR: An End-to-End Generative Ellipsis and Co-reference Resolution Model for Task-Oriented Dialo…☆30Jul 22, 2023Updated 2 years ago
- TXT文本语料数据清洗(Text corpus data cleaning):1> 合并TXT文件;2> 过滤干扰字符串;3> 对人名、地名、组织机构进行遮码处理;4> 将其他编码格式统一转换为UTF-8☆19Oct 14, 2022Updated 3 years ago
- MNBVC项目-ShareGPT语料清洗☆15Oct 4, 2023Updated 2 years ago
- 开源QG系统(Question Generation,问题生成),基于Pytorch和Transformer编写☆55Jul 25, 2024Updated last year
- Extract Chinese/English QA Data from WikiHow pages.☆16May 21, 2023Updated 2 years ago
- 基于中文 GPT2 预训练模型的语句困惑度计算☆15Apr 20, 2023Updated 2 years ago
- NLP预/后处理工具。☆30Mar 31, 2025Updated 11 months ago
- Usings LLM chat with knowledges☆21Aug 12, 2023Updated 2 years ago
- code for ACL 2019 paper "cross lingual training for automatic question generation"☆14Jun 30, 2019Updated 6 years ago
- CamRest676 is an English data set, I translate it into Chinese for training nlu.☆12Dec 20, 2017Updated 8 years ago
- The wizard of oz code used for collecting goal-oriented dialogue systems☆13Oct 30, 2017Updated 8 years ago
- implement a RNN model of DSTC2 task☆16Jan 25, 2019Updated 7 years ago
- Neural Paraphrase Generation based on OpenNMT-py☆12Jan 2, 2018Updated 8 years ago
- Examples about using MGeo finetune models☆55Feb 9, 2023Updated 3 years ago
- This repository contains code and models for the paper: Semantic Graphs for Generating Deep Questions (ACL 2020).☆65Jan 20, 2024Updated 2 years ago
- Stochastic Answer Networks (SAN) for Machine Reading Comprehension☆149Nov 26, 2018Updated 7 years ago
- 基于Python爬虫技术的中国知网(CNKI)文献检索与下载程序,能够便利文献的检索与信息下载!☆16Jun 18, 2023Updated 2 years ago
- A Fast(er) and Accurate Syntactic Parsing by Exacter Searching.☆17Jul 25, 2024Updated last year
- DocQues answers queries on longer and multiple documents build on GPT-Index and GPT-3☆13Jan 1, 2023Updated 3 years ago
- An easy-to-use sequence labeling project(get SoA on ATIS data) with pytorch☆15Nov 21, 2018Updated 7 years ago
- A demonstration of how to train a custom tokenizer similar to TikToken.☆15Jan 6, 2025Updated last year
- 深度学习和NLP随笔☆27Jun 17, 2019Updated 6 years ago
- 以京东评论作为数据集,使用常见的机器学习算法如KNN、SVM、逻辑回归、贝叶斯、xgboost等等算法进行分类。使用深度学习中的CNN、RNN、CNN和RNN连接、Bi-GRU、bert模型进行分类。使用fastnlp的框架搭建文本分类。☆31Jul 2, 2020Updated 5 years ago
- 飞桨常规赛:中文新闻文本标题分类9月第1名方案,分数0.9+,基于PaddleNLP通过预训练模型的微调完成新闻14分类模型的训练与优化☆19Oct 15, 2021Updated 4 years ago
- Batch processor to enable large content be digested by Ollama, focused around book processing and translations by default, fully, configu…☆36Oct 27, 2025Updated 4 months ago
- A fully featured piano with multiplayer support☆14Jul 26, 2025Updated 7 months ago
- When hosted with heroku, it can be used to proxy ssl url to your fb application.☆13Jan 29, 2015Updated 11 years ago
- OKX API Interface Resender Server☆19Feb 28, 2024Updated 2 years ago
- Top-Down BTG-based Preordering☆16Jan 14, 2016Updated 10 years ago
- sailVina用于Linux的反向对接脚本☆10Feb 14, 2021Updated 5 years ago
- The source code of the paper 'Dynamic Knowledge Routing Network For Target-Guided Open-Domain Conversation'☆24Mar 24, 2023Updated 2 years ago
- Sparse Multilabel Categorical Crossentropy☆11Sep 10, 2023Updated 2 years ago
- Analyzing knowledge graph embedding methods, including TransE, DistMult, CP, SimplE, ComplEx, Quaternion☆28May 23, 2023Updated 2 years ago
- pytorch版基于gpt+nezha的中文多轮Cdial☆12Oct 22, 2022Updated 3 years ago
- Facial Landmark Detection using OpenCV and Mediapipe☆12Jul 4, 2022Updated 3 years ago
- Use LLM vision to scan receipts and sync to Feishu/Google Sheets☆18Dec 5, 2024Updated last year
- English-French MT dialogue dataset☆17Apr 29, 2022Updated 3 years ago