Corpus dataset for Classical Chinese (文言文/古文) translation
☆51 · Oct 14, 2020 · Updated 5 years ago
Alternatives and similar repositories for CCTC
Users that are interested in CCTC are comparing it to the libraries listed below.
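- A very comprehensive parallel corpus of Classical Chinese (文言文/古文) and Modern Chinese ☆1,424 · Apr 21, 2024 · Updated last year
- Parallel corpus for Classical Chinese to Modern Chinese translation ☆114 · Jan 12, 2022 · Updated 4 years ago
- Crawler for the Baidu Hanyu dictionary, with pinyin data and 350,000 Baidu dictionary entries. ☆28 · Sep 5, 2022 · Updated 3 years ago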
- Code for NAACL 2022 main conference paper "Bi-SimCut: A Simple Strategy for Boosting Neural Machine Translation" ☆12 · May 8, 2023 · Updated 2 years ago
- Dataset for TALLIP2019 paper "Ancient-Modern Chinese Translation with a New Large Training Dataset" ☆25 · Jul 8, 2022 · Updated 3 years ago
- Large-scale Chinese corpus ☆44 · Nov 5, 2019 · Updated 6 years ago
- A dataset used for NLP tasks. ☆10 · Apr 17, 2021 · Updated 4 years ago
- ☆40 · Aug 21, 2021 · Updated 4 years ago
- Named entity recognition for Classical Chinese, based on BiLSTM+CRF; recognized entity types include persons, locations, organizations, and times. ☆10 · Jan 19, 2021 · Updated 5 years ago
- Dictionaries for StarCC, the next generation of Simplified-Traditional Chinese conversion framework ☆12 · Jun 20, 2022 · Updated 3 years ago
- ☆20 · Apr 24, 2024 · Updated last year
- AnchiBERT: A Pre-Trained Model for Ancient Chinese Language Understanding and Generation ☆71 · Jul 16, 2021 · Updated 4 years ago
- SikuBERT: Pre-training Model of Siku Quanshu (四库全书) ☆154 · Jul 30, 2023 · Updated 2 years ago
- Chinese-native industrial evaluation benchmark ☆15 · Mar 21, 2024 · Updated 2 years ago
- Code for ACL 2022 paper 'Towards Making the Most of Cross-Lingual Transfer for Zero-Shot Neural Machine Translation' ☆12 · Jun 7, 2024 · Updated last year
- The augmented data of the paper "Parallel Data Augmentation for Formality Style Transfer" (ACL 2020). ☆12 · May 14, 2020 · Updated 5 years ago
- Official Code and data for ACL 2024 finding, "An Empirical Study on Parameter-Efficient Fine-Tuning for MultiModal Large Language Models" ☆25 · Nov 10, 2024 · Updated last year
- Ancient Chinese Corpus with Word Sense Annotation ☆64 · May 29, 2024 · Updated last year
- GuwenBERT: A Pre-trained Language Model for Classical Chinese (Literary Chinese) ☆558 · Aug 31, 2021 · Updated 4 years ago
- Source code of our paper "Focus on the Target’s Vocabulary: Masked Label Smoothing for Machine Translation" @ACL-2022 ☆18 · May 19, 2022 · Updated 3 years ago
- Attention based dialog embedding for dialog breakdown detection (in DSTC6 task 3) ☆13 · Feb 11, 2018 · Updated 8 years ago
- Extracting image features with the SIFT algorithm in Python ☆11 · Mar 29, 2017 · Updated 8 years ago
- A seq2seq-based chat system using LSTM/GRU with an attention mechanism, built with PyTorch. ☆12 · Apr 9, 2019 · Updated 6 years ago
- This is the official repository for paper: cross-modal information flow in multimodal large language models ☆42 · May 21, 2025 · Updated 10 months ago
- Repository for AAAI 2018 paper "Using Syntax for Referring Expression Recognition" ☆13 · Oct 7, 2020 · Updated 5 years ago
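- 🔥 A natural language processing framework focused on Chinese: Chinese word segmentation; class balancing; dataset splitting... ☆12 · Nov 14, 2020 · Updated 5 years ago
- ChineseDiachronicCorpus: a diachronic Chinese corpus spanning more than sixty years, including Tencent news (2000-2016), People's Daily (人民日报, 1946-2003), and Reference News (参考消息, 1957-2002). Built on diachronic circulating texts, it can provide foundational corpus support for computing diachronic language change, language monitoring, and research on sociocultural change… ☆23 · Jan 10, 2021 · Updated 5 years ago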
- A Large-Scale Dataset for Long Text and Multi-Table Summarization ☆18 · Feb 21, 2024 · Updated 2 years ago
- Tally Marks OpenType-SVG Font ☆16 · Dec 3, 2019 · Updated 6 years ago
- Implementation of "Investigating the Factual Knowledge Boundary of Large Language Models with Retrieval Augmentation" ☆21 · Jul 31, 2023 · Updated 2 years ago
- YiZhao: A 2TB Open Financial Corpus. Data and tools for generating and inspecting YiZhao, a safe, high-quality, open-source bilingual fin… ☆38 · Jul 11, 2025 · Updated 8 months ago
- Simple converter for dds<->tex files, works at least for all files in League of Legends currently ☆33 · Dec 8, 2025 · Updated 3 months ago
- Code for "The Expressive Power of Low-Rank Adaptation". ☆20 · Apr 19, 2024 · Updated last year
- ☆35 · May 31, 2019 · Updated 6 years ago
- A frozen version of angr for the SAILR paper ☆16 · Sep 4, 2024 · Updated last year
- TAT-LLM: A Specialized Language Model for Discrete Reasoning over Tabular and Textual Data ☆28 · Feb 28, 2024 · Updated 2 years ago
- NExT-GPT: Any-to-Any Multimodal Large Language Model ☆20 · Nov 3, 2024 · Updated last year
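- Database and API of classical Chinese poetry and prose. Contains 10,000 classical works (shi, ci, ge, fu, and other forms of Classical Chinese writing), nearly 4,000 authors, and 10,000 famous lines ☆524 · Aug 15, 2024 · Updated last year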
- ☆19 · Jul 21, 2025 · Updated 8 months ago