术语词典数据集/分词词典/专业词表语料库/词汇知识库/领域词表下载/主题词表/词库/自然语言处理/数据挖掘/深度学习
☆30Mar 4, 2025Updated last year
Alternatives and similar repositories for Word_list_dataset_terminology
Users that are interested in Word_list_dataset_terminology are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- 百度百科学者词条、知网学者和中文论文元数据开源数据集☆19Jun 4, 2020Updated 5 years ago
- 将报表数据转换格式并入库时遇到许多重复性工作,于是用Python写了一些脚本进行自动化处理,并用PySide2做了GUI界面,做成了一个工具合集☆10Sep 29, 2021Updated 4 years ago
- 中国知网论文数据集,24000+篇论文信息。自然语言处理、信息管理、文本分类、文本摘要、关键词抽取、研究热点分析、数据挖掘、数据分析☆53Mar 4, 2025Updated last year
- 通过图数据库neo4j和ChatGPT的联动合作,实现将自然语言的医疗知识材料形成知识图谱☆10May 23, 2025Updated 11 months ago
- 英文文献的《中国图书馆分类法》自动标注小程序☆12Oct 29, 2024Updated last year
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- The official GitHub repository for AC-EVAL, an ancient Chinese evaluation suite for large language models (LLMs)☆16Nov 12, 2024Updated last year
- 人文历史知识图谱 三元组涵盖历史/文学/地理/军事/政治/艺术/科学技术史等学科领域 人物关系网络☆19Sep 4, 2025Updated 8 months ago
- 中文的 Bert+BiLSTM+CRF 命名实体识别任务☆16Jan 24, 2024Updated 2 years ago
- An evaluation bentchmark for classical Chinese☆19Dec 13, 2023Updated 2 years ago
- y-trainerY-Trainer 是一个LLM模型微调训练框架。 📊 核心优势: 📉 精准对抗过拟合: 专门优化,有效解决SFT中的过拟合难题。 🧩 突破遗忘瓶颈: 无需依赖通用语料,即可卓越地保留模型的泛化能力,守住核心竞争力的同时实现专项提升!🏆☆43Mar 3, 2026Updated 2 months ago
- A collection of datafiles created from the library of congress open data dump☆20May 19, 2017Updated 8 years ago
- Neural Topic Model via Optimal Transport, ICLR 2021☆16Mar 31, 2021Updated 5 years ago
- 中文恶意网页检测数据集与检测方法☆21Mar 4, 2025Updated last year
- 中文命名实体识别 | English NER☆19Jul 28, 2019Updated 6 years ago
- GPUs on demand by Runpod - Special Offer Available • AdRun AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
- NLP NER datasets video/music/book bio☆90Jan 3, 2021Updated 5 years ago
- The ACL RD-TEC 2.0: A corpus of annotated terms in context from domain of computational linguistics☆24Oct 7, 2016Updated 9 years ago
- ☆23Jun 2, 2019Updated 6 years ago
- ACTER is a manually annotated dataset for term extraction, covering 3 languages (English, French, and Dutch), and 4 domains (corruption, …☆24Apr 8, 2022Updated 4 years ago
- ☆13Jun 16, 2021Updated 4 years ago
- mSimCSE: Multilingual SimCSE☆33Nov 14, 2022Updated 3 years ago
- Pre-processing and training scripts for WMT 2017 ZH-EN translation task☆40Jun 7, 2020Updated 5 years ago
- DomainWordsDict, Chinese words dict that contains more than 68 domains, which can be used as text classification、knowledge enhance task。…☆761Aug 30, 2021Updated 4 years ago
- Terminology Dataset☆24Feb 27, 2020Updated 6 years ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- JurisLMs: Jurisprudential Language Models☆22Jul 1, 2023Updated 2 years ago
- Repository containing the group project Wind Power Forecasting for DTU's 02456 Deep Learning.☆13Apr 7, 2022Updated 4 years ago
- 基于BERT和指针网络构建实体抽取任务☆14Aug 2, 2020Updated 5 years ago
- Category Theory for Quantum Natural Language Processing☆11Feb 22, 2023Updated 3 years ago
- Benchmark dataset for the evaluation of scientific article representations on the task of citation recommendation across various scientif…☆12Oct 21, 2022Updated 3 years ago
- Scalable Quantum Neural Network builds and trains a large-scale QNN in a modular fashion. SQNN is evaluated with a binary classification …☆12Oct 4, 2023Updated 2 years ago
- 🐛 Web Apps for Boilerplate