microsoft unilm-v1 compatible with huggingface transformers
☆21Apr 5, 2023Updated 3 years ago
Alternatives and similar repositories for transformers-unilm
Users that are interested in transformers-unilm are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- 利用BERT预训练模型进行文本生成,可用于对话、摘要、问题生成等任务。 目前支持策略,词表的插入和删除、自定义Character Embedding、随机词替换等☆10Jun 1, 2022Updated 4 years ago
- 本仓库是基于bert4keras实现的古文-现代文翻译模型。具体使用了基于掩码自注意力机制的UNILM(Li al., 2019)预训练模型作为翻译系统的backbone。我们首先使用了普通的中文(现代文)BERT、Roberta权重作为UNILM的初始权重以训练UNILM…☆53May 3, 2022Updated 4 years ago
- FL-Tuning☆12Jul 11, 2022Updated 3 years ago
- 2019 CAIL 法研杯机器阅读理解挑战赛 第8名 解决方案☆16Jul 13, 2019Updated 6 years ago
- Dataset for Conversation Semantic Role Labeling☆11Aug 26, 2021Updated 4 years ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- ☆14Jan 10, 2024Updated 2 years ago
- 使用UniLM实现中文文本摘要☆43Mar 25, 2020Updated 6 years ago
- ☆13Oct 26, 2020Updated 5 years ago
- 中文版unilm预训练模型☆82Feb 22, 2021Updated 5 years ago
- 用python比较两个字符串差异,高亮差异部分☆27Jul 20, 2020Updated 5 years ago
- 斗破苍穹小说的新词发现☆13May 12, 2022Updated 4 years ago
- An old (incomplete) implementation of "Training Vision Transformers with Only 2040 Images" by YH Cao et al., 2022.☆19Feb 10, 2022Updated 4 years ago
- Chinese version of Dr chen's PhD thesis. 这里是对陈丹琦的博士毕业论文的中文翻译版本。https://chendq-thesis-zh.readthedocs.io/en/latest/☆35Aug 11, 2019Updated 6 years ago
- MT/IE: Cross-lingual Open Information Extraction with Neural Sequence-to-Sequence Models☆23Jul 15, 2018Updated 7 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- segmentation of cloud Images☆20Jul 26, 2019Updated 6 years ago
- No Parameter Left Behind: How Distillation and Model Size Affect Zero-Shot Retrieval☆29Sep 26, 2022Updated 3 years ago
- 分别使用TextRank、BiLSTM和UniLM实现中文文章标题自动生成☆28Sep 22, 2020Updated 5 years ago
- Code for paper "ProgGen: Generating Named Entity Recognition Datasets Step-by-step with Self-Reflexive Large Language Models"☆17Mar 29, 2024Updated 2 years ago
- A Bert2Bert model which able to generate headlines!☆12Nov 16, 2020Updated 5 years ago
- The implementation of RAGSynth: Synthetic Data for Robust and Faithful RAG Component Optimization☆21May 26, 2025Updated last year
- [ACL 2026] R-Search: Empowering LLM Reasoning with Search via Multi-Reward Reinforcement Learning☆34Jan 4, 2026Updated 5 months ago
- blockchain news crawler 金融新闻爬虫+自然语言处理分析☆14Mar 5, 2019Updated 7 years ago
- 文言文信息抽取(实体识别+关系抽取)☆10Feb 24, 2023Updated 3 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- 本项目使用云问科技训练的中文版UniLM模型对微博数据集进行自动标题生成。☆39Apr 4, 2024Updated 2 years ago
- Individual Coefficient Approximation for Risk Estimation (ICARE) model☆18Sep 9, 2023Updated 2 years ago
- ☆11Nov 17, 2024Updated last year
- Hanja Understanding Evaluation Dataset☆15May 2, 2022Updated 4 years ago
- The repository provides code for the paper RECE: Reduced Cross-Entropy Loss for Large-Catalogue Sequential Recommenders, CIKM'24☆11Oct 21, 2024Updated last year
- Analyze the content of one or more XML files. NormaTEI is designed mainly for two uses: control of encoding uniformity and encoding analy…☆18Nov 11, 2024Updated last year
- A Keras implementation of a deep learning network to simultaneously perform Word Segmentation and Part-of-Speech (POS) Tagging introduced…☆11Apr 16, 2022Updated 4 years ago
- Explicit high order interaction models implemented in Keras, including: DCN, xDeepFM, AutoInt etc.☆12Mar 25, 2023Updated 3 years ago
- official code for paper "MMA Regularization: Decorrelating Weights of Neural Networks by Maximizing the Minimal Angles"☆13Oct 20, 2020Updated 5 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- The official implantation of SGPT (CVPR2024)☆18Jul 15, 2024Updated last year
- 古诗词分词,词向量分析,输出到excel,云图☆10Jul 6, 2022Updated 3 years ago
- Code for paper: Optimizing Length Compression in Large Reasoning Models☆28Oct 20, 2025Updated 7 months ago
- Code and Data for Paper "Controlling Styles in Neural Machine Translation with Activation Prompt" (ACL 2023 Findings)☆16Dec 20, 2022Updated 3 years ago
- Improving Pseudo Labels with Global-Local Denoising Framework for Cross-lingual Named Entity Recognition (IJCAI 2024)☆11Aug 18, 2024Updated last year
- 法研杯CAIL2019阅读理解赛题参赛模型☆43Aug 25, 2019Updated 6 years ago
- The dataset was used as a support to attack obfuscations. It is the largest obfuscated dataset, with realistic and various binaries and f…☆20Jan 15, 2026Updated 5 months ago