Builds wordpiece(subword) vocabulary compatible for Google Research's BERT
☆230Dec 4, 2020Updated 5 years ago
Alternatives and similar repositories for bert-vocab-builder
Users that are interested in bert-vocab-builder are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- 1. Pretrain Albert on custom corpus 2. Finetune the pretrained Albert model on downstream task☆33Jun 4, 2020Updated 5 years ago
- TensorFlow code and pre-trained models for BERT☆117Mar 11, 2020Updated 6 years ago
- baikal.ai's pre-trained BERT models: descriptions and sample codes☆12Jun 24, 2021Updated 4 years ago
- Subword Language Model for Query Auto-Completion☆66Sep 5, 2019Updated 6 years ago
- A framework for Lexical Simplification.☆14Mar 27, 2018Updated 8 years ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- Code for using and evaluating SpanBERT.☆907Jul 25, 2023Updated 2 years ago
- ☆15Jul 16, 2021Updated 4 years ago
- "다중 도메인 대화 상태 추적" Contest. Public LB 1등, Private LB 1등☆11Jun 26, 2021Updated 4 years ago
- 中文预训练XLNet模型: Pre-Trained Chinese XLNet_Large☆229Sep 13, 2019Updated 6 years ago
- Code and data for automatic paraphrase dataset augmentation.☆11Mar 8, 2021Updated 5 years ago
- ☆20Aug 21, 2020Updated 5 years ago
- Language Models as Few-Shot Learner for Task-Oriented Dialogue Systems☆22May 28, 2021Updated 4 years ago
- Multi-Task Deep Neural Networks for Natural Language Understanding☆2,256Mar 7, 2024Updated 2 years ago
- 📖The Big-&-Extending-Repository-of-Transformers: Pretrained PyTorch models for Google's BERT, OpenAI GPT & GPT-2, Google/CMU Transformer…☆12Apr 11, 2019Updated 7 years ago
- GPUs on demand by Runpod - Special Offer Available • AdRun AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
- Word Piece Model python light version with functions tokenize/save/load☆64Oct 1, 2020Updated 5 years ago
- XLNet: Generalized Autoregressive Pretraining for Language Understanding☆6,175May 28, 2023Updated 2 years ago
- ☆21Nov 20, 2020Updated 5 years ago
- ALBERT: A Lite BERT for Self-supervised Learning of Language Representations☆3,283Apr 14, 2023Updated 3 years ago
- 한국어 문장 띄어쓰기(삭제/추가) 모델입니다. 데이터 준비 후 직접 학습이 가능하도록 작성하였습니다.☆56Jul 11, 2022Updated 3 years ago
- IR-BERT at TREC 2020: Leveraging BERT for Semantic Search in Background Linking☆14Feb 21, 2022Updated 4 years ago
- ELECTRA: Pre-training Text Encoders as Discriminators Rather Than Generators☆2,369Mar 23, 2024Updated 2 years ago
- LM Pretraining with PyTorch/TPU☆137Oct 24, 2019Updated 6 years ago
- multi-gpu pre-training in one machine for BERT without horovod (Data Parallelism)☆171Dec 27, 2025Updated 4 months ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- a simple yet complete implementation of the popular BERT model☆128Mar 19, 2020Updated 6 years ago
- This repository contains the code for "BERTRAM: Improved Word Embeddings Have Big Impact on Contextualized Representations".☆64Aug 13, 2020Updated 5 years ago
- Source Code for paper "NERO: A Neural Rule Grounding Framework for Label-Efficient Relation Extraction", WWW 2020☆46May 6, 2020Updated 6 years ago
- Code for the ACL 2022 (Long paper): "New Intent Discovery with Pre-training and Contrastive Learning".☆14Jul 18, 2022Updated 3 years ago
- This repository contains various ways to calculate sentence vector similarity using NLP models☆198Apr 14, 2020Updated 6 years ago
- A Keras TensorFlow 2.0 implementation of BERT, ALBERT and adapter-BERT.☆807Jan 13, 2023Updated 3 years ago
- bert nlp papers, applications and github resources, including the newst xlnet , BERT、XLNet 相关论文和 github 项目☆1,844Mar 21, 2021Updated 5 years ago
- 🏄 Scalable embedding, reasoning, ranking for images and sentences with CLIP☆12,834Jan 23, 2024Updated 2 years ago
- ☆18Jun 28, 2017Updated 8 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- TensorFlow code and pre-trained models for BERT☆40,014Jul 23, 2024Updated last year
- MIC-CIS entry in PharmaCoNER, Bacteria Biotope (BB 2029) & SeeDev 2019 Shared Tasks in EMNLP '19☆11Feb 22, 2020Updated 6 years ago
- BERT for Multitask Learning☆544Apr 12, 2023Updated 3 years ago
- XLNet for generating language.☆166Jan 30, 2021Updated 5 years ago
- ☆12Mar 20, 2020Updated 6 years ago
- Repository for the paper "Optimal Subarchitecture Extraction for BERT"☆470Jun 22, 2022Updated 3 years ago
- Semantic search using Transformers and others☆110Aug 27, 2020Updated 5 years ago