Generate BERT vocabularies and pretraining examples from Wikipedias
☆17May 11, 2020Updated 5 years ago
Alternatives and similar repositories for wiki-bert-pipeline
Users that are interested in wiki-bert-pipeline are comparing it to the libraries listed below
Sorting:
- Research code for the paper "How Good is Your Tokenizer? On the Monolingual Performance of Multilingual Language Models"☆28Oct 3, 2021Updated 4 years ago
- Code for 'Contrastive Multi-Document Question Generation'☆11Oct 16, 2022Updated 3 years ago
- This repository contains the sample code to benchmark popular time series forecast algorithms using Gluonts in AWS Sagemaker Notebook Ins…☆13Jul 26, 2021Updated 4 years ago
- Exploring implementing a simple tagger using neural network frameworks☆20Oct 24, 2022Updated 3 years ago
- Training and evaluation code for the paper "Headless Language Models: Learning without Predicting with Contrastive Weight Tying" (https:/…☆28Apr 17, 2024Updated last year
- As good as new. How to successfully recycle English GPT-2 to make models for other languages (ACL Findings 2021)☆48Aug 2, 2021Updated 4 years ago
- Fusion for TREC run files with popular fusion techniques☆21Aug 26, 2022Updated 3 years ago
- ☆24Oct 23, 2020Updated 5 years ago
- [EMNLP-Findings 2020] Adapting BERT for Word Sense Disambiguation with Gloss Selection Objective and Example Sentences☆64May 12, 2024Updated last year
- ☆65Apr 8, 2020Updated 5 years ago
- Hinglish Text Classification☆30Jun 12, 2023Updated 2 years ago
- ☆33Mar 1, 2023Updated 3 years ago
- mReasoner is a unified computational implementation of the model theory of thinking and reasoning☆13Aug 17, 2023Updated 2 years ago
- This is the Javascript Code, it helps you to find you visited your Facebook Profile.☆12Sep 13, 2018Updated 7 years ago
- This will help for users who want to integrate django and keycloak using django all-auth☆11Sep 5, 2022Updated 3 years ago
- Source codes of Neural Quality Estimation with Multiple Hypotheses for Grammatical Error Correction☆43Jul 2, 2021Updated 4 years ago
- BERT, RoBERTa fine-tuning over SQuAD Dataset using pytorch-lightning⚡️, 🤗-transformers & 🤗-nlp.☆36Jun 12, 2023Updated 2 years ago
- Containerfile for the Vanilla OS Desktop+Nvidia image.☆16Mar 1, 2026Updated last week
- A simple API that can generate various types of hexagon grids - returns GeoJSON data or load into PostGIS with performant JDBC.☆10Aug 2, 2025Updated 7 months ago
- ☆10Jul 6, 2023Updated 2 years ago
- A Library for Scaling Mixed-Integer Optimization-Based Machine Learning.☆12Jun 24, 2024Updated last year
- Fake NEWS detector using LIAR dataset.☆11Aug 19, 2019Updated 6 years ago
- Collection of iPython notebooks with some quick demos☆11May 25, 2017Updated 8 years ago
- Code that drives the public web-based tools for the Media Cloud Online News Archive and Directory.☆11Updated this week
- Use Rust in React Native through WebAssembly☆11Jan 7, 2023Updated 3 years ago
- Linear Relational Embeddings (LREs) and Linear Relational Concepts (LRCs) for LLMs in PyTorch☆10Aug 7, 2024Updated last year
- a new dataset for persian text emotion detection☆12Sep 19, 2022Updated 3 years ago
- Security research organization dedicated to finding low hanging, critical, vulnerabilities.☆15May 12, 2022Updated 3 years ago
- C4RepSet: Representative Subset from C4 data for Training Pre-trained LMs☆11Jan 13, 2023Updated 3 years ago
- ☆12Dec 8, 2022Updated 3 years ago
- A monolithic index that supports worst-case optimal joins (WCOJ) by providing all collation orders in a single redundancy eliminating dat…☆16Sep 18, 2025Updated 5 months ago
- Implementation of our paper "Injecting Knowledge Base Information into End-to-End Joint Entity and Relation Extraction and Coreference Re…☆10Jan 22, 2022Updated 4 years ago
- Simulated user for TREC 2016-2017 Dynamic Domain track☆10Dec 27, 2017Updated 8 years ago
- Ukrainian ELECTRA model☆12Mar 11, 2023Updated 2 years ago
- ☆11Feb 23, 2024Updated 2 years ago
- Evaluation of GPT-3 for clinical information extraction tasks.☆11Dec 13, 2022Updated 3 years ago
- Building applications with DeepSeek R1 model☆12Feb 15, 2025Updated last year
- ☆23Oct 2, 2025Updated 5 months ago
- EMNLP 2022: Analyzing and Evaluating Faithfulness in Dialogue Summarization☆13Mar 20, 2025Updated 11 months ago