This repository contains the code for pre-training a BERT-base model on a large, unannotated text corpus using dynamic Masked Language Modeling (MLM) and dynamic Next Sentence Prediction (NSP).
☆13 · Jun 19, 2023 · Updated 2 years ago
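To illustrate what "dynamic" MLM usually means, here is a minimal sketch in which the masked positions are re-sampled every time a sequence is drawn, instead of being fixed once during preprocessing. It is not taken from this repository: the function name `dynamic_mask`, the toy vocabulary, and the 15% masking rate with an 80/10/10 split (the values from the original BERT recipe) are all assumptions made for illustration.

```python
import random

MASK = "[MASK]"
SPECIAL = {"[CLS]", "[SEP]"}  # never selected for prediction in this sketch

def dynamic_mask(tokens, vocab, rng, mask_prob=0.15):
    """Hypothetical helper: return (masked_tokens, labels).

    labels[i] holds the original token where a prediction is required
    and None everywhere else.
    """
    masked = list(tokens)
    labels = [None] * len(tokens)
    for i, tok in enumerate(tokens):
        if tok in SPECIAL or rng.random() >= mask_prob:
            continue
        labels[i] = tok
        roll = rng.random()
        if roll < 0.8:
            masked[i] = MASK               # 80%: replace with [MASK]
        elif roll < 0.9:
            masked[i] = rng.choice(vocab)  # 10%: replace with a random token
        # remaining 10%: keep the original token unchanged
    return masked, labels

rng = random.Random()
tokens = "[CLS] the court dismissed the appeal [SEP]".split()
vocab = ["the", "court", "appeal", "judge", "statute"]  # toy vocabulary (assumption)

# Calling the function once per epoch draws a fresh mask each time.
for epoch in range(3):
    print(epoch, dynamic_mask(tokens, vocab, rng))
```

Because the mask is drawn at batch-construction time, the model generally sees a different corruption of the same sentence in each epoch. A dynamic NSP variant would presumably re-sample the sentence pairs in the same way, but that is not shown here.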
Alternatives and similar repositories for pretraining-bert
Users who are interested in pretraining-bert are comparing it to the libraries listed below
- Data Mining project ☆10 · Jan 10, 2024 · Updated 2 years ago
- Semantic Segmentation of Legal texts that labels sentences with one of 7 rhetorical roles. ☆78 · Jun 19, 2024 · Updated last year
- Dataset and codes for the paper "LeSICiN: A Heterogeneous Graph-based Approach for Automatic Legal Statute Identification from Indian Leg… ☆25 · Apr 20, 2024 · Updated last year
- Identifying charges from the Indian Penal Code given the textual description of the charges and facts of a criminal case ☆27 · Feb 1, 2023 · Updated 3 years ago
- ☆13 · Mar 2, 2024 · Updated 2 years ago
- An Enthusiastic undergraduate with a passion for Data Science and Machine learning. With over a year of hands-on experience in the field,… ☆23 · Updated this week
- An SDK and Library that is used in several Deutsche Telekom mobile apps ☆12 · Sep 23, 2024 · Updated last year
- New Relic integration for Salesforce logs. ☆12 · Feb 12, 2026 · Updated 2 weeks ago
- An online adaptation of The Republic of Rome, a strategy board game ☆11 · Feb 10, 2026 · Updated 2 weeks ago
- Web app for simulating the collective movement of fish schools ☆11 · May 20, 2022 · Updated 3 years ago
- Matrix digital rain in P5.JS & Canvas ☆11 · Sep 22, 2019 · Updated 6 years ago
- ☆11 · Feb 25, 2020 · Updated 6 years ago
- OpenNyAI is a mission aimed at developing open source software and datasets to catalyze the creation of AI-powered solutions to improve a… ☆42 · Apr 17, 2024 · Updated last year
- Code for our PLOS ONE paper: "Predicting Human Decision Making in Psychological Tasks with Recurrent Neural Networks" ☆13 · Jun 3, 2022 · Updated 3 years ago
- Behavior-driven tests for web applications. Use proven patterns for your test project. You can write the executable specifications in Cuc… ☆16 · May 20, 2024 · Updated last year
- ☆11 · Apr 22, 2024 · Updated last year
- Abstractive text summarization done with the help of LSTMs using encoder-decoder model which was able to achieve accuracy of 77.27% on t… ☆10 · Sep 22, 2020 · Updated 5 years ago
- ☆10 · May 20, 2020 · Updated 5 years ago
- ☆13 · Jan 30, 2025 · Updated last year
- In this repository, I will keep my all Deep Learning project implementations. ☆11 · Nov 22, 2020 · Updated 5 years ago
- Analyse the self-attention patterns in BERT for humor classification and verify the linguistic theory of humor, use GPT-2 to create humor… ☆11 · Apr 30, 2020 · Updated 5 years ago
- How to link the DFPlayer MP3 player to an Arduino ☆10 · May 20, 2018 · Updated 7 years ago
- Talking Avatar: create video from plain text or audio file in minutes, support up to 100+ languages and 350+ voice models. ☆11 · Nov 16, 2022 · Updated 3 years ago
- ☆17 · Apr 2, 2025 · Updated 11 months ago
- fast, lock-free, core-dumpable prints (meaning you can see not-yet-flushed prints in core dumps/live processes) ☆27 · Oct 24, 2013 · Updated 12 years ago
- This is the repository for my 20 credit project during my final year at cardiff university. The project involves researching the best res… ☆12 · Jun 15, 2018 · Updated 7 years ago
- Pipeline for building Machine Learning Classifiers for the diagnosis of EHR text-data. We used this pipeline for our study, published her… ☆12 · Jul 6, 2023 · Updated 2 years ago
- Volca SaaS Boilerplate ☆21 · Sep 3, 2024 · Updated last year
- POSIX: A Prompt Sensitivity Index for Language Models ☆13 · Nov 13, 2024 · Updated last year
- Notes for learning and applying R to questions about crime and the justice system ☆21 · Feb 11, 2026 · Updated 2 weeks ago
- custom kubernetes scheduler for placing pods based on location data ☆11 · Apr 7, 2023 · Updated 2 years ago
- Code for the paper Factorizing Content and Budget Decisions in Abstractive Summarization of Long Documents: https://arxiv.org/abs/2205.12… ☆12 · Feb 10, 2024 · Updated 2 years ago
- Thesis template for IIT Hyderabad. ☆14 · May 9, 2023 · Updated 2 years ago
- AI@School: bringing knowledge about artificial intelligence into schools in a playful way ☆15 · Jun 4, 2025 · Updated 8 months ago
- 🗺️ Tracking the user distance traveled and time taken using the google maps API ☆12 · Aug 9, 2023 · Updated 2 years ago
- Network Management Protocols for Mountebank ☆13 · Oct 13, 2025 · Updated 4 months ago
- The first real-world FL benchmark for legal NLP ☆13 · Nov 29, 2023 · Updated 2 years ago
- Machine Learning Toolbox 2 ☆13 · Nov 22, 2025 · Updated 3 months ago
- Implementation of Fast Reactive Control for Illumination Through Rain and Snow (de Charette et al., 2012) ☆12 · Oct 29, 2024 · Updated last year