This repository contains the codes for pre-training a BERT-base model on a large, un-annotated corpus of text using dynamic Masked Language Modeling (MLM) and dynamic Next Sentence Prediction (NSP).
☆13Jun 19, 2023Updated 2 years ago
Alternatives and similar repositories for pretraining-bert
Users that are interested in pretraining-bert are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Semantic Segmentation of Legal texts that labels sentences with one of 7 rhetorical roles.☆79Jun 19, 2024Updated last year
- Identifying charges from the Indian Penal Code given the textual description of the charges and facts of a criminal case☆27Feb 1, 2023Updated 3 years ago
- 🗺️ Tacking the user distance traveled and time taken using the google maps API☆12Aug 9, 2023Updated 2 years ago
- Dataset and codes for the paper "LeSICiN: A Heterogeneous Graph-based Approach for Automatic Legal Statute Identification from Indian Leg…☆25Apr 20, 2024Updated last year
- OpenNyAI is a mission aimed at developing open source software and datasets to catalyze the creation of AI-powered solutions to improve a…☆42Apr 17, 2024Updated last year
- Abstractive text summarization done with the help of LSTMs using encoder-decoder model which was able to achieve accuracy of 77.27% on t…☆10Sep 22, 2020Updated 5 years ago
- Code for the paper Factorizing Content and Budget Decisions in Abstractive Summarization of Long Documents: https://arxiv.org/abs/2205.12…☆12Feb 10, 2024Updated 2 years ago
- Thesis template for IIT Hyderabad.☆14May 9, 2023Updated 2 years ago
- The first real-world FL benchmark for legal NLP☆13Nov 29, 2023Updated 2 years ago
- Data Mining project☆10Jan 10, 2024Updated 2 years ago
- ☆13Mar 2, 2024Updated 2 years ago
- ☆10May 20, 2020Updated 5 years ago
- ☆13Jan 30, 2025Updated last year
- ☆15Jan 29, 2025Updated last year
- ☆17Feb 29, 2024Updated 2 years ago
- ☆13Jan 8, 2025Updated last year
- A text summarizer using Seq2Seq model☆14Sep 7, 2021Updated 4 years ago
- fast, lock-free, core-dumpable prints (meaning you can see not-yet-flushed prints in core dumps/live processes)☆27Oct 24, 2013Updated 12 years ago
- Programs executed as part of the Computer Networks Lab☆15Jan 10, 2024Updated 2 years ago
- Matrix digital rain in P5.JS & Canvas☆11Sep 22, 2019Updated 6 years ago
- Python client library for the Api2Pdf.com REST API - Convert HTML to PDF, URL to PDF, Office Docs to PDF, Merge PDFs, HTML to Image, URL …☆26Mar 8, 2026Updated 2 weeks ago
- An agent with human in the loop that can search the web for information while bypassing bot detection for private sites.☆34Apr 15, 2023Updated 2 years ago
- An SDK and Library that is used in several Deutsche Telekom mobile apps☆12Sep 23, 2024Updated last year
- DeepDip, a DRL Gym agent that plays no-press Diplomacy in BANDANA☆13Jul 22, 2019Updated 6 years ago
- ☆11Feb 25, 2020Updated 6 years ago
- Code for our PLOS ONE paper: "Predicting Human Decision Making in Psychological Tasks with Recurrent Neural Networks"☆13Jun 3, 2022Updated 3 years ago
- Behavior-driven tests for web applications. Use proven patterns for your test project. You can write the executable specifications in Cuc…☆16May 20, 2024Updated last year
- Talking Avatar: create video from plain text or audio file in minutes, support up to 100+ languages and 350+ voice models.☆11Nov 16, 2022Updated 3 years ago
- ☆17Apr 2, 2025Updated 11 months ago
- How to link the DFPlayer MP3 player to an Arduino☆10May 20, 2018Updated 7 years ago
- New Relic integration for Salesforce logs.☆12Feb 12, 2026Updated last month
- Analyse the self-attention patterns in BERT for humor classification and verify the linguistic theory of humor, use GPT-2 to create humor…☆11Apr 30, 2020Updated 5 years ago
- A chill and fun blog about Rust stuff and the journey of building my company: Meilisearch☆12Mar 16, 2026Updated last week
- An Enthusiastic undergraduate with a passion for Data Science and Machine learning. With over a year of hands-on experience in the field,…☆24Feb 24, 2026Updated 3 weeks ago
- An online adaptation of The Republic of Rome, a strategy board game☆11Mar 12, 2026Updated last week
- GOST's combined tools for urban analysis☆13Mar 16, 2026Updated last week
- custom kubernetes scheduler for placing pods based on location data☆11Apr 7, 2023Updated 2 years ago
- React Component for china-location☆14Dec 6, 2022Updated 3 years ago
- Data and additional information regarding the paper: Contract Discovery. Dataset and a Few-Shot Semantic Retrieval Challenge with Competi…☆32Nov 12, 2020Updated 5 years ago