This repository contains the codes for pre-training a BERT-base model on a large, un-annotated corpus of text using dynamic Masked Language Modeling (MLM) and dynamic Next Sentence Prediction (NSP).
☆15Jun 19, 2023Updated 2 years ago
Alternatives and similar repositories for pretraining-bert
Users that are interested in pretraining-bert are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Semantic Segmentation of Legal texts that labels sentences with one of 7 rhetorical roles.☆79Jun 19, 2024Updated last year
- Identifying charges from the Indian Penal Code given the textual description of the charges and facts of a criminal case☆27Feb 1, 2023Updated 3 years ago
- 🗺️ Tacking the user distance traveled and time taken using the google maps API☆12Aug 9, 2023Updated 2 years ago
- Dataset and codes for the paper "LeSICiN: A Heterogeneous Graph-based Approach for Automatic Legal Statute Identification from Indian Leg…☆25Apr 20, 2024Updated 2 years ago
- OpenNyAI is a mission aimed at developing open source software and datasets to catalyze the creation of AI-powered solutions to improve a…☆44Apr 17, 2024Updated 2 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Abstractive text summarization done with the help of LSTMs using encoder-decoder model which was able to achieve accuracy of 77.27% on t…☆10Sep 22, 2020Updated 5 years ago
- Code for the paper Factorizing Content and Budget Decisions in Abstractive Summarization of Long Documents: https://arxiv.org/abs/2205.12…☆12Feb 10, 2024Updated 2 years ago
- Thesis template for IIT Hyderabad.☆14May 9, 2023Updated 3 years ago
- The first real-world FL benchmark for legal NLP☆13Nov 29, 2023Updated 2 years ago
- Data Mining project☆10Jan 10, 2024Updated 2 years ago
- ☆17Mar 2, 2024Updated 2 years ago
- ☆10May 20, 2020Updated 6 years ago
- ☆13Jan 30, 2025Updated last year
- ☆15Jan 29, 2025Updated last year
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- ☆19Feb 29, 2024Updated 2 years ago
- ☆13Jan 8, 2025Updated last year
- A text summarizer using Seq2Seq model☆14Sep 7, 2021Updated 4 years ago
- fast, lock-free, core-dumpable prints (meaning you can see not-yet-flushed prints in core dumps/live processes)☆27Oct 24, 2013Updated 12 years ago
- Programs executed as part of the Computer Networks Lab☆15Jan 10, 2024Updated 2 years ago
- Matrix digital rain in P5.JS & Canvas☆11Sep 22, 2019Updated 6 years ago
- Python client library for the Api2Pdf.com REST API - Convert HTML to PDF, URL to PDF, Office Docs to PDF, Merge PDFs, HTML to Image, URL …☆26Apr 7, 2026Updated last month
- An agent with human in the loop that can search the web for information while bypassing bot detection for private sites.☆34Apr 15, 2023Updated 3 years ago
- An SDK and Library that is used in several Deutsche Telekom mobile apps☆12Sep 23, 2024Updated last year
- Deploy open-source AI quickly and easily - Special Bonus Offer • AdRunpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
- DeepDip, a DRL Gym agent that plays no-press Diplomacy in BANDANA☆13Jul 22, 2019Updated 6 years ago
- ☆17Apr 2, 2025Updated last year
- ☆11Feb 25, 2020Updated 6 years ago
- Code for our PLOS ONE paper: "Predicting Human Decision Making in Psychological Tasks with Recurrent Neural Networks"☆13Jun 3, 2022Updated 3 years ago
- Behavior-driven tests for web applications. Use proven patterns for your test project. You can write the executable specifications in Cuc…☆16May 20, 2024Updated 2 years ago
- Talking Avatar: create video from plain text or audio file in minutes, support up to 100+ languages and 350+ voice models.☆13Nov 16, 2022Updated 3 years ago
- How to link the DFPlayer MP3 player to an Arduino☆10May 20, 2018Updated 8 years ago
- New Relic integration for Salesforce logs.☆12May 14, 2026Updated last week
- Analyse the self-attention patterns in BERT for humor classification and verify the linguistic theory of humor, use GPT-2 to create humor…☆11Apr 30, 2020Updated 6 years ago
- End-to-end encrypted cloud storage - Proton Drive • AdSpecial offer: 40% Off Yearly / 80% Off First Month. Protect your most important files, photos, and documents from prying eyes.
- An Enthusiastic undergraduate with a passion for Data Science and Machine learning. With over a year of hands-on experience in the field,…☆30Feb 24, 2026Updated 2 months ago
- A chill and fun blog about Rust stuff and the journey of building my company: Meilisearch☆13May 11, 2026Updated last week
- GOST's combined tools for urban analysis☆13Apr 27, 2026Updated 3 weeks ago
- custom kubernetes scheduler for placing pods based on location data☆11Apr 7, 2023Updated 3 years ago
- React Component for china-location☆14Dec 6, 2022Updated 3 years ago
- Data and additional information regarding the paper: Contract Discovery. Dataset and a Few-Shot Semantic Retrieval Challenge with Competi…☆32Nov 12, 2020Updated 5 years ago
- New York City street maps from the New York Public Library — grouped by decade☆17Jun 21, 2017Updated 8 years ago