Law-AI / pretraining-bert
This repository contains the code for pre-training a BERT-base model on a large, unannotated corpus of text using dynamic Masked Language Modeling (MLM) and dynamic Next Sentence Prediction (NSP).
☆13Updated last year
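As a rough illustration of what "dynamic" masking usually means, the sketch below re-samples the masked positions every time a sequence is drawn instead of fixing them once during preprocessing. It is not taken from this repository: the function name `dynamic_mask`, the token ids (assumed to follow the bert-base-uncased vocabulary), the 15% masking rate, the 80/10/10 replacement split, and the `-100` ignore label are all assumptions chosen to mirror the commonly described BERT recipe.

```python
import random

# Assumed values (bert-base-uncased conventions), not read from the repository.
MASK_ID = 103                 # assumed id of [MASK]
VOCAB_SIZE = 30522            # assumed vocabulary size
SPECIAL_IDS = {0, 101, 102}   # assumed ids of [PAD], [CLS], [SEP]
IGNORE_LABEL = -100           # assumed label for positions excluded from the loss


def dynamic_mask(token_ids, rng, mlm_prob=0.15):
    """Return (inputs, labels) with a freshly sampled MLM mask.

    Hypothetical helper: calling it again on the same sequence gives a
    different mask, which is the "dynamic" part.
    """
    inputs = list(token_ids)
    labels = [IGNORE_LABEL] * len(inputs)
    for i, tok in enumerate(token_ids):
        # Never mask special tokens; select other tokens with prob. mlm_prob.
        if tok in SPECIAL_IDS or rng.random() >= mlm_prob:
            continue
        labels[i] = tok                  # the model must predict the original token
        roll = rng.random()
        if roll < 0.8:
            inputs[i] = MASK_ID          # 80%: replace with [MASK]
        elif roll < 0.9:
            # 10%: replace with a random id (simplified: may hit a special id)
            inputs[i] = rng.randrange(VOCAB_SIZE)
        # remaining 10%: keep the original token unchanged
    return inputs, labels


if __name__ == "__main__":
    rng = random.Random(0)
    sentence = [101, 2023, 2003, 1037, 7099, 6251, 102]
    # Two passes over the same sentence draw two independent masks.
    print(dynamic_mask(sentence, rng))
    print(dynamic_mask(sentence, rng))
```

In a training loop, a function like this would be called inside the data loader or collate step so that each epoch sees a different mask for the same text.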
Alternatives and similar repositories for pretraining-bert
Users who are interested in pretraining-bert are comparing it to the repositories listed below.
- Data Mining project☆10Updated last year
- Dataset and code for the paper "LeSICiN: A Heterogeneous Graph-based Approach for Automatic Legal Statute Identification from Indian Leg…☆23Updated last year
- A Systematic Investigation of Transferability and Robustness of Humor Detection Models☆16Updated 9 months ago
- Identifying charges from the Indian Penal Code given the textual description of the charges and facts of a criminal case☆23Updated 2 years ago
- This repository contains links to different Law-AI resources such as datasets and tools.☆17Updated 2 years ago
- Implementation of different summarization algorithms applied to legal case judgements.☆202Updated 2 years ago
- An enthusiastic undergraduate with a passion for data science and machine learning. With over a year of hands-on experience in the field,…☆12Updated last month
- ☆90Updated 3 months ago
- Code for DELSumm, an unsupervised summarization algorithm for legal case judgements.☆29Updated 2 years ago
- Semantic Segmentation of Legal texts that labels sentences with one of 7 rhetorical roles.☆72Updated 10 months ago
- ☆38Updated 2 years ago
- OpenNyAI is a mission aimed at developing open source software and datasets to catalyze the creation of AI-powered solutions to improve a…☆76Updated last year
- Fine-Tuning Falcon-7B, LLAMA 2 with QLoRA to create an advanced AI model with a profound understanding of the Indian legal context.☆79Updated last year
- In this repository, I will keep all my Deep Learning project implementations.☆11Updated 4 years ago
- Coursera Deep Learning Specialization☆14Updated 3 years ago
- OpenNyAI is a mission aimed at developing open source software and datasets to catalyze the creation of AI-powered solutions to improve a…☆40Updated last year
- Text summarization using Python, deep learning, machine learning, transformers, Hugging Face, OpenAI and LangChain☆13Updated 5 months ago
- This repository contains all the Artificial Intelligence projects, such as Machine Learning, Deep Learning and Generative AI, that I have do…☆32Updated 11 months ago
- Learn NLP Tutorials with HuggingFace Transformers☆85Updated 9 months ago
- ☆13Updated 3 months ago
- Describes the IndicNLP corpus and associated datasets☆172Updated 2 years ago
- Abstractive and Extractive Text summarization using Transformers.☆83Updated last year
- This repo is about the classification of rhetorical roles in Legal Documents such as: Citation, Findings of Fact, Evidence, Legal Rule, R…☆14Updated 3 years ago
- LexGLUE: A Benchmark Dataset for Legal Language Understanding in English☆204Updated last year
- ☆27Updated last year
- indicTranslate v1 - Machine Translation for 11 Indic languages. For latest v2, check: https://github.com/AI4Bharat/IndicTrans2☆126Updated last year
- This repository contains the relevant materials for the tutorial "Legal IR and NLP: the History, Challenges, and State-of-the-Art", held …☆41Updated 2 years ago
- meta_llama_2finetuned_text_generation_summarization☆21Updated last year
- A pipeline for transliteration, spell correction, POS tagging and word sense disambiguation of Hinglish code-mixed data to Hindi Devanaga…☆36Updated last year
- Implementation of various data science techniques and research papers☆25Updated 5 months ago