Law-AI / pretraining-bertLinks

This repository contains the codes for pre-training a BERT-base model on a large, un-annotated corpus of text using dynamic Masked Language Modeling (MLM) and dynamic Next Sentence Prediction (NSP).

☆13

Alternatives and similar repositories for pretraining-bert

Users that are interested in pretraining-bert are comparing it to the libraries listed below

Sorting:

Law-AI / LeSICiN
Dataset and codes for the paper "LeSICiN: A Heterogeneous Graph-based Approach for Automatic Legal Statute Identification from Indian Leg…
☆24Updated last year
Law-AI / codscomad2023tutorial
This repository contains links to different Law-AI resources such as datasets and tools.
☆18Updated 2 years ago
Law-AI / automatic-charge-identification
Identifying charges from the Indian Penal Code given the textual description of the charges and facts of a criminal case
☆23Updated 2 years ago
Legal-NLP-EkStep / legal_NER
OpenNyAI is a mission aimed at developing open source software and datasets to catalyze the creation of AI-powered solutions to improve a…
☆77Updated last year
NisaarAgharia / Indian-LawyerGPT
Fine-Tuning Falcon-7B, LLAMA 2 with QLoRA to create an advanced AI model with a profound understanding of the Indian legal context.
☆81Updated last year
Law-AI / semantic-segmentation
Semantic Segmentation of Legal texts that labels sentences with one of 7 rhetorical roles.
☆73Updated last year
Law-AI / summarization
Implementation of different summarization algorithms applied to legal case judgements.
☆205Updated 2 years ago
KalyanM45 / KalyanM45
An Enthusiastic undergraduate with a passion for Data Science and Machine learning. With over a year of hands-on experience in the field,…
☆12Updated 2 months ago
Exploration-Lab / CJPE
☆95Updated 4 months ago
Law-AI / DELSumm
Code for DELSumm, an unsupervised summarization algorithm for legal case judgements.
☆28Updated 2 years ago
ShubhamMandowara / Text_summarization
Text summation using python, deep learning, machine learning, transformer, huggingface, openai and langchain
☆13Updated 7 months ago
OpenNyAI / Opennyai
Opennyai : An efficient NLP Pipeline for Indian Legal documents
☆72Updated last year
shsarv / Deep-Learning-Projects
In this repository, I will keep my all Deep Learning project implementations.
☆11Updated 4 years ago
manyasrinivas2021 / ARTIFICIAL-INTELLIGENCE-HEALTHCARE-CHATBOT-SYSTEM-USING-PYTHON
Through chatbots one can communicate with text or voice interface and get reply through Artificial intelligence. Typically, a chat bot wi…
☆16Updated 4 years ago
Legal-NLP-EkStep / rhetorical-role-baseline
OpenNyAI is a mission aimed at developing open source software and datasets to catalyze the creation of AI-powered solutions to improve a…
☆40Updated last year
OssamaLouati / Legal-AI_Project
The Automated Legal Document Analysis Platform is a powerful web application that automates the laborious process of analyzing legal docu…
☆43Updated last year
KalyanM45 / AI-Project-Gallery
This Repository Contain All the Artificial Intelligence Projects such as Machine Learning, Deep Learning and Generative AI that I have do…
☆33Updated last year
deepanshu1995 / HateSpeech-Hindi-English-Code-Mixed-Social-Media-Text
☆12Updated 6 years ago
telekom / mltb2
Machine Learning Toolbox 2
☆12Updated this week
nachiketashunya / Amazon-ML-Challenge-2024
This repo is for Amazon ML Challenge 2024. The challenge was to develop a Machine Learning model to extract product details directly from…
☆65Updated 6 months ago
Ananyapam7 / AILA-Artificial-Intelligence-for-Legal-Assistance
Python implementations of the various methods used in FIRE 2019 conference.
☆60Updated 3 years ago
coastalcph / lex-glue
LexGLUE: A Benchmark Dataset for Legal Language Understanding in English
☆208Updated 2 years ago
ozdemiroorhan / NLP-SQUAD
NLP-CHATBOT
☆12Updated 2 years ago
harshitv804 / LawGPT
A RAG based Generative AI Attorney fed with Indian Penal Code data. Developed using Streamlit, LangChain and TogetherAI API.
☆49Updated last year
AI4Bharat / indicnlp_corpus
Description Describes the IndicNLP corpus and associated datasets
☆173Updated 2 years ago
Liquid-Legal-Institute / Legal-LLMs-GPTs
Large Language Models (LLMs) and Generative Pre-trained Transformers (GPTs) for Legal
☆90Updated 2 years ago
precog-iiith / LLMWorkshop
☆28Updated last year
SnehaSirnam / Diagnose-Cardiovascular-disease
Artificial Intelligence project where I developed an expert system to detect cardiovascular diseases and provide a recommended treatment …
☆20Updated 4 years ago
AI4Bharat / IndicBERT
Pretraining, fine-tuning and evaluation scripts for IndicBERT-v2 and IndicXTREME
☆97Updated 2 months ago
Exploration-Lab / HLDC
☆14Updated 4 months ago