Law-AI / pretraining-bertLinks
This repository contains the codes for pre-training a BERT-base model on a large, un-annotated corpus of text using dynamic Masked Language Modeling (MLM) and dynamic Next Sentence Prediction (NSP).
☆13Updated 2 years ago
Alternatives and similar repositories for pretraining-bert
Users that are interested in pretraining-bert are comparing it to the libraries listed below
Sorting:
- Data Mining project☆10Updated last year
- Dataset and codes for the paper "LeSICiN: A Heterogeneous Graph-based Approach for Automatic Legal Statute Identification from Indian Leg…☆24Updated last year
- Through chatbots one can communicate with text or voice interface and get reply through Artificial intelligence. Typically, a chat bot wi…☆16Updated 4 years ago
- An Enthusiastic undergraduate with a passion for Data Science and Machine learning. With over a year of hands-on experience in the field,…☆12Updated 2 weeks ago
- Identifying charges from the Indian Penal Code given the textual description of the charges and facts of a criminal case☆25Updated 2 years ago
- Coursera Deep Learning Specialization☆15Updated 4 years ago
- In this repository, I will keep my all Deep Learning project implementations.☆11Updated 4 years ago
- This repository contains links to different Law-AI resources such as datasets and tools.☆18Updated 2 years ago
- Artificial Intelligence project where I developed an expert system to detect cardiovascular diseases and provide a recommended treatment …☆21Updated 4 years ago
- Code for DELSumm, an unsupervised summarization algorithm for legal case judgements.☆29Updated 2 years ago
- Semantic Segmentation of Legal texts that labels sentences with one of 7 rhetorical roles.☆77Updated last year
- A category wise collection of 200+ LLM survey papers.☆176Updated 4 months ago
- A blueprint for creating Pretraining and Fine-Tuning datasets for Indic languages☆109Updated 10 months ago
- Chitralekha - A video transcreation platform for Indic languages, supporting transcription, translation and voice-over☆107Updated 8 months ago
- Master list of curated resources on NLP and LLMs☆139Updated last year
- Text summation using python, deep learning, machine learning, transformer, huggingface, openai and langchain☆13Updated 9 months ago
- This Repo consists the python note books of IITM - Mathematical Foundations for Generative AI Course,☆188Updated 3 weeks ago
- ☆97Updated 6 months ago
- This repo consists of prompting style of different widely used LLMs in the LLM space.☆36Updated last year
- This repo contains assignments and projects specifically to the various Gen AI courses that I am auditing.☆19Updated 10 months ago
- ☆29Updated last year
- A Hands on series on developing LLM applications☆65Updated 11 months ago
- Repository of implementations of classic and sota rl algorithms from scratch in PyTorch☆133Updated last week
- A collaborative catalog of NLP resources for Indic languages☆611Updated 8 months ago
- Machine Learning Toolbox 2☆13Updated last week
- Implementation of different summarization algorithms applied to legal case judgements.☆209Updated 2 years ago
- A New Tamil Large Language Model (LLM) Based on Llama 2☆310Updated last year
- Everything about LLMs in production.☆75Updated last year
- Toolkit for a learning health system☆20Updated last week
- Advanced Retrieval-Augmented Generation (RAG) through practical notebooks, using the power of the Langchain, OpenAI GPTs ,META LLAMA3 , A…☆79Updated last year