Pretraining, fine-tuning and evaluation scripts for IndicBERT-v2 and IndicXTREME
☆113 · Apr 6, 2025 · Updated last year
Alternatives and similar repositories for IndicBERT
Users who are interested in IndicBERT are comparing it to the libraries listed below.
- This repository contains the HiNER dataset released with our paper at LREC 2022 ☆16 · Jun 6, 2023 · Updated 2 years ago
- A collaborative catalog of NLP resources for Indic languages ☆631 · Dec 14, 2024 · Updated last year
- Translation models for 22 scheduled languages of India ☆427 · Oct 3, 2025 · Updated 7 months ago
- Pre-trained, multilingual sequence-to-sequence models for Indian languages ☆51 · Jul 20, 2022 · Updated 3 years ago
- Pretraining, fine-tuning and evaluation scripts for Indic-Wav2Vec2 ☆111 · Aug 28, 2025 · Updated 8 months ago
- We want to build open-source solutions and standards for using AI to solve mental health challenges. The goal is to apply DPI knowledge a… ☆27 · Jun 13, 2025 · Updated 10 months ago
- ☆10 · Dec 5, 2017 · Updated 8 years ago
- 🗺️ OpenStreetMap Countries GeoJSON, updated daily! ☆18 · Aug 17, 2025 · Updated 8 months ago
- We created the Telugu dataset to address the challenge of building Automatic Speech Recognition (ASR) systems for Indian languages, consi… ☆21 · Feb 8, 2024 · Updated 2 years ago
- Desktop Sanskrit-English Dictionary ☆12 · Jul 7, 2020 · Updated 5 years ago
- A rule-based lemmatizer for Bengali / Bangla written in Python. Under active development. ☆25 · Dec 28, 2019 · Updated 6 years ago
- Code for extracting parallel corpora from pmindia ☆17 · Jan 28, 2020 · Updated 6 years ago
- Code repository for "Introducing Airavata: Hindi Instruction-tuned LLM" ☆64 · Oct 26, 2024 · Updated last year
- IndicGenBench is a high-quality, multilingual, multi-way parallel benchmark for evaluating Large Language Models (LLMs) on 4 user-facing … ☆58 · Sep 1, 2024 · Updated last year
- ☆25 · Nov 1, 2024 · Updated last year
- Describes the IndicNLP corpus and associated datasets ☆203 · Apr 16, 2023 · Updated 3 years ago
- ☆18 · May 28, 2024 · Updated last year
- Marathi NLP is a repository dedicated to the development of tools and resources for the Marathi language. ☆158 · Sep 14, 2025 · Updated 7 months ago
- The Dakshina dataset is a collection of text in both Latin and native scripts for 12 South Asian languages. For each language, the datase… ☆208 · May 27, 2020 · Updated 5 years ago
- Yet Another Neural Machine Translation Toolkit ☆179 · Mar 7, 2025 · Updated last year
- Code Repository for the IndicXNLI paper. ☆15 · Jul 8, 2023 · Updated 2 years ago
- ☆18 · Feb 10, 2025 · Updated last year
- A flight controller for a PPM receiver and MPU6050 stabilizer ☆11 · Jun 9, 2020 · Updated 5 years ago
- A hypernym discovery system which learns to predict is-a relationships between words using projection learning ☆36 · Jun 24, 2020 · Updated 5 years ago
- Operating Systems code for problems mentioned in Galvin, for purely academic purposes only ☆12 · May 26, 2022 · Updated 3 years ago
- Curated implementation notebooks and scripts of deep learning based natural language processing tasks and challenges in TensorFlow. ☆11 · Apr 24, 2020 · Updated 6 years ago
- Tools for calculating psycholinguistically-relevant metrics of language statistics using transformer language models ☆12 · Nov 11, 2022 · Updated 3 years ago
- Code for the paper Factorizing Content and Budget Decisions in Abstractive Summarization of Long Documents: https://arxiv.org/abs/2205.12… ☆12 · Feb 10, 2024 · Updated 2 years ago
- Framework for working with brat-annotated .ann files ☆10 · Mar 16, 2026 · Updated last month
- Thesis template for IIT Hyderabad. ☆14 · May 9, 2023 · Updated 3 years ago
- Basic Vacuum Cleaner World Problem in Artificial Intelligence that describes how an action is performed after sensing the environment. ☆11 · Feb 11, 2019 · Updated 7 years ago
- The first real-world FL benchmark for legal NLP ☆13 · Nov 29, 2023 · Updated 2 years ago
- ☆13 · Dec 15, 2022 · Updated 3 years ago
- State of the Art Language models and Classifier for Tamil language (spoken in India and a few other South Asian countries) ☆52 · Aug 7, 2020 · Updated 5 years ago
- A Kaldi recipe for training automatic speech recognition systems on the Torgo corpus of dysarthric speech ☆17 · Sep 22, 2023 · Updated 2 years ago
- This is the home of the source code for Motion Vector Extrapolation (MOVEX). ☆13 · Jan 3, 2022 · Updated 4 years ago
- Repository contains Python code for image pre-processing and captioning with a deep learning model ☆15 · Dec 8, 2020 · Updated 5 years ago
- Repository of the ICNLSP 2024 paper "Efficient Few-shot Learning for Multi-label Classification of Scientific Documents with Many Classes… ☆17 · Jan 7, 2025 · Updated last year
- ☆45 · Feb 11, 2026 · Updated 2 months ago