Exploration-Lab / HLDC
☆14Updated 4 months ago
Alternatives and similar repositories for HLDC
Users that are interested in HLDC are comparing it to the libraries listed below
- ☆95Updated 4 months ago
- OpenNyAI is a mission aimed at developing open source software and datasets to catalyze the creation of AI-powered solutions to improve a…☆40Updated last year
- indicTranslate v1 - Machine Translation for 11 Indic languages. For latest v2, check: https://github.com/AI4Bharat/IndicTrans2☆127Updated last year
- Pretraining, fine-tuning and evaluation scripts for IndicBERT-v2 and IndicXTREME☆97Updated 2 months ago
- MAFAND-MT☆56Updated 11 months ago
- This repository contains the HiNER dataset released with our paper at LREC 2022☆15Updated 2 years ago
- LexGLUE: A Benchmark Dataset for Legal Language Understanding in English☆208Updated 2 years ago
- Describes the IndicNLP corpus and associated datasets☆173Updated 2 years ago
- Pre-trained, multilingual sequence-to-sequence models for Indian languages☆48Updated 2 years ago
- Code repository for "Introducing Airavata: Hindi Instruction-tuned LLM"☆59Updated 8 months ago
- Some notebooks for NLP☆204Updated last year
- Yet Another Neural Machine Translation Toolkit☆179Updated 3 months ago
- ☆18Updated 3 years ago
- A benchmark for code-switched NLP, ACL 2020☆75Updated last year
- Efficient Attention for Long Sequence Processing☆94Updated last year
- Fine-tuning Open-Source LLMs for Adaptive Machine Translation☆80Updated last month
- Repo for training MLMs, CLMs, or T5-type models on the OLM pretraining data, but it should work with any Hugging Face text dataset.☆93Updated 2 years ago
- Contains notebooks on various transformer-based models for different NLP tasks☆41Updated 2 years ago
- This repository is dedicated to the development of code-mixed language resources.☆26Updated last year
- Code Repository for the IndicXNLI paper.☆15Updated last year
- A pipeline for transliteration, spell correction, POS tagging and word sense disambiguation of Hinglish code mixed data to Hindi Devanaga…☆36Updated last year
- [EMNLP'23] Official Code for "FOCUS: Effective Embedding Initialization for Monolingual Specialization of Multilingual Models"☆32Updated 3 weeks ago
- ☆99Updated 6 months ago
- Code for Multilingual Eval of Generative AI paper published at EMNLP 2023☆69Updated last year
- Tools for evaluating the performance of MT metrics on data from recent WMT metrics shared tasks.☆109Updated 3 months ago
- Defines Transformer, T5, and RoBERTa encoder-decoder models for product name generation☆48Updated 3 years ago
- Marathi NLP is a repository dedicated to the development of tools and resources for the Marathi language.☆138Updated 2 weeks ago
- 📖 A curated list of LegalNLP resources from all around the web.☆276Updated 2 years ago
- A blueprint for creating Pretraining and Fine-Tuning datasets for Indic languages☆107Updated 8 months ago
- This repository contains the code, data, and models of the paper titled "XL-Sum: Large-Scale Multilingual Abstractive Summarization for 4…☆271Updated last year