Exploration-Lab / HLDC
☆13Updated 2 years ago
Related projects ⓘ
Alternatives and complementary repositories for HLDC
- ☆84Updated last year
- OpenNyAI is a mission aimed at developing open source software and datasets to catalyze the creation of AI-powered solutions to improve a…☆35Updated 7 months ago
- Resources for cultural NLP research☆67Updated this week
- A Python package to compute HONEST, a score to measure hurtful sentence completions in language models. Published at NAACL 2021.☆20Updated last year
- This repository is dedicated to development of code-mixed language resources.☆24Updated last year
- A reading list of up-to-date papers on NLP for Social Good.☆286Updated last year
- Description Describes the IndicNLP corpus and associated datasets☆157Updated last year
- This repository contains the HiNER dataset released with our paper at LREC 2022☆15Updated last year
- SemEval 2024 Task 1 : Textual Semantic Relatedness☆23Updated 5 months ago
- Yet Another Neural Machine Translation Toolkit☆174Updated 4 months ago
- indicTranslate v1 - Machine Translation for 11 Indic languages. For latest v2, check: https://github.com/AI4Bharat/IndicTrans2☆120Updated 10 months ago
- Code for Multilingual Eval of Generative AI paper published at EMNLP 2023☆66Updated 8 months ago
- Code Repository for the IndicXNLI paper.☆14Updated last year
- Efficient Attention for Long Sequence Processing☆89Updated 11 months ago
- Dataset and codes for the paper "LeSICiN: A Heterogeneous Graph-based Approach for Automatic Legal Statute Identification from Indian Leg…☆19Updated 7 months ago
- ☆16Updated 9 months ago
- Course for Interpreting ML Models☆52Updated last year
- An assignment for CMU CS11-711 Advanced NLP, building NLP systems from scratch☆170Updated last year
- Some notebooks for NLP☆188Updated last year
- LexGLUE: A Benchmark Dataset for Legal Language Understanding in English☆187Updated last year
- Pretraining, fine-tuning and evaluation scripts for IndicBERT-v2 and IndicXTREME☆76Updated 2 months ago
- Collection of NLP model explanations and accompanying analysis tools☆145Updated last year
- A curated list of awesome datasets with human label variation (un-aggregated labels) in Natural Language Processing and Computer Vision, …☆76Updated 7 months ago
- ☆42Updated 2 years ago
- Code repository for "Introducing Airavata: Hindi Instruction-tuned LLM"☆54Updated 3 weeks ago
- ☆65Updated last year
- Dataset from the paper "Mintaka: A Complex, Natural, and Multilingual Dataset for End-to-End Question Answering" (COLING 2022)☆104Updated 2 years ago
- Google's BigBird (Jax/Flax & PyTorch) @ 🤗Transformers☆47Updated last year
- SeeGULL is a broad-coverage stereotype dataset in English containing stereotypes about identity groups spanning 178 countries across 8 di…☆33Updated last year
- Define Transformers, T5 model and RoBERTa Encoder decoder model for product names generation☆48Updated 3 years ago