A blueprint for creating Pretraining and Fine-Tuning datasets for Indic languages
☆401Oct 7, 2024Updated last year
Alternatives and similar repositories for IndicLLMSuite
Users who are interested in IndicLLMSuite are comparing it to the libraries listed below.
- Setu is a comprehensive pipeline designed to clean, filter, and deduplicate diverse data sources including Web, PDF, and Speech data. Bui…☆16May 17, 2024Updated last year
- A simple, consistent and extendable toolkit for IndicTrans2. (PyPI: https://pypi.org/project/indictranstoolkit)☆38Apr 30, 2026Updated last week
- Translation models for 22 scheduled languages of India☆427Oct 3, 2025Updated 7 months ago
- FBI: Finding Blindspots in LLM Evaluations with Interpretable Checklists☆31Aug 14, 2025Updated 8 months ago
- Repository for fine-tuning gemma models using unsloth for indic languages☆100Mar 18, 2024Updated 2 years ago
- Code repository for "Introducing Airavata: Hindi Instruction-tuned LLM"☆64Oct 26, 2024Updated last year
- End-to-end Text-to-Speech with Generative Adversarial Networks☆20Feb 6, 2021Updated 5 years ago
- A collaborative catalog of NLP resources for Indic languages☆631Dec 14, 2024Updated last year
- ☆11Oct 9, 2023Updated 2 years ago
- Ongoing research training transformer language models at scale, including: BERT☆16Apr 25, 2019Updated 7 years ago
- My second web development project using Dash.☆10Jun 20, 2023Updated 2 years ago
- Shoonya - Platform to Annotate and label data at scale.☆67Oct 31, 2025Updated 6 months ago
- This repository contains the code for dataset curation and finetuning of instruct variant of the Bilingual OpenHathi model. The resultin…☆23Dec 23, 2023Updated 2 years ago
- A lightweight evaluation suite tailored specifically for assessing Indic LLMs across a diverse range of tasks☆39Jun 10, 2024Updated last year
- ☆30Apr 20, 2024Updated 2 years ago
- Open source subtitling platform 💻 for transcribing and translating videos/audios in Indic languages.☆94Oct 3, 2025Updated 7 months ago
- A fast CPU-first video/audio transcriber for generating caption files with Whisper and CTranslate2, hosted on Hugging Face Spaces.☆11Apr 26, 2026Updated last week
- A JavaScript Input Method Engine inspired by ibus on GNU/Linux☆17May 13, 2023Updated 2 years ago
- Bangla PDF to text converter that works on Windows, macOS, and Linux without any extra downloads or configurations.☆21Oct 12, 2024Updated last year
- Source code and dataset for the paper 'Saamayik: A Benchmark and Dataset for English-Sanskrit Translation'☆15Oct 11, 2025Updated 6 months ago
- Survey of small language models☆18Jul 23, 2024Updated last year
- generate video with voice narration from ppt/pdf Slides☆10Sep 4, 2023Updated 2 years ago
- Writing Blog Posts with Generative Feedback Loops!☆50Mar 19, 2024Updated 2 years ago
- DL Backtrace is a new explainability technique for deep learning models that works for any modality and model type.☆25Updated this week
- An LLM enabled XML generator for Indian laws in the LegalDocML and LegalRuleML formats☆20Sep 6, 2024Updated last year
- Dhruva is an open-source platform for serving language AI models at scale.☆22Aug 25, 2025Updated 8 months ago
- Prompts and evaluation data for LLMs on real world coding and writing tasks☆17Sep 13, 2025Updated 7 months ago
- Simple AI agents / assistants☆52Oct 8, 2024Updated last year
- ☆24Jan 28, 2024Updated 2 years ago
- This repository describes the Indic NLP resources from L3Cube.☆23Jun 7, 2025Updated 10 months ago
- A Tutorial on RAG and Fine-Tuning LLMs☆14Nov 27, 2023Updated 2 years ago
- [WIP] AI Try-On plugin for Chrome☆28Mar 16, 2024Updated 2 years ago
- ☆35Jun 15, 2023Updated 2 years ago
- Official implementation of the paper "SPEAKER VGG CCT: Cross-corpus Speech Emotion Recognition with Speaker Embedding and Vision Transfor…☆25Feb 17, 2023Updated 3 years ago
- Data and code for replicating WMT17 Multimodal Translation results☆16Oct 10, 2018Updated 7 years ago
- Apps that run on modal.com☆13Sep 14, 2025Updated 7 months ago
- Indic-Conformer models for ASR☆20Jul 19, 2024Updated last year
- IBM Quantum Challenge Fall 2023☆10May 23, 2023Updated 2 years ago
- An example of stateful AI agent powered by Letta and Gemini 3 pro.☆30Dec 6, 2025Updated 5 months ago