A blueprint for creating Pretraining and Fine-Tuning datasets for Indic languages
☆401 · Oct 7, 2024 · Updated last year
Alternatives and similar repositories for IndicLLMSuite
Users that are interested in IndicLLMSuite are comparing it to the libraries listed below.
- Setu is a comprehensive pipeline designed to clean, filter, and deduplicate diverse data sources including Web, PDF, and Speech data. Bui… · ☆16 · May 17, 2024 · Updated 2 years ago
- Translation models for 22 scheduled languages of India · ☆430 · Oct 3, 2025 · Updated 7 months ago
- Language Identification for Indian languages · ☆36 · Dec 2, 2025 · Updated 5 months ago
- Code repository for "Introducing Airavata: Hindi Instruction-tuned LLM" · ☆64 · Oct 26, 2024 · Updated last year
- End-to-end Text-to-Speech with Generative Adversarial Networks · ☆20 · Feb 6, 2021 · Updated 5 years ago
- A collaborative catalog of NLP resources for Indic languages · ☆632 · Dec 14, 2024 · Updated last year
- ☆11 · Oct 9, 2023 · Updated 2 years ago
- Ongoing research training transformer language models at scale, including: BERT · ☆16 · Apr 25, 2019 · Updated 7 years ago
- My second web development project using Dash. · ☆10 · Jun 20, 2023 · Updated 2 years ago
- Shoonya - Platform to Annotate and label data at scale. · ☆68 · Oct 31, 2025 · Updated 6 months ago
- This repository contains the code for dataset curation and finetuning of instruct variant of the Bilingual OpenHathi model. The resultin… · ☆23 · Dec 23, 2023 · Updated 2 years ago
- A lightweight evaluation suite tailored specifically for assessing Indic LLMs across a diverse range of tasks · ☆39 · Jun 10, 2024 · Updated last year
- ☆45 · Dec 15, 2022 · Updated 3 years ago
- ☆30 · Apr 20, 2024 · Updated 2 years ago
- A New Tamil Large Language Model (LLM) Based on Llama 2 · ☆327 · Apr 5, 2024 · Updated 2 years ago
- A fast CPU-first video/audio transcriber for generating caption files with Whisper and CTranslate2, hosted on Hugging Face Spaces. · ☆11 · Updated this week
- Text-to-Speech for languages of India · ☆367 · Nov 8, 2024 · Updated last year
- Bangla PDF to text converter that works on Windows, macOS, and Linux without any extra downloads or configurations. · ☆21 · Oct 12, 2024 · Updated last year
- Named Entity Recognition in PyTorch on CoNLL2003 dataset · ☆16 · Nov 30, 2021 · Updated 4 years ago
- generate video with voice narration from ppt/pdf Slides · ☆10 · Sep 4, 2023 · Updated 2 years ago
- 🎯 Speech Recognition Challenge by Speech Lab - IIT Madras · ☆10 · Nov 5, 2020 · Updated 5 years ago
- Build Agentic workflows with function calling using open LLMs · ☆28 · May 4, 2026 · Updated 3 weeks ago
- Writing Blog Posts with Generative Feedback Loops! · ☆50 · Mar 19, 2024 · Updated 2 years ago
- A library of translation-based text similarity measures · ☆25 · Dec 11, 2023 · Updated 2 years ago
- An LLM enabled XML generator for Indian laws in the LegalDocML and LegalRuleML formats · ☆20 · Sep 6, 2024 · Updated last year
- Dhruva is an open-source platform for serving language AI models at scale. · ☆23 · Aug 25, 2025 · Updated 9 months ago
- Includes additional materials for the following keras.io blog post. · ☆12 · Jun 23, 2021 · Updated 4 years ago
- Prompts and evaluation data for LLMs on real world coding and writing tasks · ☆17 · Sep 13, 2025 · Updated 8 months ago
- ☆24 · Jan 28, 2024 · Updated 2 years ago
- A Tutorial on RAG and Fine-Tuning LLMs · ☆14 · Nov 27, 2023 · Updated 2 years ago
- Official implementation of the paper "SPEAKER VGG CCT: Cross-corpus Speech Emotion Recognition with Speaker Embedding and Vision Transfor… · ☆25 · Feb 17, 2023 · Updated 3 years ago
- Data and code for replicating WMT17 Multimodal Translation results · ☆16 · Oct 10, 2018 · Updated 7 years ago
- Testing and evaluating the capabilities of Vision-Language models (PaliGemma) in performing computer vision tasks such as object detectio… · ☆88 · May 29, 2024 · Updated last year
- A common protocol for AI agent tools · ☆10 · Oct 21, 2024 · Updated last year
- Apps that run on modal.com · ☆13 · Sep 14, 2025 · Updated 8 months ago
- Extract text from your DOCX documents. · ☆11 · Feb 10, 2024 · Updated 2 years ago
- ☆12 · Oct 24, 2017 · Updated 8 years ago
- Automate non-novel work · ☆11 · May 1, 2023 · Updated 3 years ago
- Indic-Conformer models for ASR · ☆19 · Jul 19, 2024 · Updated last year