A blueprint for creating Pretraining and Fine-Tuning datasets for Indic languages
☆399Oct 7, 2024Updated last year
Alternatives and similar repositories for IndicLLMSuite
Users that are interested in IndicLLMSuite are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Setu is a comprehensive pipeline designed to clean, filter, and deduplicate diverse data sources including Web, PDF, and Speech data. Bui…☆16May 17, 2024Updated last year
- A Continually LoRA PreTrained and FineTuned 7B Llama-2 Indic model for Malayalam Language.☆67Jul 16, 2024Updated last year
- A simple, consistent and extendable toolkit for IndicTrans2. (Pypi: https://pypi.org/project/indictranstoolkit)☆38Jul 24, 2025Updated 8 months ago
- Translation models for 22 scheduled languages of India☆421Oct 3, 2025Updated 6 months ago
- FBI: Finding Blindspots in LLM Evaluations with Interpretable Checklists☆31Aug 14, 2025Updated 8 months ago
- Serverless GPU API endpoints on Runpod - Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- Language Identification for Indian languages☆33Dec 2, 2025Updated 4 months ago
- Repository for fine-tuning gemma models using unsloth for indic languages☆100Mar 18, 2024Updated 2 years ago
- Code repository for "Introducing Airavata: Hindi Instruction-tuned LLM"☆64Oct 26, 2024Updated last year
- Generate large textual corpora for almost any language by crawling the web☆13Feb 17, 2024Updated 2 years ago
- A Catalog lists instruction sets, models available for Indic language☆10Mar 14, 2024Updated 2 years ago
- Open Fiesta lets you chat with 100+ AI models like OpenAI, Gemini, Claude, Perplexity, Deepseek, and Grok in one place. Compare model res…☆25Apr 2, 2026Updated 2 weeks ago
- End-to-end Text-to-Speech with Generative Adversarial Networks☆20Feb 6, 2021Updated 5 years ago
- Ongoing research training transformer language models at scale, including: BERT☆16Apr 25, 2019Updated 6 years ago
- My second web development project using Dash.☆10Jun 20, 2023Updated 2 years ago
- Deploy open-source AI quickly and easily - Bonus Offer • AdRunpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
- Shoonya - Platform to Annotate and label data at scale.☆67Oct 31, 2025Updated 5 months ago
- This repository contains the code for dataset curation and finetuning of instruct variant of the Bilingual OpenHathi model. The resultin…☆23Dec 23, 2023Updated 2 years ago
- A lightweight evaluation suite tailored specifically for assessing Indic LLMs across a diverse range of tasks☆39Jun 10, 2024Updated last year
- ASCII <-> Unicode conversion library☆18Apr 1, 2024Updated 2 years ago
- ☆29Apr 20, 2024Updated last year
- A New Tamil Large Language Model (LLM) Based on Llama 2☆325Apr 5, 2024Updated 2 years ago
- A JavaScript Input Method Engine inspired by ibus on GNU/Linux☆17May 13, 2023Updated 2 years ago
- ☆24May 5, 2022Updated 3 years ago
- generate video with voice narration from ppt/pdf Slides☆10Sep 4, 2023Updated 2 years ago
- Wordpress hosting with auto-scaling - Free Trial • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- 🎯 Speech Recognition Challenge by Speech Lab - IIT Madras☆10Nov 5, 2020Updated 5 years ago
- Writing Blog Posts with Generative Feedback Loops!☆50Mar 19, 2024Updated 2 years ago
- A library of translation-based text similarity measures☆25Dec 11, 2023Updated 2 years ago
- Dhruva is an open-source platform for serving language AI models at scale.☆21Aug 25, 2025Updated 7 months ago
- Includes additional materials for the following keras.io blog post.☆12Jun 23, 2021Updated 4 years ago
- Prompts and evaluation data for LLMs on real world coding and writing tasks☆17Sep 13, 2025Updated 7 months ago
- Simple AI agents / assistants☆51Oct 8, 2024Updated last year
- Exploration of Vector database Index for fast approximate nearest neighbour search.☆37Aug 4, 2024Updated last year
- This is the Placeholder for Llama. Starting with Llama 3☆11May 20, 2024Updated last year
- GPUs on demand by Runpod - Special Offer Available • AdRun AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
- ☆15Aug 15, 2023Updated 2 years ago
- Data and code for replicating WMT17 Multimodal Translation results☆16Oct 10, 2018Updated 7 years ago
- Testing and evaluating the capabilities of Vision-Language models (PaliGemma) in performing computer vision tasks such as object detectio…☆88May 29, 2024Updated last year
- Apps that run on modal.com☆13Sep 14, 2025Updated 7 months ago
- Modified version of fairseq, including new implementations for criterions using reinforcement learning methods.☆11Aug 14, 2019Updated 6 years ago
- ML algorithms implementations that are good for learning the underlying principles☆28Dec 7, 2024Updated last year
- An example of stateful AI agent powered by Letta and Gemini 3 pro.☆30Dec 6, 2025Updated 4 months ago