adithya-s-k / indic_eval
A lightweight evaluation suite tailored specifically for assessing Indic LLMs across a diverse range of tasks
☆32 Updated 9 months ago
Alternatives and similar repositories for indic_eval:
Users who are interested in indic_eval are comparing it to the libraries listed below.
- Official repo for the paper PHUDGE: Phi-3 as Scalable Judge. Evaluate your LLMs with or without custom rubric, reference answer, absolute… ☆49 Updated 8 months ago
- Repository for fine-tuning gemma models using unsloth for indic languages ☆85 Updated 11 months ago
- This repository contains the code for dataset curation and finetuning of instruct variant of the Bilingual OpenHathi model. The resultin… ☆23 Updated last year
- An open-source framework designed to adapt pre-trained Language Models (LLMs), such as Llama, Mistral, and Mixtral, to a wide array of dom… ☆13 Updated 9 months ago
- Code repository for "Introducing Airavata: Hindi Instruction-tuned LLM" ☆58 Updated 4 months ago
- Solving data for LLMs - Create quality synthetic datasets! ☆145 Updated last month
- A simple, consistent and extendable toolkit for IndicTrans2 ☆23 Updated last week
- A blueprint for creating Pretraining and Fine-Tuning datasets for Indic languages ☆105 Updated 5 months ago
- IndicGenBench is a high-quality, multilingual, multi-way parallel benchmark for evaluating Large Language Models (LLMs) on 4 user-facing … ☆44 Updated 6 months ago
- CompanionLLM - A framework to finetune LLMs to be your own sentient conversational companion ☆40 Updated last year
- Maya: An Instruction Finetuned Multilingual Multimodal Model using Aya ☆107 Updated 2 weeks ago
- Supervised instruction finetuning for LLM with HF trainer and Deepspeed ☆34 Updated last year
- A tool that facilitates easy, efficient and high-quality fine-tuning of Cohere's models ☆67 Updated last month
- Low latency, High Accuracy, Custom Query routers for Humans and Agents. Built by Prithivi Da ☆94 Updated 2 months ago
- ☆18 Updated 4 months ago
- Function Calling Benchmark & Testing ☆83 Updated 8 months ago
- ☆118 Updated 4 months ago
- Setu is a comprehensive pipeline designed to clean, filter, and deduplicate diverse data sources including Web, PDF, and Speech data. Bui… ☆14 Updated 9 months ago
- Doing simple retrieval from LLM models at various context lengths to measure accuracy ☆99 Updated 11 months ago
- ☆87 Updated last year
- Evaluating LLMs with CommonGen-Lite ☆89 Updated 11 months ago
- End-to-End LLM Guide ☆103 Updated 8 months ago
- Fun project: LLM powered RAG Discord Bot that works seamlessly on CPU ☆32 Updated last year
- Just a bunch of benchmark logs for different LLMs ☆119 Updated 7 months ago
- Testing speed and accuracy of RAG with, and without Cross Encoder Reranker. ☆48 Updated last year
- Python SDK for experimenting, testing, evaluating & monitoring LLM-powered applications - Parea AI (YC S23) ☆75 Updated last month
- ☆76 Updated 9 months ago
- Dataset Viber is your chill repo for data collection, annotation and vibe checks. ☆47 Updated 6 months ago
- Using open source LLMs to build synthetic datasets for direct preference optimization ☆58 Updated last year