adithya-s-k / indic_eval
A lightweight evaluation suite tailored specifically for assessing Indic LLMs across a diverse range of tasks
☆35Updated 11 months ago
Alternatives and similar repositories for indic_eval
Users who are interested in indic_eval are comparing it to the libraries listed below.
- Official repo for the paper PHUDGE: Phi-3 as Scalable Judge. Evaluate your LLMs with or without custom rubric, reference answer, absolute…☆49Updated 10 months ago
- Low latency, High Accuracy, Custom Query routers for Humans and Agents. Built by Prithivi Da☆105Updated 2 months ago
- Just a bunch of benchmark logs for different LLMs☆119Updated 10 months ago
- Solving data for LLMs - Create quality synthetic datasets!☆148Updated 4 months ago
- This repository contains the code for dataset curation and finetuning of instruct variant of the Bilingual OpenHathi model. The resultin…☆23Updated last year
- MLX port for xjdr's entropix sampler (mimics jax implementation)☆64Updated 7 months ago
- ☆19Updated 7 months ago
- KMD is a collection of conversational exchanges between patients and doctors on various medical topics. It aims to capture the intricaci…☆24Updated last year
- Repository for fine-tuning gemma models using unsloth for indic languages☆92Updated last year
- Fun project: LLM powered RAG Discord Bot that works seamlessly on CPU☆31Updated last year
- Doing simple retrieval from LLM models at various context lengths to measure accuracy☆99Updated last year
- ☆43Updated 3 months ago
- Training an LLM to use a calculator with multi-turn reinforcement learning, achieving a **62% absolute increase in evaluation accuracy**.☆38Updated 3 weeks ago
- A blueprint for creating Pretraining and Fine-Tuning datasets for Indic languages☆106Updated 7 months ago
- IndicGenBench is a high-quality, multilingual, multi-way parallel benchmark for evaluating Large Language Models (LLMs) on 4 user-facing …☆50Updated 9 months ago
- Sphynx Hallucination Induction☆54Updated 4 months ago
- Fine-tune an LLM to perform batch inference and online serving.☆111Updated 3 weeks ago
- An open-source framework designed to adapt pre-trained Language Models (LLMs), such as Llama, Mistral, and Mixtral, to a wide array of dom…☆13Updated last year
- So, I trained a 130M Llama architecture that I coded from the ground up to build a small instruct model from scratch. Trained on FineWeb dataset…☆14Updated 2 months ago
- ☆23Updated last year
- ☆48Updated last year
- ☆57Updated last week
- Set of scripts to finetune LLMs☆37Updated last year
- ☆19Updated 9 months ago
- Optimizing Causal LMs through GRPO with weighted reward functions and automated hyperparameter tuning using Optuna☆53Updated 4 months ago
- A Python wrapper around HuggingFace's TGI (text-generation-inference) and TEI (text-embedding-inference) servers.☆34Updated 3 weeks ago
- Cerule - A Tiny Mighty Vision Model☆66Updated 8 months ago
- ☆19Updated last year
- 🦾💻🌐 distributed training & serverless inference at scale on RunPod☆17Updated last year
- Lightweight wrapper for the independent implementation of SPLADE++ models for search & retrieval pipelines. Models and Library created by…☆31Updated 9 months ago