pacman100 / openhathi_instructLinks
This repository contains the code for dataset curation and finetuning of instruct variant of the Bilingual OpenHathi model. The resulting model is meant to follow instructions and chat in Hindi and Hinglish.
☆23Updated last year
Alternatives and similar repositories for openhathi_instruct
Users that are interested in openhathi_instruct are comparing it to the libraries listed below
Sorting:
- A blueprint for creating Pretraining and Fine-Tuning datasets for Indic languages☆107Updated 10 months ago
- Code repository for "Introducing Airavata: Hindi Instruction-tuned LLM"☆61Updated 10 months ago
- Low latency, High Accuracy, Custom Query routers for Humans and Agents. Built by Prithivi Da☆113Updated 4 months ago
- Repository for fine-tuning gemma models using unsloth for indic languages☆95Updated last year
- Minimal example scripts of the Hugging Face Trainer, focused on staying under 150 lines☆197Updated last year
- A set of scripts and notebooks on LLM finetunning and dataset creation☆110Updated 10 months ago
- Fully fine-tune large models like Mistral, Llama-2-13B, or Qwen-14B completely for free☆232Updated 9 months ago
- Official repo for the paper PHUDGE: Phi-3 as Scalable Judge. Evaluate your LLMs with or without custom rubric, reference answer, absolute…☆49Updated last year
- A lightweight evaluation suite tailored specifically for assessing Indic LLMs across a diverse range of tasks☆37Updated last year
- Starter pack for NeurIPS LLM Efficiency Challenge 2023.☆125Updated last year
- A simple, consistent and extendable toolkit for IndicTrans2. (Pypi: https://pypi.org/project/indictranstoolkit)☆35Updated last month
- Lightweight demos for finetuning LLMs. Powered by 🤗 transformers and open-source datasets.☆78Updated 10 months ago
- Fine-tune an LLM to perform batch inference and online serving.☆112Updated 2 months ago
- Let's build better datasets, together!☆261Updated 8 months ago
- Toolkit for attaching, training, saving and loading of new heads for transformer models☆285Updated 5 months ago
- A Python wrapper around HuggingFace's TGI (text-generation-inference) and TEI (text-embedding-inference) servers.☆33Updated 3 months ago
- ☆155Updated 8 months ago
- Complete implementation of Llama2 with/without KV cache & inference 🚀☆48Updated last year
- The Batched API provides a flexible and efficient way to process multiple requests in a batch, with a primary focus on dynamic batching o…☆143Updated last month
- List of resources, libraries and more for developers who would like to build with open-source machine learning off-the-shelf☆200Updated last year
- Manage scalable open LLM inference endpoints in Slurm clusters☆270Updated last year
- ☆78Updated last year
- Domain Adapted Language Modeling Toolkit - E2E RAG☆327Updated 9 months ago
- ☆207Updated last year
- [ACL'25] Official Code for LlamaDuo: LLMOps Pipeline for Seamless Migration from Service LLMs to Small-Scale Local LLMs☆314Updated last month
- 🕹️ Performance Comparison of MLOps Engines, Frameworks, and Languages on Mainstream AI Models.☆139Updated last year
- A comprehensive guide to LLM evaluation methods designed to assist in identifying the most suitable evaluation techniques for various use…☆136Updated this week
- Efficient vector database for hundred millions of embeddings.☆207Updated last year
- ☆23Updated 2 years ago
- Just a bunch of benchmark logs for different LLMs☆120Updated last year