pacman100 / openhathi_instruct
This repository contains the code for dataset curation and fine-tuning of the instruct variant of the bilingual OpenHathi model. The resulting model is meant to follow instructions and chat in Hindi and Hinglish.
☆23Updated last year
Alternatives and similar repositories for openhathi_instruct
Users who are interested in openhathi_instruct are comparing it to the repositories listed below.
- Minimal example scripts of the Hugging Face Trainer, focused on staying under 150 lines☆197Updated last year
- Fully fine-tune large models like Mistral, Llama-2-13B, or Qwen-14B completely for free☆232Updated 10 months ago
- A set of scripts and notebooks on LLM finetuning and dataset creation☆110Updated 11 months ago
- Code repository for "Introducing Airavata: Hindi Instruction-tuned LLM"☆60Updated 10 months ago
- Repository for fine-tuning gemma models using unsloth for indic languages☆96Updated last year
- A blueprint for creating Pretraining and Fine-Tuning datasets for Indic languages☆113Updated 11 months ago
- Low latency, High Accuracy, Custom Query routers for Humans and Agents. Built by Prithivi Da☆117Updated 5 months ago
- A lightweight evaluation suite tailored specifically for assessing Indic LLMs across a diverse range of tasks☆38Updated last year
- 🕹️ Performance Comparison of MLOps Engines, Frameworks, and Languages on Mainstream AI Models.☆137Updated last year
- Toolkit for attaching, training, saving and loading of new heads for transformer models☆286Updated 6 months ago
- Fine-tune an LLM to perform batch inference and online serving.☆112Updated 3 months ago
- A simple, consistent and extendable toolkit for IndicTrans2. (Pypi: https://pypi.org/project/indictranstoolkit)☆36Updated last month
- The Batched API provides a flexible and efficient way to process multiple requests in a batch, with a primary focus on dynamic batching o…☆147Updated 2 months ago
- A Python wrapper around HuggingFace's TGI (text-generation-inference) and TEI (text-embedding-inference) servers.☆33Updated last week
- ☆210Updated 2 months ago
- Official repo for the paper PHUDGE: Phi-3 as Scalable Judge. Evaluate your LLMs with or without custom rubric, reference answer, absolute…☆49Updated last year
- Starter pack for NeurIPS LLM Efficiency Challenge 2023.☆125Updated 2 years ago
- Let's build better datasets, together!☆263Updated 8 months ago
- experiments with inference on llama☆104Updated last year
- A comprehensive deep dive into the world of tokens☆226Updated last year
- Solving data for LLMs - Create quality synthetic datasets!☆150Updated 7 months ago
- Complete implementation of Llama2 with/without KV cache & inference 🚀☆48Updated last year
- Lightweight demos for finetuning LLMs. Powered by 🤗 transformers and open-source datasets.☆78Updated 10 months ago
- LLM_library is a comprehensive repository that serves as a one-stop resource for hands-on code and insightful summaries.☆69Updated last year
- Mistral + Haystack: build RAG pipelines that rock 🤘☆105Updated last year
- ☆155Updated 9 months ago
- Simple UI for debugging correlations of text embeddings☆291Updated 3 months ago
- Domain Adapted Language Modeling Toolkit - E2E RAG☆328Updated 10 months ago
- Various installation guides for Large Language Models☆74Updated 4 months ago
- Banishing LLM Hallucinations Requires Rethinking Generalization☆276Updated last year