pacman100 / openhathi_instructLinks
This repository contains the code for dataset curation and finetuning of instruct variant of the Bilingual OpenHathi model. The resulting model is meant to follow instructions and chat in Hindi and Hinglish.
☆23Updated last year
Alternatives and similar repositories for openhathi_instruct
Users that are interested in openhathi_instruct are comparing it to the libraries listed below
Sorting:
- Minimal example scripts of the Hugging Face Trainer, focused on staying under 150 lines☆195Updated last year
- Fully fine-tune large models like Mistral, Llama-2-13B, or Qwen-14B completely for free☆231Updated last year
- A set of scripts and notebooks on LLM finetunning and dataset creation☆111Updated last year
- Fine-tune an LLM to perform batch inference and online serving.☆113Updated 5 months ago
- Repository for fine-tuning gemma models using unsloth for indic languages☆96Updated last year
- 🕹️ Performance Comparison of MLOps Engines, Frameworks, and Languages on Mainstream AI Models.☆139Updated last year
- Code repository for "Introducing Airavata: Hindi Instruction-tuned LLM"☆61Updated last year
- Low latency, High Accuracy, Custom Query routers for Humans and Agents. Built by Prithivi Da☆117Updated 7 months ago
- A lightweight evaluation suite tailored specifically for assessing Indic LLMs across a diverse range of tasks☆38Updated last year
- Official repo for the paper PHUDGE: Phi-3 as Scalable Judge. Evaluate your LLMs with or without custom rubric, reference answer, absolute…☆50Updated last year
- Toolkit for attaching, training, saving and loading of new heads for transformer models☆290Updated 8 months ago
- Various installation guides for Large Language Models☆76Updated 6 months ago
- A blueprint for creating Pretraining and Fine-Tuning datasets for Indic languages☆236Updated last year
- Starter pack for NeurIPS LLM Efficiency Challenge 2023.☆126Updated 2 years ago
- A simple, consistent and extendable toolkit for IndicTrans2. (Pypi: https://pypi.org/project/indictranstoolkit)☆37Updated 3 months ago
- Domain Adapted Language Modeling Toolkit - E2E RAG☆330Updated last year
- A Python wrapper around HuggingFace's TGI (text-generation-inference) and TEI (text-embedding-inference) servers.☆33Updated last month
- Comprehensive analysis of difference in performance of QLora, Lora, and Full Finetunes.☆82Updated 2 years ago
- Manage scalable open LLM inference endpoints in Slurm clusters☆276Updated last year
- Chunk your text using gpt4o-mini more accurately☆44Updated last year
- experiments with inference on llama☆103Updated last year
- ☆159Updated 11 months ago
- ☆216Updated last year
- ☆94Updated 2 years ago
- Complete implementation of Llama2 with/without KV cache & inference 🚀☆48Updated last year
- Notes from the Latent Space paper club. Follow along or start your own!☆241Updated last year
- List of resources, libraries and more for developers who would like to build with open-source machine learning off-the-shelf☆198Updated last year
- Fun project: LLM powered RAG Discord Bot that works seamlessly on CPU☆31Updated 2 years ago
- Let's build better datasets, together!☆264Updated 10 months ago
- ☆170Updated last year