pacman100 / openhathi_instruct
This repository contains the code for dataset curation and fine-tuning of the instruct variant of the bilingual OpenHathi model. The resulting model is meant to follow instructions and chat in Hindi and Hinglish.
☆23 · Updated last year
Alternatives and similar repositories for openhathi_instruct:
Users who are interested in openhathi_instruct are comparing it to the repositories listed below:
- Low latency, High Accuracy, Custom Query routers for Humans and Agents. Built by Prithivi Da ☆101 · Updated last week
- Code repository for "Introducing Airavata: Hindi Instruction-tuned LLM" ☆58 · Updated 4 months ago
- Official repo for the paper PHUDGE: Phi-3 as Scalable Judge. Evaluate your LLMs with or without custom rubric, reference answer, absolute… ☆49 · Updated 8 months ago
- A set of scripts and notebooks on LLM finetuning and dataset creation ☆105 · Updated 5 months ago
- Minimal example scripts of the Hugging Face Trainer, focused on staying under 150 lines ☆198 · Updated 10 months ago
- Lightweight wrapper for the independent implementation of SPLADE++ models for search & retrieval pipelines. Models and Library created by… ☆29 · Updated 6 months ago
- The Batched API provides a flexible and efficient way to process multiple requests in a batch, with a primary focus on dynamic batching o… ☆125 · Updated 3 months ago
- Starter pack for NeurIPS LLM Efficiency Challenge 2023. ☆124 · Updated last year
- 🕹️ Performance Comparison of MLOps Engines, Frameworks, and Languages on Mainstream AI Models. ☆136 · Updated 7 months ago
- Repository for fine-tuning gemma models using unsloth for indic languages ☆89 · Updated last year
- A blueprint for creating Pretraining and Fine-Tuning datasets for Indic languages ☆105 · Updated 5 months ago
- End-to-End LLM Guide ☆104 · Updated 8 months ago
- Lightweight demos for finetuning LLMs. Powered by 🤗 transformers and open-source datasets. ☆73 · Updated 5 months ago
- Fully fine-tune large models like Mistral, Llama-2-13B, or Qwen-14B completely for free ☆230 · Updated 4 months ago
- A simple, consistent and extendable toolkit for IndicTrans2 ☆24 · Updated 2 weeks ago
- Comprehensive analysis of difference in performance of QLora, Lora, and Full Finetunes. ☆82 · Updated last year
- ☆92 · Updated last year
- Chunk your text using gpt4o-mini more accurately ☆44 · Updated 7 months ago
- Doing simple retrieval from LLM models at various context lengths to measure accuracy ☆99 · Updated 11 months ago
- Complete implementation of Llama2 with/without KV cache & inference 🚀 ☆47 · Updated 9 months ago
- ☆76 · Updated 9 months ago
- A Python wrapper around HuggingFace's TGI (text-generation-inference) and TEI (text-embedding-inference) servers. ☆34 · Updated 3 months ago
- Generalist and Lightweight Model for Text Classification ☆92 · Updated this week
- experiments with inference on llama ☆104 · Updated 9 months ago
- ☆120 · Updated 4 months ago
- Check for data drift between two OpenAI multi-turn chat jsonl files. ☆38 · Updated 11 months ago
- ☆113 · Updated 5 months ago
- Just a bunch of benchmark logs for different LLMs ☆119 · Updated 7 months ago
- ☆142 · Updated 8 months ago
- Initiative to evaluate and rank the most popular LLMs across common task types based on their propensity to hallucinate. ☆107 · Updated 6 months ago