pacman100 / openhathi_instruct
This repository contains the code for dataset curation and fine-tuning of the instruct variant of the bilingual OpenHathi model. The resulting model is meant to follow instructions and chat in Hindi and Hinglish.
☆23Updated 2 years ago
Alternatives and similar repositories for openhathi_instruct
Users who are interested in openhathi_instruct are comparing it to the repositories listed below.
- Fully fine-tune large models like Mistral, Llama-2-13B, or Qwen-14B completely for free☆233Updated last year
- A set of scripts and notebooks on LLM fine-tuning and dataset creation☆114Updated last year
- Minimal example scripts of the Hugging Face Trainer, focused on staying under 150 lines☆196Updated last year
- Repository for fine-tuning gemma models using unsloth for indic languages☆97Updated last year
- Code repository for "Introducing Airavata: Hindi Instruction-tuned LLM"☆63Updated last year
- A lightweight evaluation suite tailored specifically for assessing Indic LLMs across a diverse range of tasks☆38Updated last year
- Low latency, High Accuracy, Custom Query routers for Humans and Agents. Built by Prithivi Da☆119Updated 9 months ago
- LLM_library is a comprehensive repository that serves as a one-stop resource for hands-on code and insightful summaries.☆69Updated 2 years ago
- Lightweight demos for finetuning LLMs. Powered by 🤗 transformers and open-source datasets.☆77Updated last year
- Fine-tune an LLM to perform batch inference and online serving.☆117Updated 7 months ago
- 🕹️ Performance Comparison of MLOps Engines, Frameworks, and Languages on Mainstream AI Models.☆138Updated last year
- A simple, consistent and extendable toolkit for IndicTrans2. (Pypi: https://pypi.org/project/indictranstoolkit)☆37Updated 5 months ago
- ☆210Updated 6 months ago
- Official repo for the paper PHUDGE: Phi-3 as Scalable Judge. Evaluate your LLMs with or without custom rubric, reference answer, absolute…☆51Updated last year
- The Batched API provides a flexible and efficient way to process multiple requests in a batch, with a primary focus on dynamic batching o…☆154Updated 6 months ago
- Efficient vector database for hundreds of millions of embeddings.☆211Updated last year
- Starter pack for NeurIPS LLM Efficiency Challenge 2023.☆129Updated 2 years ago
- Manage scalable open LLM inference endpoints in Slurm clusters☆278Updated last year
- Toolkit for attaching, training, saving and loading of new heads for transformer models☆294Updated 10 months ago
- Let's build better datasets, together!☆269Updated last year
- Domain Adapted Language Modeling Toolkit - E2E RAG☆333Updated last year
- Comprehensive analysis of difference in performance of QLora, Lora, and Full Finetunes.☆83Updated 2 years ago
- Just a bunch of benchmark logs for different LLMs☆119Updated last year
- ☆161Updated last year
- Mistral + Haystack: build RAG pipelines that rock 🤘☆106Updated last year
- ☆94Updated 2 years ago
- ☆147Updated last year
- Doing simple retrieval from LLM models at various context lengths to measure accuracy☆107Updated 3 months ago
- experiments with inference on llama☆103Updated last year
- Simple UI for debugging correlations of text embeddings☆305Updated 7 months ago