shivendrra / SmallLanguageModel
a LLM cookbook, for building your own from scratch, all the way from gathering data to training a model
☆143Updated 10 months ago
Alternatives and similar repositories for SmallLanguageModel:
Users that are interested in SmallLanguageModel are comparing it to the libraries listed below
- Solving data for LLMs - Create quality synthetic datasets!☆146Updated 3 months ago
- Fully fine-tune large models like Mistral, Llama-2-13B, or Qwen-14B completely for free☆231Updated 6 months ago
- Build a Streamlit Chatbot using Langchain, ColBERT, Ragatouille, and ChromaDB☆119Updated last year
- ☆74Updated 7 months ago
- Large Language Model (LLM) Inference API and Chatbot☆125Updated last year
- Repository for fine-tuning gemma models using unsloth for indic languages☆92Updated last year
- ☆85Updated 7 months ago
- ☆113Updated 4 months ago
- Testing and evaluating the capabilities of Vision-Language models (PaliGemma) in performing computer vision tasks such as object detectio…☆80Updated 11 months ago
- Complete implementation of Llama2 with/without KV cache & inference 🚀☆46Updated 11 months ago
- Function Calling Benchmark & Testing☆87Updated 9 months ago
- ☆80Updated 3 weeks ago
- FastAPI wrapper around DSPy☆238Updated last year
- Train your own SOTA deductive reasoning model☆91Updated 2 months ago
- Gradio based tool to run opensource LLM models directly from Huggingface☆91Updated 10 months ago
- A lightweight evaluation suite tailored specifically for assessing Indic LLMs across a diverse range of tasks☆35Updated 10 months ago
- a tiny vectorstore implementation built with numpy.☆62Updated last year
- Claude API Test Project☆87Updated last year
- This repository contains the code for dataset curation and finetuning of instruct variant of the Bilingual OpenHathi model. The resultin…☆23Updated last year
- everything i know about cuda and triton☆13Updated 3 months ago
- MLX port for xjdr's entropix sampler (mimics jax implementation)☆64Updated 6 months ago
- Machine Learning Serving focused on GenAI with simplicity as the top priority.☆58Updated last month
- An all-new Language Model That Processes Ultra-Long Sequences of 100,000+ Ultra-Fast☆148Updated 8 months ago
- Own your AI, search the web with it🌐😎☆85Updated 3 months ago
- ☆126Updated last month
- Official repo for the paper PHUDGE: Phi-3 as Scalable Judge. Evaluate your LLMs with or without custom rubric, reference answer, absolute…☆49Updated 9 months ago
- Video+code lecture on building nanoGPT from scratch☆66Updated 10 months ago
- ☆27Updated this week
- An automated tool for discovering insights from research papaer corpora☆138Updated 11 months ago
- Simple examples using Argilla tools to build AI☆52Updated 5 months ago