shivendrra / SmallLanguageModelLinks
a LLM cookbook, for building your own from scratch, all the way from gathering data to training a model
☆155Updated last year
Alternatives and similar repositories for SmallLanguageModel
Users that are interested in SmallLanguageModel are comparing it to the libraries listed below
Sorting:
- Solving data for LLMs - Create quality synthetic datasets!☆150Updated 7 months ago
- ☆75Updated 11 months ago
- ☆86Updated 11 months ago
- Build a Streamlit Chatbot using Langchain, ColBERT, Ragatouille, and ChromaDB☆122Updated last year
- ☆102Updated last year
- An automated tool for discovering insights from research papaer corpora☆139Updated last year
- Testing and evaluating the capabilities of Vision-Language models (PaliGemma) in performing computer vision tasks such as object detectio…☆84Updated last year
- MLX port for xjdr's entropix sampler (mimics jax implementation)☆62Updated 10 months ago
- ☆54Updated 3 weeks ago
- Repository for fine-tuning gemma models using unsloth for indic languages☆96Updated last year
- ☆127Updated 5 months ago
- Finetune Llama-3-8b on the MathInstruct dataset☆111Updated 11 months ago
- A lightweight evaluation suite tailored specifically for assessing Indic LLMs across a diverse range of tasks☆38Updated last year
- Following master Karpathy with GPT-2 implementation and training, writing lots of comments cause I have memory of a goldfish☆172Updated last year
- LLaMA 3 is one of the most promising open-source model after Mistral, we will recreate it's architecture in a simpler manner.☆184Updated last year
- Various installation guides for Large Language Models☆74Updated 4 months ago
- Building a chatbot powered with a RAG pipeline to read,summarize and quote the most relevant papers related to the user query.☆168Updated last year
- ☆116Updated 9 months ago
- Official homepage for "Self-Harmonized Chain of Thought" (NAACL 2025)☆92Updated 7 months ago
- ☆68Updated 3 months ago
- Fully fine-tune large models like Mistral, Llama-2-13B, or Qwen-14B completely for free☆232Updated 10 months ago
- ☆62Updated 10 months ago
- Cerule - A Tiny Mighty Vision Model☆68Updated last year
- ☆88Updated last year
- tiny_fnc_engine is a minimal python library that provides a flexible engine for calling functions extracted from a LLM.☆38Updated last year
- Port of Andrej Karpathy's nanoGPT to Apple MLX framework.☆112Updated last year
- One click away from a locally downloaded, fine-tuned model, hosted on hugging face, with inference built in. In two hours.☆22Updated 6 months ago
- Just a bunch of benchmark logs for different LLMs☆119Updated last year
- Data extraction with LLM on CPU☆112Updated last year
- One click templates for inferencing Language Models☆213Updated last month