An LLM cookbook for building your own model from scratch, all the way from gathering data to training a model
☆172Jun 25, 2024Updated last year
Alternatives and similar repositories for SmallLanguageModel
Users that are interested in SmallLanguageModel are comparing it to the libraries listed below.
- Large Visual Language Model (LVLM), Large Language Model (LLM), Multimodal Large Language Model (MLLM), Alignment, Agent, AI System, Survey☆21Jul 27, 2025Updated 10 months ago
- Latent Large Language Models☆19Aug 24, 2024Updated last year
- Radiantloom Email Assist 7B is an email-assistant large language model fine-tuned from Zephyr-7B-Beta, over a custom-curated dataset of 1…☆14Jan 19, 2024Updated 2 years ago
- ☆45Oct 13, 2023Updated 2 years ago
- Testing and evaluating the capabilities of Vision-Language models (PaliGemma) in performing computer vision tasks such as object detectio…☆88May 29, 2024Updated 2 years ago
- Solving data for LLMs - Create quality synthetic datasets!☆151Jan 20, 2025Updated last year
- Code for "What really matters in matrix-whitening optimizers?"☆24Oct 31, 2025Updated 7 months ago
- A conditionally adapted protein language model for the generation of enzymes☆24Nov 26, 2024Updated last year
- Setu is a comprehensive pipeline designed to clean, filter, and deduplicate diverse data sources including Web, PDF, and Speech data. Bui…☆16May 17, 2024Updated 2 years ago
- Portfolio REgret for Confidence SEquences☆21Jan 6, 2026Updated 5 months ago
- Real-world AI engineering dataset creation, SFT fine-tuning, and GRPO alignment ETL pipeline.☆34Aug 27, 2025Updated 9 months ago
- CaptionBot: sequence-to-sequence modelling where the encoder is a CNN (ResNet-50) and the decoder is an LSTMCell with a soft attention mechanism☆51Nov 2, 2021Updated 4 years ago
- Official Code for What Makes and Breaks Safety Fine-tuning? A Mechanistic Study (NeurIPS 2024)☆12Oct 31, 2024Updated last year
- Code for the EACL 2024 paper: "Small Language Models Improve Giants by Rewriting Their Outputs"☆12Apr 20, 2024Updated 2 years ago
- A Pipe-Friendly Image Calculator☆14Mar 3, 2022Updated 4 years ago
- A script that republishes any tweets that start with "@username", where "username" is an arbitrary username that you supply.☆14Jun 9, 2013Updated 13 years ago
- The Benefits of a Concise Chain of Thought on Problem Solving in Large Language Models☆25Nov 25, 2024Updated last year
- defaultMODE is a Python framework for creating Discord AI agents with persistent memory and evolving behavior through brain-inspired sele…☆13Apr 21, 2026Updated last month
- ☆32Jul 5, 2024Updated last year
- 5x faster, 60% less memory QLoRA finetuning☆21May 28, 2024Updated 2 years ago
- ☆161Dec 2, 2024Updated last year
- RAG example using DSPy, Gradio, FastAPI☆92Apr 11, 2024Updated 2 years ago
- 6,080-param transformer achieving 100% accuracy on 10-digit addition. Trained from scratch in 10 minutes.☆22Feb 19, 2026Updated 3 months ago
- A stable, fast and easy-to-use inference library with a focus on a sync-to-async API☆48Sep 26, 2024Updated last year
- Build a Streamlit Chatbot using Langchain, ColBERT, Ragatouille, and ChromaDB☆121Jan 28, 2024Updated 2 years ago
- Code, notebooks, and other material for FuturePath AI's training course on Generative AI☆13Apr 24, 2025Updated last year
- Collection of autoregressive model implementations☆85Feb 23, 2026Updated 3 months ago
- Python script to archive Tweets☆12Oct 2, 2012Updated 13 years ago
- Don't bug your friends with articles they'll never read. AIs have infinite attention, so leverage them instead! Use the curation buddy to e…☆21May 2, 2024Updated 2 years ago
- MIDI-to-DAC converter for the Korg Monotron synthesizer☆10Dec 11, 2015Updated 10 years ago
- R package for Byte Pair Encoding based on YouTokenToMe☆16May 20, 2026Updated 2 weeks ago
- ☆13Jan 20, 2023Updated 3 years ago
- Minimal open-source implementation of AlphaProof and HyperTree Proof Search.☆87May 13, 2026Updated 3 weeks ago
- Model Behavior Study Group☆30May 22, 2026Updated 2 weeks ago
- This repo has all the basics you'll need in order to understand the complete vision transformer architecture and its various implementa…☆228Jan 2, 2025Updated last year
- Code for evaluating with Flow-Judge-v0.1 - an open-source, lightweight (3.8B) language model optimized for LLM system evaluations. Crafte…☆86Oct 29, 2024Updated last year
- Empowering RAG with a versatile model-driven data interface for all-purpose applications!☆17Sep 10, 2024Updated last year
- Gradio application using LLMs to generate csv/apkg to aid with memorizing topics in Anki☆24Jun 3, 2026Updated last week
- Full stack advanced chatbot over LlamaIndex.TS documentation with preview feature using Multi-documents-agents, bootstrapped with create-…☆155Mar 10, 2024Updated 2 years ago