An LLM cookbook for building your own model from scratch, all the way from gathering data to training a model
☆171 · Jun 25, 2024 · Updated last year
Alternatives and similar repositories for SmallLanguageModel
Users that are interested in SmallLanguageModel are comparing it to the libraries listed below.
- Training Small Language Model ☆28 · Dec 26, 2023 · Updated 2 years ago
- A framework for few-shot evaluation of autoregressive language models. ☆16 · Aug 23, 2023 · Updated 2 years ago
- Large Visual Language Model (LVLM), Large Language Model (LLM), Multimodal Large Language Model (MLLM), Alignment, Agent, AI System, Survey ☆21 · Jul 27, 2025 · Updated 9 months ago
- Seamless Voice Interactions with LLMs ☆12 · Oct 28, 2023 · Updated 2 years ago
- Latent Large Language Models ☆19 · Aug 24, 2024 · Updated last year
- Radiantloom Email Assist 7B is an email-assistant large language model fine-tuned from Zephyr-7B-Beta, over a custom-curated dataset of 1… ☆14 · Jan 19, 2024 · Updated 2 years ago
- ☆45 · Oct 13, 2023 · Updated 2 years ago
- Testing and evaluating the capabilities of Vision-Language models (PaliGemma) in performing computer vision tasks such as object detectio… ☆88 · May 29, 2024 · Updated last year
- Solving data for LLMs - Create quality synthetic datasets! ☆151 · Jan 20, 2025 · Updated last year
- a set of scripts to easily convert all training data from huggingface into alpaca instruct or sharegpt format, which should allow for eas… ☆19 · Mar 14, 2025 · Updated last year
- Code for "What really matters in matrix-whitening optimizers?" ☆23 · Oct 31, 2025 · Updated 6 months ago
- GSoC '17 - R language bindings for TensorFlow ☆13 · Sep 18, 2017 · Updated 8 years ago
- ☆21 · Jan 10, 2024 · Updated 2 years ago
- Code for "Accelerating Training with Neuron Interaction and Nowcasting Networks" [ICLR 2025] ☆28 · Feb 20, 2026 · Updated 2 months ago
- Portfolio REgret for Confidence SEquences ☆21 · Jan 6, 2026 · Updated 3 months ago
- Real-world AI engineering dataset creation, SFT fine-tuning, and GRPO alignment ETL pipeline. ☆33 · Aug 27, 2025 · Updated 8 months ago
- Flax (JAX) implementation of Progressive Growing of GANs for Improved Quality, Stability, and Variation ☆12 · May 24, 2021 · Updated 4 years ago
- A Pipe-Friendly Image Calculator ☆14 · Mar 3, 2022 · Updated 4 years ago
- ☆35 · Jul 5, 2023 · Updated 2 years ago
- The Benefits of a Concise Chain of Thought on Problem Solving in Large Language Models ☆25 · Nov 25, 2024 · Updated last year
- ☆32 · Jul 5, 2024 · Updated last year
- 5X faster 60% less memory QLoRA finetuning ☆21 · May 28, 2024 · Updated last year
- Modern Stable Diffusion models family - Fluently ☆32 · Jun 6, 2024 · Updated last year
- RAG example using DSPy, Gradio, FastAPI ☆92 · Apr 11, 2024 · Updated 2 years ago
- 6,080-param transformer achieving 100% accuracy on 10-digit addition. Trained from scratch in 10 minutes. ☆22 · Feb 19, 2026 · Updated 2 months ago
- Autoregressive Image Generation ☆31 · Jun 13, 2025 · Updated 10 months ago
- A stable, fast and easy-to-use inference library with a focus on a sync-to-async API ☆48 · Sep 26, 2024 · Updated last year
- Build a Streamlit Chatbot using Langchain, ColBERT, Ragatouille, and ChromaDB ☆121 · Jan 28, 2024 · Updated 2 years ago
- NLP model that predicts subreddit based on the title of a post ☆32 · Mar 22, 2023 · Updated 3 years ago
- Don't bug your friends with articles they'll never read. AI's have infinite attention, leverage them instead! Use the curation buddy to e… ☆21 · May 2, 2024 · Updated 2 years ago
- Chatbot framework powered by regular expressions ☆10 · Feb 25, 2019 · Updated 7 years ago
- MoCo: A One-Stop Shop for Model Collaboration Research ☆53 · Feb 24, 2026 · Updated 2 months ago
- This repo has all the basic things you'll need in-order to understand complete vision transformer architecture and its various implementa… ☆228 · Jan 2, 2025 · Updated last year
- Gradio application using LLMs to generate csv/apkg to aid with memorizing topics in Anki ☆25 · Apr 1, 2026 · Updated last month
- Full stack advanced chatbot over LlamaIndex.TS documentation with preview feature using Multi-documents-agents, bootstrapped with create-… ☆155 · Mar 10, 2024 · Updated 2 years ago
- Effective Unsupervised Domain Adaptation of Neural Rankers by Diversifying Synthetic Query Generation ☆15 · Apr 23, 2025 · Updated last year
- Machine Learning Serving focused on GenAI with simplicity as the top priority. ☆59 · Apr 6, 2026 · Updated 3 weeks ago
- [ACL 2025] Official implementation of the "CoT-ICL Lab" framework ☆11 · Updated this week
- THOUGHTSCULPT, a general reasoning and search method for complex tasks ☆13 · Dec 13, 2024 · Updated last year