So, I trained a Llama a 130M architecture I coded from ground up to build a small instruct model from scratch. Trained on FineWeb dataset form HuggingFace consisting of 15 M texts (10BT snapshot) for a total of full 3 epochs
☆17Mar 26, 2025Updated last year
Alternatives and similar repositories for SmolLlama
Users that are interested in SmolLlama are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Learning notes and code from CS 329S: Machine Learning Systems Design series.☆23Jun 23, 2025Updated 11 months ago
- Fine-Tuning Llama3-8B LLM in a multi-GPU environment using DeepSpeed☆21May 27, 2024Updated 2 years ago
- A synthesizer made in C#☆15Jan 31, 2021Updated 5 years ago
- This repository includes examples of using Microsoft Semantic Kernel with local LLMS via Ollama☆10May 14, 2024Updated 2 years ago
- This is a simple example of how to serve a DeepSeek model with Azure ML.☆10Feb 10, 2025Updated last year
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- Serverless RAG application with LlamaIndex and code interperter on Azure Container Apps☆12Jan 30, 2026Updated 3 months ago
- Semantic Kernel connector for ONNX models.☆12Jun 10, 2024Updated last year
- An Efficent BPE Algorithm Faster then Hugging Face Tokenizer's Implementation☆13Sep 9, 2024Updated last year
- https://github.com/juliakorea/translate-doc 로 옮깁니다☆10Nov 21, 2017Updated 8 years ago
- BASI is the first-ever smart contract created by autonomous AI agents. The token was deployed to ETH mainnet on 6/6/23.☆25Nov 22, 2024Updated last year
- This GUI aims to simplify the process of converting GGUF files to llamafile format by providing an intuitive and convenient way for users…☆14Jan 2, 2026Updated 4 months ago
- 🔍 Code Search Tools & Experiments☆12May 18, 2026Updated last week
- Agent CLI☆13May 20, 2026Updated last week
- ☆12Dec 20, 2024Updated last year
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- ☆22Nov 4, 2024Updated last year
- A guide to structured generation using constrained decoding☆18Jun 9, 2024Updated last year
- Implementation of CoLA: Compute-Efficient Pre-Training of LLMs via Low-Rank Activation☆26Feb 18, 2025Updated last year
- Arrakis is a library to conduct, track and visualize mechanistic interpretability experiments.☆31Apr 14, 2026Updated last month
- This repo contains starter code for the learning path module exercises.☆21Jan 21, 2026Updated 4 months ago
- Real time faster whisper gradio☆25Aug 17, 2025Updated 9 months ago
- Template repository of a machine-learning Python project powered by FastAPI and PyTorch☆15Aug 26, 2021Updated 4 years ago
- ChatBot App built using LangChain and Lightning AI☆16Mar 4, 2023Updated 3 years ago
- starter kit of sunmao-ui☆22Jan 9, 2023Updated 3 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- JAX port of FLUX.1 models using flax.nnx☆23Sep 28, 2024Updated last year
- P5js sketches (Processing for JavaScript)☆18Jan 21, 2026Updated 4 months ago
- ☆23Jun 6, 2025Updated 11 months ago
- Samples of good AI generated CUDA kernels☆105May 30, 2025Updated 11 months ago
- Jarvis made by Kaushik Shresth Reverse Engineered by Likhi☆15Feb 16, 2025Updated last year
- ☆14Aug 29, 2023Updated 2 years ago
- This chat application is built using .NET Aspire and uses Semantic Kernel to connect to locally running Phi-3 model by Ollama to respond …☆21May 29, 2024Updated 2 years ago
- LCM OpenVINO model converter☆24Mar 27, 2024Updated 2 years ago
- Simply drag and drop your PDF files into Preve to get started. Ask Preve questions about your document. Get Summaries, key points, specif…☆11Apr 9, 2026Updated last month
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- A small streamlit app to visualize the output of sentence clustering☆14Dec 15, 2020Updated 5 years ago
- A collection of awesome lists that are about a variety of different topics.☆45May 5, 2026Updated 3 weeks ago
- A NodeJS application to upload, watch and stream live videos.☆12Jan 24, 2023Updated 3 years ago
- Rust FTL + WebRTC live streaming software.☆13Mar 12, 2022Updated 4 years ago
- This is the official implementation of the voxel-based humanoid locomotion in "Gallant: Voxel Grid-based Humanoid Locomotion and Local-na…☆66Apr 24, 2026Updated last month
- ☆40Feb 18, 2024Updated 2 years ago
- A tool to help you generate java call graph.☆10Apr 14, 2021Updated 5 years ago