YuvrajSingh-mist / SmolLlamaView external linksLinks
So, I trained a Llama a 130M architecture I coded from ground up to build a small instruct model from scratch. Trained on FineWeb dataset form HuggingFace consisting of 15 M texts (10BT snapshot) for a total of full 3 epochs
☆16Mar 26, 2025Updated 10 months ago
Alternatives and similar repositories for SmolLlama
Users that are interested in SmolLlama are comparing it to the libraries listed below
Sorting:
- Coding Agent CLI☆14Updated this week
- An Efficent BPE Algorithm Faster then Hugging Face Tokenizer's Implementation☆13Sep 9, 2024Updated last year
- Simply drag and drop your PDF files into Preve to get started. Ask Preve questions about your document. Get Summaries, key points, specif…☆11Apr 5, 2025Updated 10 months ago
- Reverse engineered Twitter's API☆13Nov 28, 2023Updated 2 years ago
- Generative AI app for Lost and Found belonggins using Open AI clip-vit-large to create image embeddings and search them using Natural Lan…☆10Jul 15, 2024Updated last year
- This is a simple example of how to serve a DeepSeek model with Azure ML.☆10Feb 10, 2025Updated last year
- A guide to structured generation using constrained decoding☆14Jun 9, 2024Updated last year
- AutoLog: Anomaly Detection by Deep Autoencoding of System Logs☆12Oct 28, 2021Updated 4 years ago
- Rust FTL + WebRTC live streaming software.☆14Mar 12, 2022Updated 3 years ago
- Notes of ADRL course taught at IISC as part of MTech AI curriculum☆13Nov 30, 2024Updated last year
- A Python program that adds A.I voice over to Lord of The Rings Online quests by using OCR.☆18Nov 19, 2025Updated 2 months ago
- This is the official repo of Text Summarizer Streamlit App video from AI Anytime YouTube channel.☆16Mar 21, 2024Updated last year
- 🔍 Code Search Tools & Experiments☆12Dec 29, 2025Updated last month
- https://github.com/juliakorea/translate-doc 로 옮깁니다☆10Nov 21, 2017Updated 8 years ago
- AI-powered self-interview preparation platform. This platform will use the magic of AI and language processing to simulate real intervie…☆18Jul 29, 2023Updated 2 years ago
- Optimizing diffusion for production-ready speeds☆34Jan 10, 2026Updated last month
- Multimodal RAG using LlamaIndex, Qdrant, llama.cpp for document QA with local VisonLLM and embedding models☆17Nov 8, 2024Updated last year
- An end-to-end video transcoding system that efficiently generates different resolutions of uploaded videos, enhancing accessibility and p…☆14Nov 4, 2024Updated last year
- Text summation using python, deep learning, machine learning, transformer, huggingface, openai and langchain☆13Nov 26, 2024Updated last year
- This GUI aims to simplify the process of converting GGUF files to llamafile format by providing an intuitive and convenient way for users…☆14Jan 2, 2026Updated last month
- WhisperMesh is an advanced chatbot that integrates voice and text interactions, delivering personalized responses through LLM models and …☆15Apr 23, 2025Updated 9 months ago
- https://openreview.net/forum?id=OC1o4_OI6Jw☆13May 27, 2022Updated 3 years ago
- ☆13Jun 6, 2022Updated 3 years ago
- An AI-powered assistant using LLM with voice and text query support.☆17Jul 30, 2025Updated 6 months ago
- ☆12Dec 20, 2024Updated last year
- GitHub repositories and users recommendations by embeddings☆17Nov 21, 2022Updated 3 years ago
- LoRA supervised fine-tuning, RLHF (PPO) and RAG with llama-3-8B on the TLDR summarization dataset☆14Feb 2, 2025Updated last year
- An LLM-based app to easily track calories and exercise by taking a photo of your meal or describing your physical activity☆17Oct 31, 2025Updated 3 months ago
- Matching The Statements: A Simple and Accurate Model for Key Point Analysis (ArgMining | EMNLP 2021)☆12Feb 11, 2022Updated 4 years ago
- Cross language information retrieval pipeline☆19Jan 12, 2026Updated last month
- image retrieval/tagging with CLIP☆13Jul 13, 2024Updated last year
- Alpha-Zero Connect Four NN trained via self play☆25Mar 7, 2025Updated 11 months ago
- ☆14Apr 26, 2024Updated last year
- Contrastive Dialogue Disentanglement via Clustering☆12Apr 26, 2023Updated 2 years ago
- Samples of good AI generated CUDA kernels☆99May 30, 2025Updated 8 months ago
- The Real time emotion recognition model will return the emotion predicted in real time. The model classifies face as stressed and not str…☆15Jun 22, 2022Updated 3 years ago
- Document Summarization App using large language model (LLM) and Langchain framework. Used a pre-trained T5 model and its tokenizer from H…☆13Oct 5, 2023Updated 2 years ago
- React-based reader and editor for creating notes and flashcards directly from PDF documents.☆16Apr 23, 2024Updated last year
- 가벼운 멀티에이전트 오케스트레이션을 탐구하는 교육 프레임워크입니다. OpenAI 솔루션 팀에서 관리합니다.☆16Oct 20, 2024Updated last year