huggingface / smollm
Everything about the SmolLM & SmolLM2 family of models
☆1,554Updated last week
Alternatives and similar repositories for smollm:
Users that are interested in smollm are comparing it to the libraries listed below
- Things you can do with the token embeddings of an LLM☆1,411Updated last week
- MobileLLM Optimizing Sub-billion Parameter Language Models for On-Device Use Cases. In ICML 2024.☆1,217Updated last month
- Local realtime voice AI☆2,162Updated this week
- open-source framework for creating and managing simulations populated with AI-powered agents. It provides an intuitive platform for desig…☆837Updated last week
- 🦛 CHONK your texts with Chonkie ✨ - The no-nonsense RAG chunking library☆2,249Updated this week
- Optimizing inference proxy for LLMs☆1,926Updated this week
- A complete end-to-end pipeline for LLM interpretability with sparse autoencoders (SAEs) using Llama 3.2, written in pure PyTorch and full…☆605Updated last month
- ☆590Updated last month
- Run PyTorch LLMs locally on servers, desktop and mobile☆3,462Updated this week
- Recipes for shrinking, optimizing, customizing cutting edge vision models. 💜☆1,093Updated 3 weeks ago
- MLX-VLM is a package for inference and fine-tuning of Vision Language Models (VLMs) on your Mac using MLX.☆706Updated this week
- A framework for serving and evaluating LLM routers - save LLM costs without compromising quality☆3,442Updated 5 months ago
- ☆664Updated this week
- LLaMA-Omni is a low-latency and high-quality end-to-end speech interaction model built upon Llama-3.1-8B-Instruct, aiming to achieve spee…☆2,746Updated 2 months ago
- A lightweight task engine for building stateful AI agents that prioritizes simplicity and flexibility.☆837Updated last week
- A lightweight, low-dependency, unified API to use all common reranking and cross-encoder models.☆1,238Updated last month
- Distilabel is a framework for synthetic data and AI feedback for engineers who need fast, reliable and scalable pipelines based on verifi…☆1,879Updated this week
- Bringing BERT into modernity via both architecture changes and scaling☆1,045Updated last week
- nanoGPT style version of Llama 3.1☆1,290Updated 5 months ago
- The code used to train and run inference with the ColPali architecture.☆1,386Updated this week
- NanoGPT (124M) in 3.4 minutes☆2,068Updated last week
- LLM Analytics☆634Updated 2 months ago
- ☆2,802Updated 4 months ago
- Large Concept Models: Language modeling in a sentence representation space☆1,713Updated this week
- Felafax is building AI infra for non-NVIDIA GPUs☆551Updated this week
- RAG that intelligently adapts to your use case, data, and queries☆2,747Updated this week
- llama3.np is a pure NumPy implementation for Llama 3 model.☆975Updated 7 months ago
- Implementing the 4 agentic patterns from scratch☆973Updated 2 months ago
- ☆1,403Updated last week
- Use late-interaction multi-modal models such as ColPali in just a few lines of code.☆693Updated 2 months ago