Experimenting with small language models
☆75Jan 16, 2024Updated 2 years ago
Alternatives and similar repositories for slms
Users that are interested in slms are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Conformer block with Rotary Position Embedding, modified from lucidrains' implement☆19Sep 13, 2024Updated last year
- Remove generated stories with stray unicode characters☆12Jan 3, 2024Updated 2 years ago
- Training Small Language Model☆29Dec 26, 2023Updated 2 years ago
- A Python-based voice assistant integrating speech-to-text (STT), text-to-speech (TTS), and powerful AI capabilities using either a local …☆18Dec 8, 2025Updated 5 months ago
- This repo implements and trains DallE-1 on a synthetically generated dataset which has colored mnist images on texture/solid background a…☆13Oct 30, 2024Updated last year
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- GTK+ Console Desktop Environment, a desktop environment to give Linux a game-console look and feel.☆11Jan 15, 2021Updated 5 years ago
- Exploring the minimal architecture required for coherent English language generation.☆13May 21, 2026Updated last week
- An implementation of the base GPT-3 Model architecture from the paper by OPENAI "Language Models are Few-Shot Learners"☆21Jun 29, 2024Updated last year
- Official implementation of ICLR 2025 'LORO: Parameter and Memory Efficient Pretraining via Low-rank Riemannian Optimization'☆18Apr 24, 2025Updated last year
- ☆127Jan 9, 2026Updated 4 months ago
- A curated collection of prompts for Grok Imagine by xAI☆29Oct 19, 2025Updated 7 months ago
- Residual Quantization Autoencoder, used for interpreting LLMs☆14Jan 1, 2025Updated last year
- PyTorch Implementation of GPT-2☆34Sep 4, 2024Updated last year
- LlamaNet: Decentralized Inference Swarm for llama.cpp☆23Jan 18, 2026Updated 4 months ago
- Bare Metal GPUs on DigitalOcean Gradient AI • AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- Play Minecraft with AI☆11Jul 20, 2022Updated 3 years ago
- ☆26Mar 20, 2024Updated 2 years ago
- React Milestones Visualization☆12May 13, 2025Updated last year
- Rust-GPU org website☆15Feb 23, 2026Updated 3 months ago
- A python package for developing AI applications with local LLMs.☆150Jan 4, 2025Updated last year
- ☆29Dec 15, 2025Updated 5 months ago
- Sample project showing how to use Carbon components in Vue3☆14May 21, 2026Updated last week
- ☆21Jul 23, 2025Updated 10 months ago
- A Qwen .5B reasoning model trained on OpenR1-Math-220k☆14Oct 11, 2025Updated 7 months ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- Locally running LLM with internet access☆96Jun 30, 2025Updated 10 months ago
- Implement Retrospective Reader for Machine Reading Comprehension with 🤗 transformers and datasets☆19Jun 7, 2022Updated 3 years ago
- Infrastucture as code☆23May 13, 2026Updated 2 weeks ago
- flow-merge is a powerful Python library that enables seamless merging of multiple transformer-based language models using the most popula…☆20Feb 12, 2025Updated last year
- ☆11Jan 24, 2025Updated last year
- ☆12Dec 14, 2024Updated last year
- Simple LLM inference server☆20Jun 13, 2024Updated last year
- Lightweight tools for quick and easy LLM demo's☆28Sep 22, 2024Updated last year
- The TinyLlama project is an open endeavor to pretrain a 1.1B Llama model on 3 trillion tokens.☆14Mar 30, 2024Updated 2 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- 🥪 Mess portal where owners can set their weekly menu, price, time, and students can purchase their desired coupons, with a QR code syste…☆11Jun 2, 2023Updated 2 years ago
- Python client for Jikan.moe, MyAnimeList unofficial API with good intentions.☆14Dec 20, 2022Updated 3 years ago
- Neural Networks in Cognitive Science☆17Updated this week
- ☆23Nov 8, 2023Updated 2 years ago
- PID Controller written in Rust☆29Mar 8, 2026Updated 2 months ago
- Load multiple LoRA modules simultaneously and automatically switch the appropriate combination of LoRA modules to generate the best answe…☆159Feb 9, 2024Updated 2 years ago
- Fundamentals and Applications of Density Functional Theory with Interactive Live Scripts and Intuitive Codes☆18Apr 21, 2025Updated last year