☆40Feb 25, 2026Updated 2 months ago
Alternatives and similar repositories for MiniModel
Users that are interested in MiniModel are comparing it to the libraries listed below.
- [NeurIPS 2025, Spotlight]: Ambient-o: Training Good models with Bad Data.☆34Apr 6, 2026Updated 3 weeks ago
- ☆44Sep 15, 2025Updated 7 months ago
- A curated collection of prompts for Grok Imagine by xAI☆28Oct 19, 2025Updated 6 months ago
- ☆16Jul 17, 2025Updated 9 months ago
- Quadratic formula implemented in real life.☆17Aug 12, 2025Updated 8 months ago
- Sentencepiece based BPE tokenizer for English and Japanese language text.☆28Apr 4, 2024Updated 2 years ago
- Docker/podman container for llama.cpp/vllm/exllamav{2,3} orchestrated using llama-swap☆18Apr 10, 2026Updated 2 weeks ago
- Multi-Modal Language Modeling with Image, Audio and Text Integration, including multiple images and multiple audio clips in a single multi-turn conversation.☆18Feb 20, 2024Updated 2 years ago
- LLM Benchmark Using Project Euler For Coding Challenges☆14Updated this week
- ☆21Mar 3, 2026Updated last month
- ☆59Updated this week
- Macro-from-Micro Planning for High-Quality and Parallelized Autoregressive Long Video Generation☆38Oct 31, 2025Updated 6 months ago
- Conformer block with Rotary Position Embedding, modified from lucidrains' implementation☆18Sep 13, 2024Updated last year
- A PyTorch implementation of Vector Quantized Variational Autoencoder (VQ-VAE) with EMA updates, pretrained encoder, and K-means initializ…☆21Mar 26, 2026Updated last month
- This repo implements a video generation model using Latent Diffusion Transformers (Latte) in PyTorch and provides training and inference cod…☆18Jan 6, 2025Updated last year
- A replication of the paper "Adaptive Mixtures of Local Experts" applied to the CIFAR-10 image classification dataset.☆12Mar 19, 2021Updated 5 years ago
- node.js driver for ublox gps☆18Sep 24, 2025Updated 7 months ago
- Code for "From Ideal to Real: Unified and Data-Efficient Dense Prediction for Real-World Scenarios"☆27Jul 7, 2025Updated 9 months ago
- A simple implementation of Llama 1, 2. Llama Architecture built from scratch using PyTorch all the models are built from scratch that inc…☆14May 6, 2024Updated last year
- Employee attendance system using the PHP framework CodeIgniter☆10Jan 11, 2021Updated 5 years ago
- Course Project for COMP4471 on RWKV☆17Feb 11, 2024Updated 2 years ago
- Environment equipped with reinforcement learning algorithms to train agents to play tic-tac-toe.☆13Mar 4, 2023Updated 3 years ago
- Network Etiquette (Netiquette) -- Written with 2020 technology in mind☆10Nov 19, 2021Updated 4 years ago
- Firecracker VM orchestration for Claude Code sessions☆26Mar 30, 2026Updated last month
- Synthetic dataset generation workflow using local file resources for fine-tuning LLMs.☆82Oct 9, 2025Updated 6 months ago
- An implementation of the base GPT-3 model architecture from the OpenAI paper "Language Models are Few-Shot Learners"☆20Jun 29, 2024Updated last year
- Simple implementation of Zipformer: a faster and better encoder for automatic speech recognition, in PyTorch☆20Jun 3, 2024Updated last year
- A straightforward explanation of how DeepSeek R1 works☆18Feb 7, 2025Updated last year
- Community Implementation of the paper: "Multi-Head Mixture-of-Experts" In PyTorch☆30Apr 13, 2026Updated 2 weeks ago
- This repo implements and trains DallE-1 on a synthetically generated dataset which has colored mnist images on texture/solid background a…☆13Oct 30, 2024Updated last year
- [ICLR 2026] GRAPE: Group Representational Position Encoding (https://arxiv.org/abs/2512.07805)☆84Apr 1, 2026Updated 3 weeks ago
- Official implementation of the paper: [EMNLP 2025] RICO: Improving Accuracy and Completeness in Image Recaptioning via Visual Reconstruct…☆21Dec 9, 2025Updated 4 months ago
- an auto-sleeping and -waking framework around llama.cpp☆12Feb 8, 2025Updated last year
- ☆45Feb 20, 2026Updated 2 months ago
- Learn the building blocks of how to build gpt-oss from scratch☆118Sep 23, 2025Updated 7 months ago
- ☆12May 30, 2025Updated 11 months ago
- Minimal Kréta client written in Python.☆11Oct 7, 2023Updated 2 years ago
- Exploration into the proposed architecture from Sapient Intelligence of Singapore 🇸🇬☆74Aug 18, 2025Updated 8 months ago
- Game Companion AI is an advanced application designed to enhance the gaming experience by providing real-time analysis and interpretation…☆54Sep 30, 2024Updated last year