NanoGPT (124M) in 5 minutes
☆15Feb 14, 2025Updated last year
Alternatives and similar repositories for nanogpt-speedrun
Users that are interested in nanogpt-speedrun are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Baseline system for Language-based Audio Retrieval (Task 6B) in DCASE 2023 Challenge☆10Aug 8, 2023Updated 2 years ago
- Minimal (truly) muP implementation, consistent with TP4 and TP5 papers notation☆14Jan 2, 2026Updated 2 months ago
- Qudiet: A high performance hybrid qubit-qudit quantum simulator that scales and eats qudits for lunch.☆24Feb 2, 2025Updated last year
- ☆30Feb 12, 2025Updated last year
- Mapping out the "memory" of neural nets with data attribution☆50Updated this week
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click and start building anything your business needs.
- A locally trained model of Stoney Nakoda has been developed and released. You can access the working model here or train your own instanc…☆10Oct 29, 2025Updated 5 months ago
- ☆16Apr 29, 2025Updated 11 months ago
- All the content of my youtube channel : https://youtube.com/@florenzerstling?si=7t10PBr6MDha74PO☆14May 28, 2025Updated 10 months ago
- A small tool for embedding files in a Go source file.☆11Nov 3, 2020Updated 5 years ago
- Simulation of job offers and CVs with real-time processing, classification, and analytics using Kafka, Ray, Spark, and Databricks. Includ…☆14Dec 25, 2024Updated last year
- (WIP)long form speech generatoins☆31Apr 2, 2025Updated 11 months ago
- Using multiple LLMs for ensemble Forecasting☆16Jan 17, 2024Updated 2 years ago
- a webapp that helps you listen to colors, see also the first version here, http://www.synestizer.com allow camera and microphone if promp…☆11Jan 9, 2026Updated 2 months ago
- Go struct tags for marshaling and unmarshaling map[string]string☆12Nov 15, 2020Updated 5 years ago
- DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Text-To-Speech for NotebookLM☆39Jul 20, 2025Updated 8 months ago
- ☆22Aug 8, 2022Updated 3 years ago
- ☆16Jan 26, 2025Updated last year
- Calculate allowed interactions in QED☆10Nov 2, 2022Updated 3 years ago
- Optimizing Causal LMs through GRPO with weighted reward functions and automated hyperparameter tuning using Optuna☆59Oct 18, 2025Updated 5 months ago
- An intelligent code optimization system leveraging AI analysis, automated refactoring, and test generation. Built with DSPy and Gradio, i…☆20Feb 1, 2025Updated last year
- Exploring Agno framework for building AI agents.☆25Mar 5, 2025Updated last year
- Emacs 中看 B 站☆11Jul 27, 2025Updated 8 months ago
- The ASAS-SN Sky Patrol python client☆20Oct 22, 2025Updated 5 months ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- 本项目主要对开源的MOSS SFT数据进行整理 ,转换成mnbvc多轮对话格式。MOSS-003涵盖用性、忠实性、无害性三个层面,共353w样本,MOSS-003 包含更细粒度的有用性类别标记、更广泛的无害性数据和更长对话轮数,共630w样本,☆12Dec 3, 2023Updated 2 years ago
- PyTorch implementation of Miipher-2 [2025] which is a speech restoration model by Google DeepMind☆64Sep 22, 2025Updated 6 months ago
- Fine-tunes a student LLM using teacher feedback for improved reasoning and answer quality. Implements GRPO with teacher-provided evaluati…☆52May 7, 2025Updated 10 months ago
- Voxtral: Convert Mistral into a end2end SpeechLM. No information bottleneck, preserves prosody, learns interruptions from data. Unlike GP…☆42Mar 7, 2025Updated last year
- Cross-platform ssh-server based chat program, with data persisted into relational databases of MySQL, PostgreSQL or Sqlite3.☆11Jan 31, 2021Updated 5 years ago
- middleware based components to build a custom mqtt broker☆15Nov 30, 2015Updated 10 years ago
- Code for "What really matters in matrix-whitening optimizers?"☆23Oct 31, 2025Updated 4 months ago
- Implementation of the LDP module block in PyTorch and Zeta from the paper: "MobileVLM: A Fast, Strong and Open Vision Language Assistant …☆15Mar 11, 2024Updated 2 years ago
- Range-based algorithms in Go☆14Sep 10, 2023Updated 2 years ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- A fundamental and extendable JSON API library for Go.☆11Jan 17, 2023Updated 3 years ago
- A graphRAG based CLI-tool to add any documentation as a context using just a single URL☆21Nov 23, 2025Updated 4 months ago
- Wenet speech to text for react native☆10Nov 1, 2022Updated 3 years ago
- OMNI-P2x: A universal neural network potential for excited states☆12Mar 19, 2026Updated last week
- ☆15Oct 24, 2023Updated 2 years ago
- Check that error return value are wrapped☆17Oct 17, 2019Updated 6 years ago
- Multi-lingual large voice generation model, providing inference, training and deployment full-stack ability.☆13Jul 15, 2024Updated last year