Enable moe for nanogpt.
☆35Dec 11, 2023Updated 2 years ago
Alternatives and similar repositories for nanoGPT-moe
Users that are interested in nanoGPT-moe are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Unity Reinforcement Learning compared to Goal-Oriented Action Planning☆15Dec 20, 2019Updated 6 years ago
- Jeroen Cottaar's work for the Kaggle Geophysical Waveform Inversion competition (2nd place)☆13Aug 11, 2025Updated 10 months ago
- Test server code for Phi-2 model. support OpenAI API spec☆18Dec 15, 2023Updated 2 years ago
- ☆11Aug 3, 2023Updated 2 years ago
- Inference code for Mistral and Mixtral hacked up into original Llama implementation☆368Dec 9, 2023Updated 2 years ago
- Serverless GPU API endpoints on Runpod - Get Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- Code for experiments on transformers using Markovian data.☆22Nov 22, 2024Updated last year
- Just Build It - a "do what I mean" abstraction for Haskell build tools☆12Jun 8, 2018Updated 8 years ago
- [NeurIPS 2025] Official code for "Tropical Attention: Neural Algorithmic Reasoning for Combinatorial Algorithms"☆32May 21, 2026Updated last month
- tinkerpop blueprints graphdb on top on lmdb☆20Jan 7, 2014Updated 12 years ago
- ☆19Mar 25, 2025Updated last year
- [ICML 2024] Junk DNA Hypothesis: A Task-Centric Angle of LLM Pre-trained Weights through Sparsity; Lu Yin*, Ajay Jaiswal*, Shiwei Liu, So…☆16Apr 21, 2025Updated last year
- [ICML 2024] DPZero: Private Fine-Tuning of Language Models without Backpropagation☆17Sep 4, 2024Updated last year
- ☆20Jun 22, 2026Updated last week
- Source code for IRL-INR (ICML 2023)☆20May 27, 2023Updated 3 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- A style guide for Haskell code.☆14May 26, 2025Updated last year
- ☆21Dec 14, 2024Updated last year
- demo for par_camera_control.h☆11Nov 22, 2022Updated 3 years ago
- Typewriter component for Svelte that actually "types" one character at a time☆16Jun 4, 2026Updated 3 weeks ago
- A worker pool library for Rust☆13Jun 19, 2026Updated last week
- ☆14Apr 16, 2024Updated 2 years ago
- some classes which can help me to program kernel driver in Windows.☆16Feb 9, 2018Updated 8 years ago
- Write fixtures compactly, expand them to a vector☆12Jan 10, 2017Updated 9 years ago
- time-bomb.nvim is a minimal Neovim plugin for timers and Pomodoro cycles to boost developer focus. Features floating timers, 9 progress b…☆32Mar 12, 2026Updated 3 months ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- Leiningen Plugin that lints your Clojure project and checks that every public var is documented☆10Jun 18, 2021Updated 5 years ago
- a minimalistic todo app☆10May 10, 2023Updated 3 years ago
- An unnecessarily tiny and minimal implementation of GPT-2 in NumPy.☆11Feb 12, 2023Updated 3 years ago
- It's Data.Graph, but it doesn't suck!☆16Jun 3, 2021Updated 5 years ago
- 💵 Code for Less is More for Long Document Summary Evaluation by LLMs (Wu*, Iso* et al; EACL 2024)☆11Feb 22, 2024Updated 2 years ago
- Efficient and correct pagination!☆16May 22, 2026Updated last month
- Sudoku solver in Golang☆10Sep 6, 2020Updated 5 years ago
- Barycentric coordinates GCN shader extension sample for DirectX 11☆13Jan 13, 2021Updated 5 years ago
- PyCon 2016 Tutorial Session -- Making Connections with Natural Language Processing☆12May 26, 2016Updated 10 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- ☆13Mar 7, 2024Updated 2 years ago
- Command line tool for converting images to ASCII art☆20Jun 4, 2026Updated 3 weeks ago
- JAX implementation of GPTQ quantization algorithm☆10Jul 19, 2023Updated 2 years ago
- Scripts for managing Debian and RPM package repositories☆14Jan 14, 2026Updated 5 months ago
- 🐶chihuahua - tiny & fast rendering library☆13Aug 9, 2016Updated 9 years ago
- ☆16Feb 21, 2026Updated 4 months ago
- Autoregressive Image Generation☆31Jun 13, 2025Updated last year