☆40Jul 26, 2024Updated last year
Alternatives and similar repositories for muzero_sketch
Users that are interested in muzero_sketch are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Approximating the joint distribution of language models via MCTS☆22Nov 3, 2024Updated last year
- look how they massacred my boy☆63Oct 16, 2024Updated last year
- Simple Transformer in Jax☆143Jun 22, 2024Updated last year
- ☆21Apr 9, 2026Updated last week
- smolLM with Entropix sampler on pytorch☆149Oct 31, 2024Updated last year
- Wordpress hosting with auto-scaling - Free Trial • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- Plotting (entropy, varentropy) for small LMs☆99May 20, 2025Updated 10 months ago
- Modify Entropy Based Sampling to work with Mac Silicon via MLX☆49Nov 6, 2024Updated last year
- ☆33Nov 4, 2024Updated last year
- A curated lexicon of phenomenological terms for AI experience and consciousness research☆46Mar 27, 2026Updated 3 weeks ago
- Meme search engine for the real shitposters☆11Jan 27, 2026Updated 2 months ago
- supporting pytorch FSDP for optimizers☆84Dec 8, 2024Updated last year
- Companion repository to "Prompt Compression and Contrastive Conditioning for Controllability and Toxicity Reduction in Language Models"☆14May 31, 2023Updated 2 years ago
- ☆16Sep 27, 2023Updated 2 years ago
- Python library for building and sharing dataframe-agnostic, sklearn-style transformers and ml models for data science competitions.☆28Mar 10, 2026Updated last month
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- Benchmark tests supporting the TiledCUDA library.☆18Nov 19, 2024Updated last year
- A `tree` util enhanced with tokens, lines, and components. `pip install -U tree_plus`☆15Nov 24, 2025Updated 4 months ago
- Agentic Deep Graph Reasoning Implementation☆14Mar 4, 2025Updated last year
- Quick Notebook Tutorials☆36Jul 17, 2025Updated 9 months ago
- Rust bindings for CTranslate2☆14Jun 21, 2023Updated 2 years ago
- ☆196Mar 27, 2026Updated 3 weeks ago
- FlexAttention w/ FlashAttention3 Support☆27Oct 5, 2024Updated last year
- High-performance tokenized language data-loader for Python C++ extension☆14Jul 22, 2024Updated last year
- A graph visualization of attention☆56May 20, 2025Updated 10 months ago
- GPUs on demand by Runpod - Special Offer Available • AdRun AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
- RAG Agent for the ARC AGI Challenge☆20Jul 1, 2024Updated last year
- ☆115Dec 1, 2024Updated last year
- MLX port for xjdr's entropix sampler (mimics jax implementation)☆62Nov 4, 2024Updated last year
- ☆36Aug 16, 2024Updated last year
- Transformer with Mu-Parameterization, implemented in Jax/Flax. Supports FSDP on TPU pods.☆32Jun 5, 2025Updated 10 months ago
- Entropy Based Sampling and Parallel CoT Decoding☆3,431Nov 13, 2024Updated last year
- Fast, free, easy, and object-agnostic video anonymization☆12Dec 12, 2020Updated 5 years ago
- ☆12Jun 2, 2023Updated 2 years ago
- Cowork-like experience in the browser using filesystem api☆78Updated this week
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- cli loom that uses git to manage branches☆33Jan 3, 2025Updated last year
- NanoGPT (124M) quality in 2.67B tokens☆28Sep 17, 2025Updated 7 months ago
- Get ready for that YC interview☆39Nov 19, 2024Updated last year
- ☆34Sep 10, 2024Updated last year
- [WIP] Better (FP8) attention for Hopper☆33Feb 24, 2025Updated last year
- ☆24Dec 26, 2023Updated 2 years ago
- ☆17Jun 8, 2025Updated 10 months ago