☆40Jul 26, 2024Updated last year
Alternatives and similar repositories for muzero_sketch
Users that are interested in muzero_sketch are comparing it to the libraries listed below.
- Approximating the joint distribution of language models via MCTS☆22Nov 3, 2024Updated last year
- look how they massacred my boy☆63Oct 16, 2024Updated last year
- Simple Transformer in Jax☆143Jun 22, 2024Updated last year
- smolLM with Entropix sampler on pytorch☆149Oct 31, 2024Updated last year
- Plotting (entropy, varentropy) for small LMs☆99May 20, 2025Updated last year
- Modify Entropy Based Sampling to work with Mac Silicon via MLX☆49Nov 6, 2024Updated last year
- ☆33Nov 4, 2024Updated last year
- A curated lexicon of phenomenological terms for AI experience and consciousness research☆50Apr 22, 2026Updated last month
- supporting pytorch FSDP for optimizers☆84Dec 8, 2024Updated last year
- ☆16Sep 27, 2023Updated 2 years ago
- Python library for building and sharing dataframe-agnostic, sklearn-style transformers and ml models for data science competitions.☆27Mar 10, 2026Updated 2 months ago
- Benchmark tests supporting the TiledCUDA library.☆19Nov 19, 2024Updated last year
- Agentic Deep Graph Reasoning Implementation☆14Mar 4, 2025Updated last year
- NSA Triton Kernels written with GPT5 and Opus 4.1☆70Aug 12, 2025Updated 9 months ago
- Rust bindings for CTranslate2☆14Jun 21, 2023Updated 2 years ago
- ☆196May 4, 2026Updated 3 weeks ago
- FlexAttention w/ FlashAttention3 Support☆27Oct 5, 2024Updated last year
- High-performance tokenized language data-loader for Python C++ extension☆15Jul 22, 2024Updated last year
- A graph visualization of attention☆56May 20, 2025Updated last year
- Writing FLUX in Triton☆42Sep 22, 2024Updated last year
- RAG Agent for the ARC AGI Challenge☆20Jul 1, 2024Updated last year
- MLX port for xjdr's entropix sampler (mimics jax implementation)☆62Nov 4, 2024Updated last year
- ☆36Aug 16, 2024Updated last year
- Official implementation of 'A Large-Scale Exploration of mu-Transfer'☆32Jun 5, 2025Updated 11 months ago
- Entropy Based Sampling and Parallel CoT Decoding☆3,435Nov 13, 2024Updated last year
- ☆12Jun 2, 2023Updated 2 years ago
- An offline NextJS application to allow users to locally transcode audio recordings into accurate, plain-text transcripts, then to use lar…☆49Dec 5, 2025Updated 5 months ago
- cli loom that uses git to manage branches☆33Jan 3, 2025Updated last year
- NanoGPT (124M) quality in 2.67B tokens☆28Sep 17, 2025Updated 8 months ago
- ☆14May 9, 2024Updated 2 years ago
- ☆34Sep 10, 2024Updated last year
- ☆124Feb 21, 2025Updated last year
- [WIP] Better (FP8) attention for Hopper☆33Feb 24, 2025Updated last year
- ☆24Dec 26, 2023Updated 2 years ago
- ☆17Jun 8, 2025Updated 11 months ago
- This package contains a collection of tests to improve your Polars data analysis superpowers☆15Mar 15, 2026Updated 2 months ago
- [CVPR 2025] Few-shot Recognition via Stage-Wise Retrieval-Augmented Finetuning☆30Mar 15, 2026Updated 2 months ago
- ☆12Jan 4, 2024Updated 2 years ago
- Simple and efficient pytorch-native transformer training and inference (batched)☆78Apr 2, 2024Updated 2 years ago