Alpha-Zero Connect Four NN trained via self-play
☆27Mar 7, 2025Updated last year
Alternatives and similar repositories for c4a0
Users that are interested in c4a0 are comparing it to the libraries listed below.
- A minimal home grid world environment to evaluate language understanding in interactive agents.☆24Sep 6, 2023Updated 2 years ago
- A number of agents (PPO, MuZero) with a Perceiver-based NN architecture that can be trained to achieve goals in nethack/minihack environm…☆43Sep 19, 2022Updated 3 years ago
- Experiments Notebook of "Understanding the Skill Gap in Recurrent Language Models: The Role of the Gather-and-Aggregate Mechanism"☆15Apr 30, 2025Updated last year
- Reasoning-based Evaluation and Ranking of Translations.☆20Jul 18, 2025Updated 9 months ago
- manipulating cointegrated pairs to achieve a market-neutral strategy that outperforms indices☆10Jan 12, 2021Updated 5 years ago
- Action Value Gradient Algorithm☆28May 18, 2025Updated 11 months ago
- Code for TRANSDREAMER: REINFORCEMENT LEARNING WITH TRANSFORMER WORLD MODELS☆31Oct 12, 2023Updated 2 years ago
- coloring terminal text with intensities (used for plotting probability, entropy with tokens)☆12Oct 11, 2024Updated last year
- Output image to a file, stream, canvas, console, buffer or any other destination☆17Jan 18, 2025Updated last year
- Combining SOAP and MUON☆20Feb 11, 2025Updated last year
- Code to reproduce key results accompanying "SAEs (usually) Transfer Between Base and Chat Models"☆13Jul 18, 2024Updated last year
- ☆15Mar 2, 2025Updated last year
- A Statistical Arbitrage Strategy to trade Cryptocurrency Pairs☆14Nov 6, 2020Updated 5 years ago
- Source code for the paper "Positional Attention: Expressivity and Learnability of Algorithmic Computation"☆14May 26, 2025Updated 11 months ago
- This is the code which powers the Twitter Bot https://twitter.com/RGB_Colours☆15Apr 14, 2017Updated 9 years ago
- ☆45Apr 30, 2018Updated 8 years ago
- Minimal implementation of TokenFormer for inference and learning☆13Nov 6, 2024Updated last year
- ApertureDB Python Client☆12Apr 23, 2026Updated last week
- ☆63Jun 12, 2025Updated 10 months ago
- Schedule free optimiser implemented in JAX using Optimistix☆15May 29, 2024Updated last year
- I can haz planetz?☆11Jun 12, 2020Updated 5 years ago
- Parallel Associative Scan for Language Models☆18Jan 8, 2024Updated 2 years ago
- A python library that supports all vector databases specifically for LLM apps and frameworks☆13May 3, 2023Updated 3 years ago
- ☆19Dec 4, 2025Updated 4 months ago
- Simply drag and drop your PDF files into Preve to get started. Ask Preve questions about your document. Get Summaries, key points, specif…☆11Apr 9, 2026Updated 3 weeks ago
- Simulating a 2D Hovering SpaceX Grasshopper with a Thrust Vector Control engine.☆12Dec 28, 2015Updated 10 years ago
- A tool for dependencies validation for ninja build system using strace to detect the real dependencies☆16Nov 12, 2018Updated 7 years ago
- research impl of Native Sparse Attention (2502.11089)☆63Feb 19, 2025Updated last year
- This repo contains the code for the reinforcement learning course project https://github.com/cuhkrlcourse☆12May 24, 2020Updated 5 years ago
- Neuroevolution is a Competitive Alternative to Reinforcement Learning for Skill Discovery☆17Apr 2, 2026Updated last month
- A set of useful classes and categories for iOS development.☆28May 8, 2013Updated 12 years ago
- Grokking on modular arithmetic in less than 150 epochs in MLX☆15Oct 24, 2024Updated last year
- Official Project Page for HLA: Higher-order Linear Attention (https://arxiv.org/abs/2510.27258)☆48Jan 6, 2026Updated 3 months ago
- Code for the paper "Function-Space Learning Rates"☆24Jun 3, 2025Updated 11 months ago
- Stochastic trace estimation using JAX☆17Aug 20, 2025Updated 8 months ago
- ☆18Apr 19, 2024Updated 2 years ago
- This repo contains a demo of adversarial strings poisoning a vector database and forcing specific hallucinations on a RAG chatbot.☆10May 2, 2024Updated 2 years ago
- Reinforcement learning in pure JAX.☆13Dec 24, 2025Updated 4 months ago
- Performance Counters for Apple Silicon on macOS☆20Jan 9, 2022Updated 4 years ago