Alpha-Zero Connect Four NN trained via self play
☆26Mar 7, 2025Updated 11 months ago
Alternatives and similar repositories for c4a0
Users that are interested in c4a0 are comparing it to the libraries listed below
Sorting:
- A minimal home grid world environment to evaluate language understanding in interactive agents.☆24Sep 6, 2023Updated 2 years ago
- Action Value Gradient Algorithm☆28May 18, 2025Updated 9 months ago
- ☆12Jan 17, 2026Updated last month
- Code for TRANSDREAMER: REINFORCEMENT LEARNING WITH TRANSFORMER WORLD MODELS☆29Oct 12, 2023Updated 2 years ago
- The official starter-kit for NeurIPS 2025 mind games competition☆21Jul 27, 2025Updated 7 months ago
- Bayesian probability transforms for BM25 retrieval scores☆40Updated this week
- Partially Observable Multi-Agent RL with Transformers☆17Feb 13, 2026Updated 2 weeks ago
- Simply drag and drop your PDF files into Preve to get started. Ask Preve questions about your document. Get Summaries, key points, specif…☆11Apr 5, 2025Updated 10 months ago
- ☆15Jul 27, 2023Updated 2 years ago
- PyTorch Implementation of Context-Aware Sequential Model for Multi-Behaviour Recommendation https://arxiv.org/abs/2312.09684☆10May 31, 2024Updated last year
- About The dataset was recorded on the Husky robotics platform on the university campus and consists of 5 tracks recorded at different tim…☆11Mar 25, 2025Updated 11 months ago
- Text preprocessing package for use in NLP tasks https://pypi.org/project/textcl/☆11Aug 9, 2024Updated last year
- Generative AI app for Lost and Found belonggins using Open AI clip-vit-large to create image embeddings and search them using Natural Lan…☆10Jul 15, 2024Updated last year
- Expand -> Retrieve -> Rerank - simple method with strong results on BRIGHT benchmark☆22Aug 22, 2025Updated 6 months ago
- Brax + Pufferlib + CARBS for gpu-accelerated robotics RL☆12Jun 12, 2025Updated 8 months ago
- Scratchpad/Chain-of-Thought Prompts☆12Jun 6, 2022Updated 3 years ago
- A CLI tool for finding the files that count 🤠🔫☆13Feb 24, 2025Updated last year
- Code for the "Evolving Reservoirs for Meta Reinforcement Learning" paper☆11Apr 22, 2024Updated last year
- Reverse engineered Twitter's API☆13Nov 28, 2023Updated 2 years ago
- A Statistical Arbitrage Strategy to trade Cryptocurrency Pairs☆13Nov 6, 2020Updated 5 years ago
- Reasoning-based Evaluation and Ranking of Translations.☆19Jul 18, 2025Updated 7 months ago
- A NodeJS application to upload, watch and stream live videos.☆12Jan 24, 2023Updated 3 years ago
- ☆20Dec 1, 2025Updated 3 months ago
- We provide the gui for termux . it is a Linux system with gui running on Android for AI programming without root.Ai framework: tensorfl…☆13May 11, 2019Updated 6 years ago
- Code for PuzzleJAX, a benchmark for reasoning and learning, that reimplements PuzzleScript, a concise and expressive DSL and game engine …☆25Updated this week
- ☆45Apr 30, 2018Updated 7 years ago
- I can haz planetz?☆11Jun 12, 2020Updated 5 years ago
- ☆15Mar 2, 2025Updated last year
- Simulating a 2D Hovering SpaceX Grasshopper with a Thrust Vector Control) engine.☆12Dec 28, 2015Updated 10 years ago
- This code accompanies the paper "Bayesian Framework for Information-Theoretic Probing" published in EMNLP 2021.☆10Aug 23, 2021Updated 4 years ago
- AI-powered self-interview preparation platform. This platform will use the magic of AI and language processing to simulate real intervie…☆18Jul 29, 2023Updated 2 years ago
- Gym env for Slay the Spire☆16Dec 31, 2024Updated last year
- ☆13Apr 25, 2024Updated last year
- Code and data for Learning Rewards from Linguistic Feedback, AAAI '21☆10Dec 16, 2020Updated 5 years ago
- Minimal implementation of TokenFormer for inference and learning☆13Nov 6, 2024Updated last year
- Maximal Update Parametrization (μP) with Flax & Optax.☆16Dec 27, 2023Updated 2 years ago
- Reinforcement learning in pure JAX.☆13Dec 24, 2025Updated 2 months ago
- Repo for BEHAVE: Dataset and Method for Tracking Human Object Interactions, CVPR'22☆15Oct 12, 2022Updated 3 years ago
- Official code for "A General Learning Framework for Open Ad Hoc Teamwork Using Graph-based Policy Learning"☆17Mar 1, 2023Updated 3 years ago