Repo du cours d'introduction à l'apprentissage par renforcement.
☆15Feb 2, 2025Updated last year
Alternatives and similar repositories for IntroRL
Users that are interested in IntroRL are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- An unofficial implementation of "Mixture-of-Depths: Dynamically allocating compute in transformer-based language models"☆36Jun 7, 2024Updated last year
- Experiments Notebook of "Understanding the Skill Gap in Recurrent Language Models: The Role of the Gather-and-Aggregate Mechanism"☆15Apr 30, 2025Updated 10 months ago
- Reasoning-based Evaluation and Ranking of Translations.☆20Jul 18, 2025Updated 8 months ago
- manipulating cointegrated pairs to achieve a market-neutral strategy that outperforms indices☆12Jan 12, 2021Updated 5 years ago
- Open-sourcing code associated with the AAAI-25 paper "On the Expressiveness and Length Generalization of Selective State-Space Models on …☆16Sep 18, 2025Updated 6 months ago
- Bare Metal GPUs on DigitalOcean Gradient AI • AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- coloring terminal text with intensities (used for plotting probability, entropy with tokens)☆12Oct 11, 2024Updated last year
- A few models converted from caffe to CoreMLs format.☆15Jun 6, 2017Updated 8 years ago
- Rust widget toolkit built on Reclutch☆11Mar 25, 2020Updated 6 years ago
- Scratchpad/Chain-of-Thought Prompts☆12Jun 6, 2022Updated 3 years ago
- Combining SOAP and MUON☆19Feb 11, 2025Updated last year
- Code to reproduce key results accompanying "SAEs (usually) Transfer Between Base and Chat Models"☆13Jul 18, 2024Updated last year
- Hub for Open Source AGiXT Extensions, Chains, Prompts, and Agents.☆17Sep 27, 2023Updated 2 years ago
- Official implementation of ECCV24 paper: POA☆24Aug 8, 2024Updated last year
- ☆15Mar 2, 2025Updated last year
- End-to-end encrypted email - Proton Mail • AdSpecial offer: 40% Off Yearly / 80% Off First Month. All Proton services are open source and independently audited for security.
- An approximate implementation of the OpenAI paper - An Empirical Model of Large-Batch Training for MNIST☆11Nov 19, 2022Updated 3 years ago
- A Statistical Arbitrage Strategy to trade Cryptocurrency Pairs☆14Nov 6, 2020Updated 5 years ago
- Source code for the paper "Positional Attention: Expressivity and Learnability of Algorithmic Computation"☆14May 26, 2025Updated 10 months ago
- Gradient noise generators in C (perlin and simplex)☆12Dec 18, 2012Updated 13 years ago
- A simulator of Michelson interferometer.☆13Nov 23, 2020Updated 5 years ago
- Simulating a 2D Hovering SpaceX Grasshopper with a Thrust Vector Control) engine.☆12Dec 28, 2015Updated 10 years ago
- Open-sourcing my latest music album.☆12Sep 23, 2020Updated 5 years ago
- Official Project Page for HLA: Higher-order Linear Attention (https://arxiv.org/abs/2510.27258)☆45Jan 6, 2026Updated 2 months ago
- This repo contains the code for the reinforcement learning course project https://github.com/cuhkrlcourse☆12May 24, 2020Updated 5 years ago
- DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- ☆91Aug 18, 2024Updated last year
- Alpha-Zero Connect Four NN trained via self play☆26Mar 7, 2025Updated last year
- Implement a Stack VM Interpreter with a Register Window☆11Jan 2, 2024Updated 2 years ago
- A set of useful classes and categories for iOS development.☆29May 8, 2013Updated 12 years ago
- A minimal lockless queue (i.e. a light pipe) witten in vanilla C.☆13May 6, 2018Updated 7 years ago
- Grokking on modular arithmetic in less than 150 epochs in MLX☆16Oct 24, 2024Updated last year
- Super tiny tap output library☆12Oct 13, 2023Updated 2 years ago
- A compiler synthesizer for simple languages.☆15Dec 18, 2018Updated 7 years ago
- 🎸 A collection of awesome guitar resources.☆12Jul 26, 2019Updated 6 years ago
- NordVPN Special Discount Offer • AdSave on top-rated NordVPN 1 or 2-year plans with secure browsing, privacy protection, and support for for all major platforms.
- modular audio processing system☆16Feb 21, 2013Updated 13 years ago
- A jekyll plug-in that provides a Liquid filter for emojifying text with https://github.com/github/gemoji. See http://www.emoji-cheat-shee…☆21Mar 19, 2015Updated 11 years ago
- A deceptively simple way to add a configuration file to a command-line application.☆17Mar 11, 2025Updated last year
- Capo is a modern music notation programming language designed for fast, feature-rich, and customizable music entry.☆17Jan 13, 2026Updated 2 months ago
- Fluid Language Model Benchmarking☆27Sep 16, 2025Updated 6 months ago
- A simple queue using a linked list written in C under the BSD license.☆18Jun 25, 2017Updated 8 years ago
- Efficient PScan implementation in PyTorch☆17Jan 2, 2024Updated 2 years ago