(ICLR 2025) Mitigating Information Loss in Tree-Based Reinforcement Learning via Direct Optimization
☆30Sep 5, 2024Updated last year
Alternatives and similar repositories for SYMPOL
Users that are interested in SYMPOL are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆11Sep 23, 2024Updated last year
- Improved image classification using a modified LIME framework, enabling user-driven hierarchical feature analysis through the integration…☆17Dec 9, 2025Updated 5 months ago
- Official repo for ICML25 paper: DCBM: Data-Efficient Visual Concept Bottleneck Models☆30Sep 16, 2025Updated 8 months ago
- TransformerLens + HuggingFace☆11Nov 4, 2023Updated 2 years ago
- small language models training made easy☆14Dec 15, 2024Updated last year
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- ☆48May 9, 2026Updated 2 weeks ago
- The nnsight package enables interpreting and manipulating the internals of deep learned models.☆936Updated this week
- APAR: LLMs Can Do Auto-Parallel Auto-Regressive Decoding☆14Jul 22, 2024Updated last year
- Diffusion Probabilistic Model in Jax☆13Apr 20, 2024Updated 2 years ago
- ☆24Jan 28, 2025Updated last year
- MishformerLens intends to be a drop-in replacement for TransformerLens that AST patches HuggingFace Transformers rather than implementing…☆10Oct 7, 2024Updated last year
- @ngrok/mantle ui component library☆15May 15, 2026Updated 2 weeks ago
- Data analysis scripts for Puffer☆12Jun 4, 2025Updated 11 months ago
- Optim4RL is a Jax framework of learning to optimize for reinforcement learning.☆28Nov 27, 2024Updated last year
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- ☆15Sep 4, 2025Updated 8 months ago
- 🚀 Official code for “XStreamVGGT: Extremely Memory-Efficient Streaming Vision Geometry Grounded Transformer with KV Cache Compression”, …☆45Jan 27, 2026Updated 4 months ago
- ☆46Sep 27, 2025Updated 8 months ago
- Average-Reward Reinforcement Learning with Trust Region Methods☆11Oct 17, 2022Updated 3 years ago
- ☆13Jun 30, 2020Updated 5 years ago
- Runtime library and schema compiler for the Avro serialization format☆21Dec 13, 2021Updated 4 years ago
- A realtime multicellular organism evolution simulator with Verlet integration☆12May 30, 2021Updated 4 years ago
- Curated list of JAX Resources and Packages☆41Apr 30, 2026Updated 3 weeks ago
- Brutaltester compatible referee for coders strike back☆12Nov 27, 2018Updated 7 years ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- [NeurIPS 2025] LabUtopia: High-Fidelity Simulation and Hierarchical Benchmark for Scientific Embodied Agents☆36May 20, 2026Updated last week
- 赛尔号登录器-随机皮肤☆13Mar 6, 2026Updated 2 months ago
- ☆18Dec 10, 2025Updated 5 months ago
- Pytorch implementation on OpenAI's Procgen ppo-baseline, built from scratch.☆14May 17, 2024Updated 2 years ago
- This is the official PyTorch implementation of ASAG (ICCV 2023).☆18Sep 9, 2023Updated 2 years ago
- Tools for optimizing steering vectors in LLMs.☆22Apr 10, 2025Updated last year
- The AI Arena: A framework for distributed multi-agent reinforcement learning☆14Aug 5, 2022Updated 3 years ago
- Official codebase for "Privileged Sensing Scaffolds Reinforcement Learning", contains the Scaffolder algorithm and Sensory Scaffolding Su…☆33Sep 30, 2025Updated 7 months ago
- 江苏共青团青年大学习快速截图器☆10Aug 26, 2023Updated 2 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- MAG-SQL: Multi-Agent Generative Approach with Soft Schema Linking and Iterative Sub-SQL Refinement for Text-to-SQL☆19Jul 10, 2025Updated 10 months ago
- PyTorch and NNsight implementation of AtP* (Kramar et al 2024, DeepMind)☆20Jan 19, 2025Updated last year
- Subject of the hackathon 42☆12Nov 9, 2022Updated 3 years ago
- The Comyco's Video Description Dataset☆13Oct 10, 2024Updated last year
- ☆20Mar 18, 2026Updated 2 months ago
- TopViewRS: Vision-Language Models as Top-View Spatial Reasoners (EMNLP 2024 Oral)☆15Jun 14, 2025Updated 11 months ago
- code for the paper "Adversarial Reinforced Instruction Attacker for Robust Vision-Language Navigation" (TPAMI 2021)☆10Jul 15, 2022Updated 3 years ago