Bilkent-CYBORG / VOPy
A Framework for Black-box Vector Optimization
☆32Updated last week
Alternatives and similar repositories for VOPy
Users who are interested in VOPy are comparing it to the libraries listed below.
- Code for [ICML 2025] "Monte Carlo Tree Search for Comprehensive Exploration in LLM-Based Automatic Heuristic Design".☆54Updated 3 months ago
- [ICLR 2025] Official implementation of DICL (Disentangled In-Context Learning), featured in the paper "Zero-shot Model-based Reinforcemen…☆26Updated 6 months ago
- ☆53Updated 2 months ago
- ☆63Updated 5 months ago
- Gradient Boosting Reinforcement Learning (GBRL)☆118Updated 2 weeks ago
- Causal Agent based on Large Language Model☆50Updated 2 months ago
- Official source code for "Graph Neural Networks for Learning Equivariant Representations of Neural Networks". In ICLR 2024 (oral).☆82Updated last year
- This repository contains code for the paper "Learning Decision Trees as Amortized Structure Inference"☆14Updated 5 months ago
- (ICLR 2025) Mitigating Information Loss in Tree-Based Reinforcement Learning via Direct Optimization☆23Updated 11 months ago
- Implementation of Dualformer☆20Updated 6 months ago
- ☆13Updated 11 months ago
- ☆32Updated last year
- Efficient Scaling laws and collaborative pretraining.☆17Updated 7 months ago
- The official implementation of Regularized Policy Gradient (RPG) (https://arxiv.org/abs/2505.17508)☆35Updated last week
- The original Shared Recurrent Memory Transformer implementation☆30Updated last month
- Efficiently discovering algorithms via LLMs with evolutionary search and reinforcement learning.☆106Updated 3 weeks ago
- The official repository of "SmartAgent: Chain-of-User-Thought for Embodied Personalized Agent in Cyber World".☆29Updated 2 weeks ago
- ☆19Updated 5 months ago
- We integrate discrete diffusion models with neurosymbolic predictors for scalable and calibrated learning and reasoning☆40Updated 3 months ago
- Code for "Accelerating Training with Neuron Interaction and Nowcasting Networks" [to appear at ICLR 2025]☆20Updated 3 months ago
- ☆35Updated 8 months ago
- ☆22Updated 11 months ago
- Q-Probe: A Lightweight Approach to Reward Maximization for Language Models☆41Updated last year
- Repo to reproduce the First-Explore paper results☆38Updated 8 months ago
- The official code release for Q#: Provably Optimal Distributional RL for LLM Post-Training☆16Updated 5 months ago
- Dataset Reset Policy Optimization☆30Updated last year
- A Large Recurrent Action Model: xLSTM enables Fast Inference for Robotics Tasks☆35Updated 10 months ago
- Drop-in environment replacements that make your RL algorithm train faster.☆21Updated last year
- ☆67Updated last year
- ☆24Updated 3 months ago