Bilkent-CYBORG / VOPy
A Framework for Black-box Vector Optimization
☆31Updated 2 months ago
Alternatives and similar repositories for VOPy
Users who are interested in VOPy are comparing it to the libraries listed below.
- [ICLR 2025] Official implementation of DICL (Disentangled In-Context Learning), featured in the paper "Zero-shot Model-based Reinforcemen…☆27Updated 8 months ago
- ☆19Updated 7 months ago
- Code for "Accelerating Training with Neuron Interaction and Nowcasting Networks" [ICLR 2025]☆24Updated 3 weeks ago
- ☆66Updated 7 months ago
- Efficient Scaling laws and collaborative pretraining.☆18Updated last month
- We integrate discrete diffusion models with neurosymbolic predictors for scalable and calibrated learning and reasoning☆51Updated last month
- Dataset Reset Policy Optimization☆31Updated last year
- Causal Agent based on Large Language Model☆55Updated 2 months ago
- UQ: Assessing Language Models on Unsolved Questions☆26Updated 2 months ago
- Bayes-Adaptive RL for LLM Reasoning☆40Updated 5 months ago
- CS194-196 Course Project☆15Updated 8 months ago
- Gradient Boosting Reinforcement Learning (GBRL)☆122Updated this week
- Official codebase for "Quantile Reward Policy Optimization: Alignment with Pointwise Regression and Exact Partition Functions" (Matrenok …☆27Updated 3 months ago
- Efficiently discovering algorithms via LLMs with evolutionary search and reinforcement learning.☆116Updated 2 weeks ago
- ☆23Updated last year
- This repository contains code for the paper "Learning Decision Trees as Amortized Structure Inference"☆15Updated 7 months ago
- Official source code for "Graph Neural Networks for Learning Equivariant Representations of Neural Networks". In ICLR 2024 (oral).☆83Updated last year
- ☆34Updated 11 months ago
- ☆230Updated this week
- Trust Region Preference Approximation: A simple and stable reinforcement learning algorithm for LLM reasoning☆13Updated 4 months ago
- A Recipe for Building LLM Reasoners to Solve Complex Instructions☆27Updated last month
- The repository contains code for Adaptive Data Optimization☆27Updated 10 months ago
- Resa: Transparent Reasoning Models via SAEs☆44Updated last month
- ☆20Updated this week
- Multi-Agent Verification: Scaling Test-Time Compute with Multiple Verifiers☆23Updated 8 months ago
- implementation of dualformer☆24Updated 8 months ago
- A collection of AWESOME language modeling techniques on tabular data applications.☆32Updated last year
- ☆29Updated 4 months ago
- ☆59Updated 3 weeks ago
- ☆34Updated last year