Official Code Release for "Training a Generally Curious Agent"
☆46May 18, 2025Updated 11 months ago
Alternatives and similar repositories for paprika
Users that are interested in paprika are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Learning from preferences is a common paradigm for fine-tuning language models. Yet, many algorithmic design decisions come into play. Ou…☆32Apr 20, 2024Updated last year
- UGround: Towards Unified Visual Grounding with Unrolled Transformers☆22Feb 15, 2026Updated 2 months ago
- [ICLR 2026] Information Gain-based Policy Optimization: A Simple and Effective Approach for Multi-Turn Search Agents☆55Updated this week
- ☆10Mar 13, 2023Updated 3 years ago
- ☆10Nov 17, 2022Updated 3 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- ReCross: Unsupervised Cross-Task Generalization via Retrieval Augmentation☆23May 1, 2022Updated 3 years ago
- Martingale posterior neural networks for fast sequential decision making @ Neurips 2025☆23Nov 13, 2025Updated 5 months ago
- [ACL'25] UTBoost: Rigorous Evaluation of Coding Agents on SWE-Bench☆36Aug 12, 2025Updated 8 months ago
- 一个基于 Flask 的问卷调查应用。☆11Feb 2, 2023Updated 3 years ago
- ☆12Jul 4, 2024Updated last year
- Safety Verification and Robustness Analysis of Neural Networks via Quadratic Constraints and Semidefinite Programming.☆14Jun 28, 2021Updated 4 years ago
- The official implement of "Accelerating Multimodal Large Language Models via Dynamic Visual-Token Exit and the Empirical Findings"☆18Dec 5, 2024Updated last year
- Official Implementation of UTrice: Unifying Primitives in Differentiable Ray Tracing and Rasterization via Triangles for Particle-Based 3…☆30Jan 13, 2026Updated 3 months ago
- ☆14Jun 24, 2024Updated last year
- Serverless GPU API endpoints on Runpod - Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- Code repository for the paper on "Predicting the Performance of Black-Box LLMs through Self-Queries".☆12Jan 9, 2025Updated last year
- A framework for benchmarking embedding models in hybrid search scenarios (BM25 + vector search) using Weaviate.☆38Updated this week
- [COLM 2025] Official code for "When To Solve, When To Verify: Compute-Optimal Problem Solving and Generative Verification for LLM Reasoni…☆15Oct 31, 2025Updated 5 months ago
- ViLoMem: Agentic Learner with Grow-and-Refine Multimodal Semantic Memory☆59Nov 27, 2025Updated 4 months ago
- ACL2024: TTM-RE Memory-Augmented Document-Level Relation Extraction☆20Oct 6, 2024Updated last year
- Code to accompany the paper "The Information Geometry of Unsupervised Reinforcement Learning"☆20Oct 6, 2021Updated 4 years ago
- [ICLR 2025] ELICIT: LLM Augmentation Via External In-context Capability☆14Mar 11, 2025Updated last year
- CLARA: Confidence of Labels and Raters☆10Jun 3, 2023Updated 2 years ago
- The asterai CLI and runtime for running WASM components bundled in environments.☆20Mar 21, 2026Updated 3 weeks ago
- Wordpress hosting with auto-scaling - Free Trial • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- [ICLR 2026] SwiReasoning: Switch-Thinking in Latent and Explicit for Pareto-Superior Reasoning LLMs☆57Updated this week
- ☆11Nov 23, 2020Updated 5 years ago
- [NeurIPS 2025] Implementation for the paper "The Surprising Effectiveness of Negative Reinforcement in LLM Reasoning"☆166Mar 2, 2026Updated last month
- Code Repo for paper Label Leakage and Protection in Two-party Split Learning (ICLR 2022).☆22Mar 12, 2022Updated 4 years ago
- This repository provides the official implementation of QSVD, a method for efficient low-rank approximation that unifies Query-Key-Value …☆26Dec 1, 2025Updated 4 months ago
- Representation Learning in RL☆13Jun 1, 2022Updated 3 years ago
- Code for Representation Bending Paper☆17Jul 15, 2025Updated 9 months ago
- "what, how, where, and how well? a survey on test-time scaling in large language models" repository☆95Updated this week
- LLMs represent numbers on a helix and manipulate that helix to do addition.☆29Feb 4, 2025Updated last year
- Wordpress hosting with auto-scaling - Free Trial • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- Code for the main RoboTutor app. Many sound and image assets not included.☆14Nov 5, 2019Updated 6 years ago
- The code to reproduce CVPR 2021 paper "Towards Robust Classification Model by Counterfactual and Invariant Data Generation"☆16Jul 29, 2021Updated 4 years ago
- Benchmark and research code for the paper SWEET-RL Training Multi-Turn LLM Agents onCollaborative Reasoning Tasks☆266May 5, 2025Updated 11 months ago
- ☆494Apr 7, 2026Updated last week
- [CCS 2025] DPImageBench is an open-source toolkit developed to facilitate the research and application of DP image synthesis.☆32Feb 19, 2026Updated 2 months ago
- Code for "RSQ: Learning from Important Tokens Leads to Better Quantized LLMs"☆21Mar 25, 2026Updated 3 weeks ago
- ☆17Nov 7, 2024Updated last year