tajwarfahim/paprika

tajwarfahim / paprika

Official Code Release for "Training a Generally Curious Agent"

☆48

Alternatives and similar repositories for paprika

Users who are interested in paprika are comparing it to the libraries listed below.

Asap7772 / understanding-rlhf
Learning from preferences is a common paradigm for fine-tuning language models. Yet, many algorithmic design decisions come into play. Ou…
☆32 · Apr 20, 2024 · Updated 2 years ago
Gen-Verse / GenEnv
GenEnv: Difficulty-Aligned Co-Evolution Between LLM Agents and Environment Simulators
☆62 · Dec 23, 2025 · Updated 6 months ago
Improbable-AI / orso
☆17 · Feb 22, 2025 · Updated last year
MaxSobolMark / OOO
Official repo for Offline RL for Online RL
☆18 · Oct 14, 2023 · Updated 2 years ago
filteredcophy / FilteredCoPhy
☆10 · Nov 17, 2022 · Updated 3 years ago
vivek3141 / gg-bench
Measuring General Intelligence With Generated Games (Preprint)
☆25 · Jul 30, 2025 · Updated 11 months ago
INK-USC / ReCross
ReCross: Unsupervised Cross-Task Generalization via Retrieval Augmentation
☆23 · May 1, 2022 · Updated 4 years ago
aster2024 / SWIFT
Source code for SWIFT, an efficient reward model.
☆21 · Jan 13, 2026 · Updated 5 months ago
gerdm / martingale-posterior-neural-networks
Martingale posterior neural networks for fast sequential decision making @ NeurIPS 2025
☆25 · Nov 13, 2025 · Updated 7 months ago
HITsz-TMG / ICL-State-Vector
☆12 · Jul 4, 2024 · Updated 2 years ago
xuanyuzhang21 / RALI
[ICLR 2026 Oral] Reasoning as Representation: Rethinking Visual Reinforcement Learning in Image Quality Assessment
☆38 · Feb 14, 2026 · Updated 4 months ago
DoubtedSteam / DyVTE
The official implementation of "Accelerating Multimodal Large Language Models via Dynamic Visual-Token Exit and the Empirical Findings"
☆18 · Dec 5, 2024 · Updated last year
waizui / UTrice
Official Implementation of UTrice: Unifying Primitives in Differentiable Ray Tracing and Rasterization via Triangles for Particle-Based 3…
☆30 · Jan 13, 2026 · Updated 5 months ago
RUCBM / ICLEval
☆14 · Jun 24, 2024 · Updated 2 years ago
dsam99 / QueRE
Code repository for the paper "Predicting the Performance of Black-Box LLMs through Self-Queries".
☆12 · Jan 9, 2025 · Updated last year
machinelearningZH / hybrid-search-eval
A framework for benchmarking embedding models in hybrid search scenarios (BM25 + vector search) using Weaviate.
☆40 · Updated this week
GeWu-Lab / Patch-Matters
[CVPR2025] Code Release of Patch Matters: Training-free Fine-grained Image Caption Enhancement via Local Perception
☆25 · Jun 17, 2025 · Updated last year
chufangao / TTM-RE
ACL 2024: TTM-RE Memory-Augmented Document-Level Relation Extraction
☆21 · Oct 6, 2024 · Updated last year
ben-eysenbach / info_geometry
Code to accompany the paper "The Information Geometry of Unsupervised Reinforcement Learning"
☆20 · Oct 6, 2021 · Updated 4 years ago
yuziGuo / FarOptBasis
☆14 · Aug 27, 2023 · Updated 2 years ago
facebookresearch / clara
CLARA: Confidence of Labels and Raters
☆11 · Jun 3, 2023 · Updated 3 years ago
MasterVito / SvS
Official Repo for SvS: A Self-play with Variational Problem Synthesis strategy for RLVR training
☆54 · Dec 13, 2025 · Updated 6 months ago
lishiqianhugh / IPHYRE
IPHYRE: Interactive Physical Reasoning, ICLR 2024
☆18 · Oct 18, 2024 · Updated last year
weihao-bo / ViLoMem
ViLoMem: Agentic Learner with Grow-and-Refine Multimodal Semantic Memory
☆66 · Apr 21, 2026 · Updated 2 months ago
JackKuo666 / ClinicalTrials-MCP-Server
🔍 Enable AI assistants to search and access ClinicalTrials.gov data through a simple MCP interface.
☆16 · Apr 9, 2025 · Updated last year
PerceptionComputingLab / TDSC-ABUS2023
Official repository of MICCAI 2023 TDSC-ABUS challenge
☆18 · Feb 2, 2025 · Updated last year
TianHongZXY / RLVR-Decomposed
[NeurIPS 2025] Implementation for the paper "The Surprising Effectiveness of Negative Reinforcement in LLM Reasoning"
☆165 · Mar 2, 2026 · Updated 4 months ago
Chen-Junbao / FedCCFA
Classifier Clustering and Feature Alignment for Federated Learning under Distributed Concept Drift [NeurIPS 2024]
☆19 · Oct 25, 2024 · Updated last year
yudasong / briee
Representation Learning in RL
☆13 · Jun 1, 2022 · Updated 4 years ago
lblankl / Short-RL
Short RL
☆18 · Apr 16, 2026 · Updated 2 months ago
EvolvingLMMs-Lab / Evolving-Visual-Generation
[Roadmap] Visual Generation in the New Era: An Evolution from Atomic Mapping to Agentic World Modeling
☆123 · Jun 9, 2026 · Updated last month
AIM-Intelligence / RepBend
Code for Representation Bending Paper
☆17 · Jul 15, 2025 · Updated 11 months ago
Re-Align / AlignTDS
Analyzing LLM Alignment via Token distribution shift
☆17 · Jan 26, 2024 · Updated 2 years ago
subhashk01 / LLM-addition
LLMs represent numbers on a helix and manipulate that helix to do addition.
☆31 · Feb 4, 2025 · Updated last year
XPRIZE / GLEXP-Team-RoboTutor-RoboTutor
Code for the main RoboTutor app. Many sound and image assets not included.
☆14 · Nov 5, 2019 · Updated 6 years ago
facebookresearch / sweet_rl
Benchmark and research code for the paper SWEET-RL: Training Multi-Turn LLM Agents on Collaborative Reasoning Tasks
☆271 · May 5, 2025 · Updated last year
SAI-Lab-NYU / QSVD
This repository provides the official implementation of QSVD, a method for efficient low-rank approximation that unifies Query-Key-Value …
☆28 · May 16, 2026 · Updated last month
john-hewitt / model-editing-canonical-examples
☆14 · Feb 12, 2024 · Updated 2 years ago
MedARC-AI / OpenMidnight
☆49 · Jun 22, 2026 · Updated 2 weeks ago