The code for paper "Rethinking LLM-as-a-Judge: Representation-as-a-Judge with Small Language Models via Semantic Capacity Asymmetry", accepted by ICLR 2026.
☆71Feb 3, 2026Updated 3 months ago
Alternatives and similar repositories for Representation-as-a-judge
Users that are interested in Representation-as-a-judge are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Official implementation of FRAPPE: Infusing World Modeling into Generalist Policies via Multiple Future Representation Alignment☆45Mar 24, 2026Updated last month
- [CVPR 2026] Official repo for "EVATok: Adaptive Length Video Tokenization for Efficient Visual Autoregressive Generation"☆58Mar 13, 2026Updated 2 months ago
- OmniStream: Mastering Perception, Reconstruction and Action in Continuous Streams☆99Mar 15, 2026Updated 2 months ago
- [CVPR 2026] Official code of "EmbodiedSplat: Online Feed-Forward Semantic 3DGS for Open-Vocabulary 3D Scene Understanding"☆82Mar 7, 2026Updated 2 months ago
- daVinci-Agency: Unlocking Long-Horizon Agency Data-Efficiently☆39Feb 4, 2026Updated 3 months ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- VLS: Steering Pretrained Robot Policies via Vision–Language Models☆57Mar 29, 2026Updated last month
- [ArXiv 26] The official repository of "ArtHOI: Articulated Human-Object Interaction Synthesis by 4D Reconstruction from Video Priors".☆35Mar 5, 2026Updated 2 months ago
- Residual Context Diffusion (RCD): Repurposing discarded signals as structured priors for high-performance reasoning in dLLMs.☆56Mar 12, 2026Updated 2 months ago
- M³: Dense Matching Meets Multi-View Foundation Models for Monocular Gaussian Splatting SLAM☆69Mar 18, 2026Updated 2 months ago
- TBD☆56Mar 13, 2026Updated 2 months ago
- Official Implementation of "ToolSafe: Enhancing Tool Invocation Safety of LLM-based Agents via Proactive Step-level Guardrail and Feedbac…☆56Mar 25, 2026Updated last month
- [ACL'26] EvoToken-DLM (Beyond Hard Masks: Progressive Token Evolution for Diffusion Language)☆48Apr 7, 2026Updated last month
- In-Context Reinforcement Learning for Tool Use in Large Language Models☆47Mar 26, 2026Updated last month
- Open Ended Medical Reinforcement Learning☆54Mar 15, 2026Updated 2 months ago
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- ☆36Jan 30, 2026Updated 3 months ago
- ☆24Mar 9, 2023Updated 3 years ago
- ☆50Apr 22, 2025Updated last year
- ☆44Mar 23, 2026Updated last month
- MHLA: Restoring Expressivity of Linear Attention via Token-Level Multi-Head (ICLR 2026)☆147May 8, 2026Updated last week
- [ACL 2026 Findings] "Omni-R1: Towards the Unified Generative Paradigm for Multimodal Reasoning"☆62Jan 28, 2026Updated 3 months ago
- The evaluation code for A Safety Report on GPT-5.2, Gemini 3 Pro, Qwen3-VL, Grok 4.1 Fast, Nano Banana Pro, and Seedream 4.5☆53Jan 18, 2026Updated 4 months ago
- Medical SAM3: A Foundation Model for Universal Prompt-Driven Medical Image Segmentation