qqr is an RL training framework for open-ended agents.
☆257 · Jun 15, 2026 · Updated 2 weeks ago
Alternatives and similar repositories for qqr
Users that are interested in qqr are comparing it to the libraries listed below.
- The code for the paper "Pre-trained Vision-Language Models Learn Discoverable Concepts" · ☆21 · Jun 5, 2024 · Updated 2 years ago
- [COLM 2025] JailDAM: Jailbreak Detection with Adaptive Memory for Vision-Language Model · ☆26 · Nov 25, 2025 · Updated 7 months ago
- Source code for our paper: "ARIA: Training Language Agents with Intention-Driven Reward Aggregation". · ☆30 · Aug 9, 2025 · Updated 10 months ago
- Official Code Repository for [AutoScale📈: Scale-Aware Data Mixing for Pre-Training LLMs] Published as a conference paper at **COLM 2025*… · ☆14 · Aug 8, 2025 · Updated 10 months ago
- A unified suite for generating elite reasoning problems and training high-performance LLMs, including pioneering attention-free architect… · ☆132 · Jan 31, 2026 · Updated 5 months ago
- X-Reasoner: Towards Generalizable Reasoning Across Modalities and Domains · ☆49 · Feb 4, 2026 · Updated 5 months ago
- [ICML 2025] Closed-Loop Long-Horizon Robotic Planning via Equilibrium Sequence Modeling · ☆13 · May 5, 2025 · Updated last year
- ☆13 · May 21, 2023 · Updated 3 years ago
- A framework for benchmarking embedding models in hybrid search scenarios (BM25 + vector search) using Weaviate. · ☆40 · Jun 17, 2026 · Updated 2 weeks ago
- ☆29 · Mar 10, 2026 · Updated 3 months ago
- Official repository for "Unveiling Opinion Evolution via Prompting and Diffusion for Short Video Fake News Detection", ACL Findings 2024. · ☆15 · Apr 25, 2025 · Updated last year
- A benchmark suite for testing logical reasoning abilities of prompt-based models · ☆31 · Nov 20, 2023 · Updated 2 years ago
- [ICCV 2023] Going Beyond Nouns With Vision & Language Models Using Synthetic Data · ☆13 · Sep 30, 2023 · Updated 2 years ago
- ☆28 · Aug 13, 2025 · Updated 10 months ago
- Customized Inference Engine for Multiverse Models · ☆25 · Jun 27, 2025 · Updated last year
- Cloud Sky Eye: an early-warning platform for finding missing children · ☆10 · Jan 9, 2020 · Updated 6 years ago
- "A Survey on Agent-as-a-Judge" · ☆132 · May 11, 2026 · Updated last month
- TTRV: Test-Time Reinforcement Learning for Vision–Language Models (CVPR 2026) · ☆45 · Mar 8, 2026 · Updated 3 months ago
- Official repository of "Distort, Distract, Decode: Instruction-Tuned Model Can Refine its Response from Noisy Instructions", ICLR 2024 Sp… · ☆21 · Mar 7, 2024 · Updated 2 years ago
- A curated list of papers & resources linked to concept learning · ☆13 · Aug 9, 2023 · Updated 2 years ago
- ☆52 · Feb 12, 2025 · Updated last year
- Official Repository for paper "HERMES: KV Cache as Hierarchical Memory for Efficient Streaming Video Understanding" [ACL 2026] · ☆90 · May 8, 2026 · Updated last month
- Repo for Paper "From Role-Play to Drama-Interaction: An LLM Solution" @ACL 2024 · ☆13 · Jul 25, 2024 · Updated last year
- TreeRL: LLM Reinforcement Learning with On-Policy Tree Search in ACL'25 · ☆97 · Jun 16, 2025 · Updated last year
- ☆29 · Aug 25, 2024 · Updated last year
- [ICLR 2026] Official Implementation of ProxyThinker: Test-Time Guidance through Small Visual Reasoners. · ☆22 · Sep 24, 2025 · Updated 9 months ago
- [BMVC22] Official Implementation of ViCHA: "Efficient Vision-Language Pretraining with Visual Concepts and Hierarchical Alignment" · ☆54 · Oct 20, 2022 · Updated 3 years ago
- ☆155 · Mar 12, 2025 · Updated last year
- [ICLR 2025] "Noisy Test-Time Adaptation in Vision-Language Models" · ☆13 · Feb 22, 2025 · Updated last year
- Suri: Multi-constraint instruction following for long-form text generation [EMNLP’24] · ☆27 · Oct 3, 2025 · Updated 9 months ago
- Source code of paper "Systematic Assessment of Factual Knowledge in Large Language Models" - EMNLP Findings 2023 · ☆17 · Mar 17, 2026 · Updated 3 months ago
- Code implementation for paper "On the Efficacy of Small Self-Supervised Contrastive Models without Distillation Signals". · ☆17 · Dec 15, 2021 · Updated 4 years ago
- Code for ICLR 2024 paper "CRAFT: Customizing LLMs by Creating and Retrieving from Specialized Toolsets" · ☆62 · Jun 3, 2024 · Updated 2 years ago
- A paper list of Weakly Supervised Object Detection (WSOD) resources. · ☆13 · May 6, 2021 · Updated 5 years ago
- NeurIPS-2023: Data Pruning via Moving-one-Sample-out · ☆10 · May 21, 2026 · Updated last month
- ACM MULTIMEDIA CONFERENCE 2020 · ☆11 · Jul 28, 2020 · Updated 5 years ago
- slime is an LLM post-training framework for RL Scaling. · ☆7,099 · Updated this week
- Code of CropMix: Sampling a Rich Input Distribution via Multi-Scale Cropping · ☆17 · Oct 8, 2022 · Updated 3 years ago
- ☆18 · Sep 5, 2024 · Updated last year