An adaptive sampling framework for Reinforce-style LLM post training.
☆90Nov 29, 2025Updated 3 months ago
Alternatives and similar repositories for Reinforce-Ada
Users that are interested in Reinforce-Ada are comparing it to the libraries listed below
Sorting:
- A collection of research papers on hypervisor testing.☆56Jan 31, 2026Updated last month
- ☆18Jun 10, 2025Updated 8 months ago
- Symphony — A decentralized multi-agent framework that enables intelligent agents to collaborate seamlessly across heterogeneous edge devi…☆30Oct 30, 2025Updated 4 months ago
- UR2: Unify RAG and Reasoning through Reinforcement Learning☆127Nov 19, 2025Updated 3 months ago
- Exploring Gemini-2.5-Flash-Image in medical imaging—segmentation, simulation, and cross-modal understanding with synthetic examples.☆35Dec 14, 2025Updated 2 months ago
- ☆13Feb 2, 2025Updated last year
- [USENIX Security'25] THEMIS: Towards Practical Intellectual Property Protection for Post-Deployment On-Device Deep Learning Models☆108Aug 13, 2025Updated 6 months ago
- Logic-in-frames: Dynamic keyframe search via visual semantic-logical verification for long video understanding☆58Nov 17, 2025Updated 3 months ago
- Leveraging AI, this solution boosts 360° video quality through 4x upscaling with Real-ESRGAN. It integrates GFPGAN for smart face enhance…☆23Jun 27, 2025Updated 8 months ago
- TARS: MinMax Token-Adaptive Preference Strategy for Hallucination Reduction in MLLMs☆23Sep 21, 2025Updated 5 months ago
- Dynamic human image animation with strong identity preservation, heterogeneous character driving, and controllable backgrounds.☆140May 23, 2025Updated 9 months ago
- ☆14Apr 25, 2025Updated 10 months ago
- [ICLR 2026] BARREL: Boundary-Aware Reasoning for Factual and Reliable LRMs☆17May 21, 2025Updated 9 months ago
- ☆223Nov 5, 2025Updated 3 months ago
- ☆12Jan 10, 2025Updated last year
- Gotta Hear Them All: Towards Sound Source Aware Audio Generation.☆67Nov 15, 2025Updated 3 months ago
- a iOS network debug library ,It can monitor HTTP requests within the App and displays information related to the request.☆15Apr 17, 2017Updated 8 years ago
- ☆24May 13, 2025Updated 9 months ago
- ☆117Aug 29, 2025Updated 6 months ago
- switch2ai - A JetBrains IDE plugin enabling seamless collaboration between JetBrains IDEs and various AI agents (Cursor, Qoder, Claude co…☆170Nov 11, 2025Updated 3 months ago
- ☆55Updated this week
- (NeurIPS 2024) BiDM: Pushing the Limit of Quantization for Diffusion Models☆22Nov 20, 2024Updated last year
- [CVPR'25] Official PyTorch implementation of AvatarArtist: Open-Domain 4D Avatarization.☆277Jun 14, 2025Updated 8 months ago
- ☆104Oct 8, 2025Updated 4 months ago
- Vocabulary Parallelism☆25Mar 10, 2025Updated 11 months ago
- Official Implementation of FastKV: Decoupling of Context Reduction and KV Cache Compression for Prefill-Decoding Acceleration☆29Nov 22, 2025Updated 3 months ago
- The official implementation of Mantis: A Versatile Vision-Language-Action Model with Disentangled Visual Foresight☆82Jan 16, 2026Updated last month
- AI-powered tool for analyzing GitHub trending repositories and URL metadata☆25Feb 23, 2026Updated last week
- ☆31Sep 1, 2025Updated 6 months ago
- APRIL: Active Partial Rollouts in Reinforcement Learning to Tame Long-tail Generation. A system-level optimization for scalable LLM tra…☆51Oct 11, 2025Updated 4 months ago
- [NeurIPS 2025] Official implementation of "Reasoning Path Compression: Compressing Generation Trajectories for Efficient LLM Reasoning"☆30Oct 20, 2025Updated 4 months ago
- ☆153Jan 2, 2024Updated 2 years ago
- A Text2SQL benchmark for evaluation of Large Language Models☆41Updated this week
- The first spoken long-text dataset derived from live streams, designed to reflect the redundancy-rich and conversational nature of real-w…☆12Jun 28, 2025Updated 8 months ago
- PipeRAG: Fast Retrieval-Augmented Generation via Algorithm-System Co-design (KDD 2025)☆30Jun 14, 2024Updated last year
- official implementation for paper titled "Training-free Horizon Extension for Autoregressive Video Generation"☆106Feb 17, 2026Updated last week
- Beyond log-likelihood: exploring alternative objectives for supervised fine-tuning of language model post-training☆55Oct 4, 2025Updated 4 months ago
- [ICCV'25] HERMES: temporal-coHERent long-forM understanding with Episodes and Semantics☆38Sep 10, 2025Updated 5 months ago
- [NeurIPS 2024] Official Repository of Multi-Object Hallucination in Vision-Language Models☆34Nov 13, 2024Updated last year