Official implementation for the paper "Can Large Reasoning Models Self-Train?"
☆73Oct 10, 2025Updated 5 months ago
Alternatives and similar repositories for srt
Users who are interested in srt are comparing it to the libraries listed below.
- AAAI2025☆11Apr 18, 2025Updated 11 months ago
- [NeurIPS 2025] Unsupervised Post-Training for Multi-Modal LLM Reasoning via GRPO☆81Oct 29, 2025Updated 5 months ago
- ICML 2025 Spotlight, PCEvolve: Private Contrastive Evolution for Synthetic Dataset Generation via Few-Shot Private Data and Generative AP…☆14Jun 27, 2025Updated 9 months ago
- Materials for "Multi-property Steering of Large Language Models with Dynamic Activation Composition"☆14Nov 22, 2024Updated last year
- VisualToolAgent (VisTA): A Reinforcement Learning Framework for Visual Tool Selection☆26May 31, 2025Updated 9 months ago
- ☆31Dec 3, 2025Updated 3 months ago
- Implementation for our TOIS paper --- Attentive Long Short-Term Preference Modeling for Personalized Product Search.☆19Feb 14, 2020Updated 6 years ago
- ☆12Oct 2, 2023Updated 2 years ago
- OpenSRH is the first ever publicly available stimulated Raman histology (SRH) dataset and benchmark, which will facilitate the clinical t…☆13Oct 13, 2022Updated 3 years ago
- ☆18Feb 7, 2024Updated 2 years ago
- KnowRL: Exploring Knowledgeable Reinforcement Learning for Factuality☆40Dec 1, 2025Updated 3 months ago
- Vision-Language based Visual Object Tracking☆29Oct 10, 2025Updated 5 months ago
- [NeurIPS 2023] Differentially Private Image Classification by Learning Priors from Random Processes☆12Jun 12, 2023Updated 2 years ago
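- Builds a medical-domain knowledge graph and a simple Flask-based web chatbot; uses NER to extract entities from the user's question and retrieves answers from the knowledge graph.☆12Apr 25, 2023Updated 2 years ago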
- ☆19Jun 10, 2025Updated 9 months ago
- Models, data, and codes for the paper: MetaAligner: Towards Generalizable Multi-Objective Alignment of Language Models☆25Sep 26, 2024Updated last year
- ☆19Aug 7, 2025Updated 7 months ago
- Training an LLM to use a calculator with multi-turn reinforcement learning, achieving a **62% absolute increase in evaluation accuracy**.☆69May 5, 2025Updated 10 months ago
- This is the implementation of k-space cold diffusion model for accelerated MRI reconstruction.☆21Oct 12, 2024Updated last year
- The official repo for DARG: Dynamic Evaluation of Large Language Models via Adaptive Reasoning Graph☆18Oct 13, 2024Updated last year
- Official repository of paper "RNNs Are Not Transformers (Yet): The Key Bottleneck on In-context Retrieval"☆27Apr 17, 2024Updated last year
- Code repository for our paper titled "Near real-time intraoperative brain tumor diagnosis using stimulated Raman histology and deep neura…☆55Mar 18, 2020Updated 6 years ago
- The official PyTorch implementation of Logical Consistency and Greater Descriptive Power for Facial Hair Attribute Learning - CVPR 2023☆12Aug 31, 2024Updated last year
- PLM: Efficient Peripheral Language Models Hardware-Co-Designed for Ubiquitous Computing☆20Mar 18, 2025Updated last year
- ☆10Jul 13, 2024Updated last year
- ☆20Sep 11, 2025Updated 6 months ago
- Look Back to Reason Forward: Revisitable Memory for Long-Context LLM Agents☆27Mar 9, 2026Updated 3 weeks ago
- ☆22Jan 29, 2026Updated 2 months ago
- [NeurIPS 2024] A Large-Scale Human-Centric Benchmark for Referring Expression Comprehension in the LMM Era☆11Aug 6, 2024Updated last year
- ☆22Feb 13, 2026Updated last month
- ☆23Jun 5, 2025Updated 9 months ago
- Python wrapper for lean-gym☆13Apr 5, 2023Updated 2 years ago
- [ICML 2025 Spotlight] RAPID: Long-Context Inference with Retrieval-Augmented Speculative Decoding☆20Mar 2, 2025Updated last year
- Code for our paper Resources and Evaluations for Multi-Distribution Dense Information Retrieval☆16Jan 16, 2024Updated 2 years ago
- [ICCV 2023] Code for "Multi-task View Synthesis with Neural Radiance Fields"☆11Oct 2, 2023Updated 2 years ago
- [SDM24] Official code for "Time-Transformer"☆18Sep 30, 2025Updated 5 months ago
- Code for paper "Conversational Product Search Based on Negative Feedback"☆12Jun 26, 2020Updated 5 years ago
- Cross-Self KV Cache Pruning for Efficient Vision-Language Inference☆10Dec 15, 2024Updated last year
- Code on IART: Intent-aware Response Ranking with Transformers in Information-seeking Conversation Systems (WWW 2020)☆11Apr 18, 2021Updated 4 years ago