Official implementation for the paper "Can Large Reasoning Models Self-Train?"
☆75Oct 10, 2025Updated 7 months ago
Alternatives and similar repositories for srt
Users that are interested in srt are comparing it to the libraries listed below.
- A PyTorch native library for large model training☆28Apr 1, 2026Updated last month
- [NeurIPS 2025] Unsupervised Post-Training for Multi-Modal LLM Reasoning via GRPO☆84Oct 29, 2025Updated 6 months ago
- ICML 2025 Spotlight, PCEvolve: Private Contrastive Evolution for Synthetic Dataset Generation via Few-Shot Private Data and Generative AP…☆14Jun 27, 2025Updated 10 months ago
- Materials for "Multi-property Steering of Large Language Models with Dynamic Activation Composition"☆14Nov 22, 2024Updated last year
- VisualToolAgent (VisTA): A Reinforcement Learning Framework for Visual Tool Selection☆26May 31, 2025Updated 11 months ago
- Adds some files missing from Visualglm so that Visualglm can be trained; the example performs physiognomy (face-reading) recognition on human faces☆13Jun 7, 2023Updated 2 years ago
- OpenSRH is the first ever publicly available stimulated Raman histology (SRH) dataset and benchmark, which will facilitate the clinical t…☆13Oct 13, 2022Updated 3 years ago
- KnowRL: Exploring Knowledgeable Reinforcement Learning for Factuality☆44Dec 1, 2025Updated 5 months ago
- Graph QABot Demo | Knowledge graph question-answering example☆15Apr 11, 2023Updated 3 years ago
- [NeurIPS 2023] Differentially Private Image Classification by Learning Priors from Random Processes☆12Jun 12, 2023Updated 2 years ago
- [ICML2025] Official Repo for Paper "Optimizing Temperature for Language Models with Multi-Sample Inference"☆22Feb 16, 2025Updated last year
- Implementation of my agent used in 2025 AFAC TianChi competition☆27Oct 6, 2025Updated 7 months ago
- ☆30Jun 19, 2023Updated 2 years ago
- ☆18Aug 7, 2025Updated 9 months ago
- The official repo for DARG: Dynamic Evaluation of Large Language Models via Adaptive Reasoning Graph☆18Oct 13, 2024Updated last year
- Official repository of paper "RNNs Are Not Transformers (Yet): The Key Bottleneck on In-context Retrieval"☆27Apr 17, 2024Updated 2 years ago
- Official PyTorch implementation of WPS from our paper: WPS-SAM: Towards Weakly-Supervised Part Segmentation with Foundation Models☆14Jun 12, 2025Updated 10 months ago
- [CVPR'25] Attention IoU: Examining Biases in CelebA using Attention Maps☆13Mar 26, 2025Updated last year
- ☆10Jul 13, 2024Updated last year
- ☆346Aug 12, 2025Updated 8 months ago
- ☆20Jun 10, 2025Updated 11 months ago
- Helps create high-quality figures for use in manuscripts and presentations.☆12May 14, 2025Updated 11 months ago
- [MICCAI2022] Estimating Model Performance under Domain Shifts with Class-Specific Confidence Scores.☆12Jun 7, 2024Updated last year
- ☆54Aug 24, 2025Updated 8 months ago
- PLM: Efficient Peripheral Language Models Hardware-Co-Designed for Ubiquitous Computing☆21Mar 18, 2025Updated last year
- A multimodal large-scale model, which performs close to the closed-source Qwen-VL-PLUS on many datasets and significantly surpasses the p…☆14Feb 5, 2024Updated 2 years ago
- [NeurIPS 2024] A Large-Scale Human-Centric Benchmark for Referring Expression Comprehension in the LMM Era☆10Aug 6, 2024Updated last year
- ☆25Jan 29, 2026Updated 3 months ago
- ☆13Apr 23, 2025Updated last year
- Data-OOB: Out-of-bag Estimate as a Simple and Efficient Data Value (ICML 2023)☆21Jul 26, 2023Updated 2 years ago
- [ACL2026] Uni-MMMU: A Massive Multi-discipline Multimodal Unified Benchmark☆25Apr 13, 2026Updated 3 weeks ago
- R files containing the code used to predict rugby world cup matches☆11Sep 18, 2015Updated 10 years ago
- Python wrapper for lean-gym☆13Apr 5, 2023Updated 3 years ago
- [ICML 2025 Spotlight] RAPID: Long-Context Inference with Retrieval-Augmented Speculative Decoding☆22Mar 2, 2025Updated last year
- Code for our paper Resources and Evaluations for Multi-Distribution Dense Information Retrieval☆16Jan 16, 2024Updated 2 years ago
- [ICCV 2023] Code for "Multi-task View Synthesis with Neural Radiance Fields"☆11Oct 2, 2023Updated 2 years ago
- The official GitHub page for ''What Makes for Good Visual Instructions? Synthesizing Complex Visual Reasoning Instructions for Visual Ins…☆19Nov 10, 2023Updated 2 years ago
- ☆15Apr 26, 2025Updated last year
- [ICLR 2026] MMDuet2: Enhancing Proactive Interaction of Video MLLMs with Multi-Turn Reinforcement Learning☆31Jan 14, 2026Updated 3 months ago