Official implementation for the paper "Can Large Reasoning Models Self-Train?"
☆74Oct 10, 2025Updated 6 months ago
Alternatives and similar repositories for srt
Users that are interested in srt are comparing it to the libraries listed below.
- AAAI2025☆12Apr 18, 2025Updated last year
- [NeurIPS 2025] Unsupervised Post-Training for Multi-Modal LLM Reasoning via GRPO☆81Oct 29, 2025Updated 5 months ago
- Materials for "Multi-property Steering of Large Language Models with Dynamic Activation Composition"☆14Nov 22, 2024Updated last year
- VisualToolAgent (VisTA): A Reinforcement Learning Framework for Visual Tool Selection☆26May 31, 2025Updated 10 months ago
- ☆31Apr 12, 2026Updated last week
- Implementation for our TOIS paper --- Attentive Long Short-Term Preference Modeling for Personalized Product Search.☆19Feb 14, 2020Updated 6 years ago
- ☆12Oct 2, 2023Updated 2 years ago
- ☆18Feb 7, 2024Updated 2 years ago
- KnowRL: Exploring Knowledgeable Reinforcement Learning for Factuality☆40Dec 1, 2025Updated 4 months ago
- [ICML2025] Official Repo for Paper "Optimizing Temperature for Language Models with Multi-Sample Inference"☆22Feb 16, 2025Updated last year
- ☆30Jun 19, 2023Updated 2 years ago
- Matching Natural Language Sentences with Hierarchical Sentence Factorization☆22Apr 26, 2018Updated 7 years ago
- ☆15Sep 24, 2022Updated 3 years ago
- Models, data, and codes for the paper: MetaAligner: Towards Generalizable Multi-Objective Alignment of Language Models☆25Sep 26, 2024Updated last year
- ☆15Jun 30, 2025Updated 9 months ago
- ☆18Aug 7, 2025Updated 8 months ago
- The official repo for DARG: Dynamic Evaluation of Large Language Models via Adaptive Reasoning Graph☆18Oct 13, 2024Updated last year
- Examples for KubeEdge☆13Sep 29, 2020Updated 5 years ago
- Official repository of paper "RNNs Are Not Transformers (Yet): The Key Bottleneck on In-context Retrieval"☆27Apr 17, 2024Updated 2 years ago
- ☆18Dec 2, 2025Updated 4 months ago
- ☆10Jul 13, 2024Updated last year
- ☆333Aug 12, 2025Updated 8 months ago
- ☆19Jun 10, 2025Updated 10 months ago
- PLM: Efficient Peripheral Language Models Hardware-Co-Designed for Ubiquitous Computing☆21Mar 18, 2025Updated last year
- ☆21Sep 11, 2025Updated 7 months ago
- Look Back to Reason Forward: Revisitable Memory for Long-Context LLM Agents☆31Updated this week
- ☆22Jan 29, 2026Updated 2 months ago
- ☆13Apr 23, 2025Updated 11 months ago
- R files containing the code used to predict rugby world cup matches☆10Sep 18, 2015Updated 10 years ago
- DP-HyperparamTuning offers an array of tools for fast and easy hypertuning of various hyperparameters for the DP-SGD algorithm.☆23Sep 27, 2021Updated 4 years ago
- ☆23Jun 5, 2025Updated 10 months ago
- [ICML 2025 Spotlight] RAPID: Long-Context Inference with Retrieval-Augmented Speculative Decoding☆22Mar 2, 2025Updated last year
- Code for our paper Resources and Evaluations for Multi-Distribution Dense Information Retrieval☆16Jan 16, 2024Updated 2 years ago
- 🚀 Train a VLA "large model" yourself, end to end. 🌏 Train a 26M-parameter visual multimodal VLM from scratch in just 1 hour!☆30Oct 16, 2025Updated 6 months ago
- The official GitHub page for ''What Makes for Good Visual Instructions? Synthesizing Complex Visual Reasoning Instructions for Visual Ins…☆19Nov 10, 2023Updated 2 years ago
- ☆358Jul 29, 2025Updated 8 months ago
- [NeurIPS 2025] Official Implementation of paper "Sherlock: Self-Correcting Reasoning in Vision-Language Models"☆29Sep 18, 2025Updated 7 months ago
- Code for paper "Conversational Product Search Based on Negative Feedback"☆12Jun 26, 2020Updated 5 years ago
- Code on IART: Intent-aware Response Ranking with Transformers in Information-seeking Conversation Systems (WWW 2020)☆11Apr 18, 2021Updated 5 years ago