Official implementation for the paper "Can Large Reasoning Models Self-Train?"
☆76Oct 10, 2025Updated 8 months ago
Alternatives and similar repositories for srt
Users that are interested in srt are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- [NeurIPS 2025] First SFT, Second RL, Third UPT: Continual Improving Multi-Modal LLM Reasoning via Unsupervised Post-Training☆87Oct 29, 2025Updated 7 months ago
- Materials for "Multi-property Steering of Large Language Models with Dynamic Activation Composition"☆14Nov 22, 2024Updated last year
- ☆31Apr 14, 2026Updated 2 months ago
- Implementation for our TOIS paper --- Attentive Long Short-Term Preference Modeling for Personalized Product Search.☆19Feb 14, 2020Updated 6 years ago
- 补充了一些Visualglm缺少的文件,可以对Visualglm进行训练,实例中是对人脸做了面相的识别☆13Jun 7, 2023Updated 3 years ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- KnowRL: Exploring Knowledgeable Reinforcement Learning for Factuality☆47May 19, 2026Updated last month
- [ICML2026] ARLArena☆80May 2, 2026Updated last month
- Graph QABot Demo| 图谱问答案例☆15Apr 11, 2023Updated 3 years ago
- (UNUSED) Early endpoint Void used to check for updates. Replaced by new build pipeline.☆12Dec 12, 2025Updated 6 months ago
- 构建一个医疗领域知识图谱和一个基于Flask的简易网页聊天机器人,通过ner获取用户问题的实体并在知识图谱内提取答案。☆12Apr 25, 2023Updated 3 years ago
- [ICML2025] Official Repo for Paper "Optimizing Temperature for Language Models with Multi-Sample Inference"☆23Feb 16, 2025Updated last year
- A minimal TPU compatible Jax implementation of NeRF: Representing Scenes as Neural Radiance Fields for View Synthesis.☆13Apr 21, 2022Updated 4 years ago
- Implementation of my agent used in 2025 AFAC TianChi competition☆28Oct 6, 2025Updated 8 months ago
- ☆30Jun 19, 2023Updated 3 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Models, data, and codes for the paper: MetaAligner: Towards Generalizable Multi-Objective Alignment of Language Models☆25Sep 26, 2024Updated last year
- ☆15Jun 30, 2025Updated 11 months ago
- Training an LLM to use a calculator with multi-turn reinforcement learning, achieving a **62% absolute increase in evaluation accuracy**.☆72May 5, 2025Updated last year
- ☆17Nov 20, 2024Updated last year
- Official repository of paper "RNNs Are Not Transformers (Yet): The Key Bottleneck on In-context Retrieval"☆27Apr 17, 2024Updated 2 years ago
- ☆25May 10, 2023Updated 3 years ago
- [ACM MM 2023] QA-CLIMS: Question-Answer Cross Language Image Matching for Weakly Supervised Semantic Segmentation☆13Jun 14, 2024Updated 2 years ago
- The official PyTorch implementation of Logical Consistency and Greater Descriptive Power for Facial Hair Attribute Learning - CVPR 2023☆12Aug 31, 2024Updated last year
- Testing DeepSpeed integration in 🤗 Accelerate☆11Jun 28, 2022Updated 3 years ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- [CVPR'25] Attention IoU: Examining Biases in CelebA using Attention Maps☆13Mar 26, 2025Updated last year
- ☆19Dec 2, 2025Updated 6 months ago
- [MM 2023] Toward High Quality Facial Representation Learning☆19Oct 30, 2023Updated 2 years ago
- ☆15Jun 30, 2023Updated 2 years ago
- A THU beamer template based on PKU beamer template☆32Aug 26, 2025Updated 9 months ago
- ☆20Jun 10, 2025Updated last year
- ☆369Aug 12, 2025Updated 10 months ago
- ☆25Mar 17, 2026Updated 3 months ago
- ☆53Aug 24, 2025Updated 9 months ago
- GPUs on demand by Runpod - Special Offer Available • AdRun AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
- ☆24Sep 11, 2025Updated 9 months ago
- PLM: Efficient Peripheral Language Models Hardware-Co-Designed for Ubiquitous Computing☆21Mar 18, 2025Updated last year
- Testing Difference Target Propagation (DTP) on MNIST.☆13Oct 12, 2020Updated 5 years ago
- [NeurIPS 2024] A Large-Scale Human-Centric Benchmark for Referring Expression Comprehension in the LMM Era☆10Aug 6, 2024Updated last year
- ☆25Jan 29, 2026Updated 4 months ago
- [ACL2026] Uni-MMMU : A Massive Multi-discipline Multimodal Unified Benchmark☆25Apr 13, 2026Updated 2 months ago
- Python wrapper for lean-gym☆13Apr 5, 2023Updated 3 years ago