simpleR1: A Simple Framework for Training R1-like Models
☆30Aug 12, 2025Updated 8 months ago
Alternatives and similar repositories for simpleR1
Users that are interested in simpleR1 are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- A Pytorch implementation of Collaborative Metric Learning (CML)☆11Oct 13, 2020Updated 5 years ago
- [NeurIPS 2024] "Mind the Gap between Prototypes and Images in Cross-domain Finetuning"☆11Nov 15, 2024Updated last year
- [NeurIPS 2024 Oral] "Bayesian-Guided Label Mapping for Visual Reprogramming"☆12Dec 20, 2024Updated last year
- [NeurIPS 2024] "Self-Calibrated Tuning of Vision-Language Models for Out-of-Distribution Detection"☆13Oct 28, 2024Updated last year
- Original PyTorch Implementation for the EMNLP 2023 Paper "Beyond Detection: A Defend-and-Summarize Strategy for Robust and Interpretable …☆16Dec 14, 2023Updated 2 years ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- This is very simple GUI for MDM(Human Motion Diffusion Model).☆12Oct 6, 2022Updated 3 years ago
- ☆10May 18, 2023Updated 2 years ago
- ☆10Jul 16, 2025Updated 9 months ago
- A curated list of Awesome-LLM-Ensemble papers for the survey "Harnessing Multiple Large Language Models: A Survey on LLM Ensemble"☆219Apr 22, 2026Updated last week
- Official repository for the paper "Exploring the Promise and Limits of Real-Time Recurrent Learning" (ICLR 2024)☆13Jun 11, 2025Updated 10 months ago
- This is a sample implementation of "TIMERS: Error-Bounded SVD Restart on Dynamic Networks"(AAAI 2018).☆12Jul 4, 2018Updated 7 years ago
- ☆12Mar 7, 2024Updated 2 years ago
- SemEval-2018 Task 1 Affect in Tweets Evaluation Script☆14Dec 28, 2017Updated 8 years ago
- Author Name Disambiguation☆10Sep 10, 2021Updated 4 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- Models, data, and codes for the paper: MetaAligner: Towards Generalizable Multi-Objective Alignment of Language Models☆25Sep 26, 2024Updated last year
- Source Code for TrustCom2022 Accepted Paper " 'Comments Matter and The More The Better': Improving Rumor Detecion with User Comments".☆19May 23, 2023Updated 2 years ago
- ☆35May 24, 2025Updated 11 months ago
- My personal research notebook with notes, tutorials, and resources written in Jupyterbook.☆21Updated this week
- code and data for Improving Temporal Link Prediction via Temporal Walk Matrix Projection, NeurIPS 2024☆14Oct 5, 2024Updated last year
- Temporal Graph Rewiring Method with Expander Graphs☆12Oct 18, 2024Updated last year
- DeepStyle provides pretrained models aiming to project text in a stylometric space. The base project consists in a new method of represen…☆15Jun 9, 2023Updated 2 years ago
- [ICLR 2024 Spotlight] "Negative Label Guided OOD Detection with Pretrained Vision-Language Models"☆21Oct 23, 2024Updated last year
- This github contains the implementation of the method proposed in MDGNN_BS paper☆12May 9, 2024Updated last year
- End-to-end encrypted email - Proton Mail • AdSpecial offer: 40% Off Yearly / 80% Off First Month. All Proton services are open source and independently audited for security.
- Code for "Nearest Neighbor Classifier Embedded Network for Active Learning", AAAI 2021☆10Feb 3, 2021Updated 5 years ago
- A curated list for interpretable machine learning☆18Jan 4, 2019Updated 7 years ago
- Top-Conference Paper Figure Reproduction & Plotting Skills | 顶会论文图表复现绘制Skills☆109Apr 20, 2026Updated last week
- A Benchmark Dataset for Multimodal Scientific Fact Checking☆27Sep 17, 2024Updated last year
- Code for DUCK: Rumour Detection on Social Media by Modelling User and Comment Propagation Networks NAACL2022(https://aclanthology.org/202…☆23Jul 18, 2022Updated 3 years ago
- Awesome paper for multi-modal llm with grounding ability☆19Oct 11, 2025Updated 6 months ago
- This repository is the code and data for DialMed: A Dataset for Dialogue-based Medication Recommendation, COLING 2022.☆23Oct 26, 2022Updated 3 years ago
- ☆25Oct 16, 2024Updated last year
- [NeurIPS 2024] "Discovery of the Hidden World with Large Language Models"☆31Dec 2, 2024Updated last year
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- ☆15Jul 18, 2024Updated last year
- The official implementation of Non-separable Spatio-temporal Graph Kernels via SPDEs.☆16Jun 2, 2022Updated 3 years ago
- ☆25Sep 29, 2021Updated 4 years ago
- Code for "HINTS: Citation Time Series Prediction for New Publications via Dynamic Heterogeneous Information Network Embedding".☆14Mar 26, 2022Updated 4 years ago
- ☆19Jul 7, 2021Updated 4 years ago
- [NeurIPS 2023] "Unleashing the Power of Graph Data Augmentation on Covariate Distribution Shift" by Yongduo Sui, Qitian Wu, Jiancan Wu, Q…☆17Nov 6, 2023Updated 2 years ago
- Code and Data for WWW'23 paper Reinforcement Learning-based Counter-Misinformation Response Generation: A Case Study of COVID-19 Vaccine …☆27Jun 28, 2023Updated 2 years ago