Pengyue-Lab / uiuc-cs357-fa21-scripts
A repository of useful scripts for the course CS357, provided as Jupyter Notebooks.
☆13Updated 4 years ago
Alternatives and similar repositories for uiuc-cs357-fa21-scripts
Users who are interested in uiuc-cs357-fa21-scripts are comparing it to the repositories listed below
- Official repository of the video reasoning benchmark MMR-V. Can Your MLLMs "Think with Video"?☆36Updated 6 months ago
- Data and Code for CVPR 2025 paper "MMVU: Measuring Expert-Level Multi-Discipline Video Understanding"☆76Updated 10 months ago
- [arXiv:2508.00410] "Co-rewarding: Stable Self-supervised RL for Eliciting Reasoning in Large Language Models"☆30Updated 2 months ago
- The repository of the paper "REEF: Representation Encoding Fingerprints for Large Language Models," aims to protect the IP of open-source…☆72Updated 11 months ago
- Code for the paper "VTool-R1: VLMs Learn to Think with Images via Reinforcement Learning on Multimodal Tool Use"☆144Updated 4 months ago
- Implementation of the MATRIX framework (ICML 2024)☆60Updated last year
- [AI4MATH@ICML2025] Do Not Let Low-Probability Tokens Over-Dominate in RL for LLMs☆41Updated 7 months ago
- [SCIS] MULTI-Benchmark: Multimodal Understanding Leaderboard with Text and Images☆44Updated last month
- Towards a Mechanistic Interpretation of Multi-Step Reasoning Capabilities of Language Models☆14Updated 2 years ago
- [NeurIPS 2025] VeriThinker: Learning to Verify Makes Reasoning Model Efficient☆63Updated 3 months ago
- [TMLR] Triple Preference Optimization☆30Updated 10 months ago
- Code for the paper "AsFT: Anchoring Safety During LLM Fine-Tuning Within Narrow Safety Basin"☆35Updated 5 months ago
- Workspace for CS357☆18Updated 2 years ago
- ☆176Updated last week
- [NeurIPS'25] The official code of "PeRL: Permutation-Enhanced Reinforcement Learning for Interleaved Vision-Language Reasoning"☆27Updated 3 months ago
- ☆62Updated 3 months ago
- Official code for ICLR 2024 paper "Do Generated Data Always Help Contrastive Learning?"☆31Updated last year
- G1: Bootstrapping Perception and Reasoning Abilities of Vision-Language Model via Reinforcement Learning☆94Updated 7 months ago
- SynthRL: Scaling Visual Reasoning with Verifiable Data Synthesis☆68Updated 5 months ago
- Official eval code for ROVER: Benchmarking Reciprocal Cross-Modal Reasoning for Omnimodal Generation☆27Updated 2 weeks ago
- ☆198Updated last week
- Github repository for "Why Is Spatial Reasoning Hard for VLMs? An Attention Mechanism Perspective on Focus Areas" (ICML 2025)☆66Updated 7 months ago
- The official repo for "AceCoder: Acing Coder RL via Automated Test-Case Synthesis" [ACL25]☆94Updated 8 months ago
- A Massive Multi-Discipline Lecture Understanding Benchmark☆32Updated 2 months ago
- CoT-Valve: Length-Compressible Chain-of-Thought Tuning☆88Updated 10 months ago
- ☆32Updated 2 months ago
- ☆32Updated last month
- ☆25Updated 5 months ago
- VeriWeb: Verifiable Long-Chain Web Benchmark for Agentic Information-Seeking☆83Updated 3 weeks ago
- CVPR25☆26Updated 5 months ago