InternScience/SGI-Bench

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/InternScience/SGI-Bench)

InternScience / SGI-Bench

Probing Scientific General Intelligence of LLMs with Scientist-Aligned Workflows

☆167

Alternatives and similar repositories for SGI-Bench

Users that are interested in SGI-Bench are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

InternScience / SciEvalKit
View on GitHub
A unified evaluation toolkit and leaderboard for rigorously assessing the scientific intelligence of large language and vision–language m…
☆85Jun 17, 2026Updated last month
VisionXLab / GRADE
View on GitHub
[ECCV'26] GRADE: Grounded Reasoning Assessment for Discipline-informed Editing
☆28Apr 23, 2026Updated 2 months ago
VisionXLab / Rise-Video
View on GitHub
RISE-Video: Can Video Generators Decode Implicit World Rules?
☆28Mar 26, 2026Updated 3 months ago
VisionXLab / FIRM-Reward
View on GitHub
Trust Your Critic: Robust Reward Modeling and Reinforcement Learning for Faithful Image Editing and Generation
☆40Mar 13, 2026Updated 4 months ago
InternScience / ResearchClawBench
View on GitHub
🦞 ResearchClawBench: Evaluating AI Agents for Automated Research from Re-Discovery to New-Discovery
☆221Updated this week
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
Visionary-Laboratory / CourtSI
View on GitHub
Stepping VLMs onto the Court: Benchmarking Spatial Intelligence in Sports
☆70Mar 15, 2026Updated 4 months ago
Sunshine-Ye / NIPS22-ST
View on GitHub
☆12Oct 24, 2024Updated last year
OpenGVLab / InternVL-U
View on GitHub
InternVL-U is a 4B-parameter unified multimodal model (UMM) that brings multimodal understanding, reasoning, image generation, image edit…
☆291Mar 21, 2026Updated 4 months ago
VisionXLab / CrossEarth-SAR
View on GitHub
The official repo of CrossEarth-SAR, a sar-centric and billion-scale geospatial foundation model for cross-domain semantic segmentation
☆46Mar 18, 2026Updated 4 months ago
Visionary-Laboratory / SpaceDG
View on GitHub
SpaceDG: Benchmarking Spatial Intelligence under Visual Degradation
☆31Jul 9, 2026Updated last week
InternScience / Awesome-Scientific-Datasets-and-LLMs
View on GitHub
A curated collection of papers, datasets, and resources on Scientific Datasets and Large Language Models (LLMs)
☆457Oct 3, 2025Updated 9 months ago
PhoenixZ810 / RISEBench
View on GitHub
[NIPS 2025 DB Oral] Official Repository of paper: Envisioning Beyond the Pixels: Benchmarking Reasoning-Informed Visual Editing
☆154May 18, 2026Updated 2 months ago
microsoft / BizGenEval
View on GitHub
Bridging the gap between image generation and real-world design: a benchmark for structured, multi-constraint commercial visual content g…
☆20Apr 24, 2026Updated 2 months ago
VisionXLab / Moment-Video
View on GitHub
☆18Jun 2, 2026Updated last month
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
InternScience / EcoClaw
View on GitHub
EcoClaw: Save 90%+ on LLM Costs for OpenClaw with One Plugin
☆29Apr 3, 2026Updated 3 months ago
Zhouzone / OmniWeather
View on GitHub
The official repository of Omni-Weather. Code will be made publicly available soon.
☆16Mar 30, 2026Updated 3 months ago
OpenEarthLab / EarthLink
View on GitHub
EarthLink: A Self-Evolving AI Agent System for Climate Science
☆41Jul 7, 2026Updated 2 weeks ago
Visionary-Laboratory / holi-spatial
View on GitHub
[ICML 2026 Oral] Holi-Spatial: Evolving Video Streams into Holistic 3D Spatial Intelligence
☆366Jul 6, 2026Updated 2 weeks ago
AgenticScience / Awesome-Agent-Scientists
View on GitHub
Paper list of agent for science
☆284Jun 27, 2026Updated 3 weeks ago
VisionXLab / PointOBB-v3
View on GitHub
[IJCV] PointOBB-v3: Expanding Performance Boundaries of Single Point-Supervised Oriented Object Detection
☆42Sep 25, 2025Updated 9 months ago
InternScience / MME-Reasoning
View on GitHub
Official Repository: A Comprehensive Benchmark for Logical Reasoning in MLLMs
☆45Jun 17, 2025Updated last year
Visionary-Laboratory / visionary
View on GitHub
Visionary: The World Model Carrier Built on WebGPU-Powered Gaussian Splatting Platform
☆514Jun 26, 2026Updated 3 weeks ago
VisionXLab / EvoTok
View on GitHub
[ECCV'26] Code repo for "EvoTok: A Unified Image Tokenizer via Residual Latent Evolution for Visual Understanding and Generation"
☆22Jun 18, 2026Updated last month
Simple, predictable pricing with DigitalOcean hosting • Ad
Always know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
CMarsRover / SciAgentGYM
View on GitHub
Code for Paper: Benchmarking Multi-step Scientific Tool-use in LLM Agents
☆37Jul 5, 2026Updated 2 weeks ago
lyh-18 / PromptGIP
View on GitHub
Unifying Image Processing as Visual Prompting Question Answering
☆23Jun 17, 2024Updated 2 years ago
InternLM / Intern-S1
View on GitHub
A Scientific Multimodal Foundation Model
☆834Updated this week
PRIME-RL / P1-VL
View on GitHub
P1-VL: Bridging Visual Perception and Scientific Reasoning in Physics Olympiads
☆15Feb 11, 2026Updated 5 months ago
VisionXLab / DVGBench
View on GitHub
[ISPRS2026] DVGBench: Implicit-to-Explicit Visual Grounding Benchmark in UAV Imagery with Large Vision-Language Models
☆30Mar 24, 2026Updated 3 months ago
MasterVito / DAC-RL
View on GitHub
Official Repo for DAC-RL: Training LLMs for Divide-and-Conquer Reasoning Elevates Test-Time Scalability
☆16Feb 26, 2026Updated 4 months ago
google-deepmind / proeval
View on GitHub
GenAI evaluation framework, optimized for 100x lower cost 🚀.
☆40Jun 16, 2026Updated last month
VisionXLab / SpaCE-10
View on GitHub
[ICLR 2026] SpaCE-10: A Comprehensive Benchmark for Multimodal Large Language Models in Compositional Spatial Intelligence
☆20Jan 26, 2026Updated 5 months ago
liushulinle / MarsRL
View on GitHub
MarsRL: Advancing Multi-Agent Reasoning System via Reinforcement Learning with Agentic Pipeline Parallelism
☆18Nov 18, 2025Updated 8 months ago
Virtual machines for every use case on DigitalOcean • Ad
Get dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
ventr1c / memma
View on GitHub
The official repository of "MemMA: Coordinating the Memory Cycle through Multi-Agent Reasoning and In-Situ Self-Evolution".
☆19Mar 20, 2026Updated 4 months ago
multimodal-art-projection / OProver
View on GitHub
☆21May 17, 2026Updated 2 months ago
InternScience / InternAgent
View on GitHub
InternAgent-1.5: A Unified Agentic Framework for Long-Horizon Autonomous Scientific Discovery
☆1,377Jun 10, 2026Updated last month
microsoft / HealthAgentBench
View on GitHub
☆24Updated this week
Henry-Lee-real / StableI2I
View on GitHub
Official implementation of StableI2I （ICML 2026）
☆19May 11, 2026Updated 2 months ago
THUDM / CaRR
View on GitHub
This repository contains the code and data for the paper "Chaining the Evidence: Robust Reinforcement Learning for Deep Search Agents wit…
☆72Apr 8, 2026Updated 3 months ago
liaoning97 / FineRMoE
View on GitHub
The official code of FineRMoE.
☆20Mar 17, 2026Updated 4 months ago