[ACL 2025] Multi-Agent System for Science of Science
☆67 Jul 26, 2025 Updated 7 months ago
Alternatives and similar repositories for Virtual-Scientists
Users who are interested in Virtual-Scientists compare it to the repositories listed below.
- Multi-Agent System for Science of Science ☆22 Feb 5, 2026 Updated last month
- 🎮Manipulates mobile phones just like how you would. Official code for "MobA: Multifaceted Memory-Enhanced Adaptive Planning for Efficien… ☆27 Oct 10, 2025 Updated 4 months ago
- ☆12 Mar 5, 2025 Updated last year
- ☆12 Mar 21, 2024 Updated last year
- Code for paper: "Executing Arithmetic: Fine-Tuning Large Language Models as Turing Machines" ☆11 Oct 11, 2024 Updated last year
- ☆34 Jan 25, 2026 Updated last month
- [ICLR 2026] BARREL: Boundary-Aware Reasoning for Factual and Reliable LRMs ☆17 May 21, 2025 Updated 9 months ago
- Blindspots in LLMs I've noticed while AI coding. Sonnet family emphasis. ☆13 Mar 20, 2025 Updated 11 months ago
- CS194-196 Course Project ☆14 Feb 20, 2025 Updated last year
- ☆13 Mar 6, 2023 Updated 3 years ago
- Moatless Testbeds allows you to create isolated testbed environments in a Kubernetes cluster where you can apply code changes through git… ☆14 Apr 9, 2025 Updated 10 months ago
- CogNetX is an advanced, multimodal neural network architecture inspired by human cognition. It integrates speech, vision, and video proce… ☆19 Feb 9, 2026 Updated 3 weeks ago
- PiFlow: Principle-aware Scientific Discovery with Multi-Agent Collaboration ☆41 Jan 7, 2026 Updated 2 months ago
- An alternate reality web browser, powered by an LLM ☆17 Apr 29, 2024 Updated last year
- ☆15 Feb 21, 2024 Updated 2 years ago
- a survey on deep research ☆47 Sep 9, 2025 Updated 5 months ago
- ☆25 Dec 12, 2025 Updated 2 months ago
- ☆20 Oct 25, 2022 Updated 3 years ago
- [NeurIPS'25] ColorBench: Can VLMs See and Understand the Colorful World? A Comprehensive Benchmark for Color Perception, Reasoning, and R… ☆32 Sep 27, 2025 Updated 5 months ago
- ☆31 Sep 12, 2025 Updated 5 months ago
- ☆26 Sep 3, 2024 Updated last year
- Code of "LMLT: Low-to-high Multi-Level Vision Transformer for Image Super-Resolution". ☆32 Oct 17, 2024 Updated last year
- PKM tooling for semantically inclined digital gardeners (with [[wikirefs]], semantic trees, and graphs). ☆22 Sep 12, 2024 Updated last year
- EasyApp is a chrome extension that allows users to autofill job applications, draft application responses, navigate job listings, tweak u… ☆29 Dec 5, 2023 Updated 2 years ago
- ☆26 Nov 21, 2022 Updated 3 years ago
- This repo contains evaluation code for the paper "AV-Odyssey: Can Your Multimodal LLMs Really Understand Audio-Visual Information?" ☆31 Dec 23, 2024 Updated last year
- This is the official repo for the paper "AMO-Bench: Large Language Models Still Struggle in High School Math Competitions". ☆64 Feb 6, 2026 Updated last month
- ☆31 Jun 12, 2024 Updated last year
- ☆33 Apr 22, 2025 Updated 10 months ago
- VCR-Bench: A Comprehensive Evaluation Framework for Video Chain-of-Thought Reasoning ☆35 Jul 15, 2025 Updated 7 months ago
- The first spoken long-text dataset derived from live streams, designed to reflect the redundancy-rich and conversational nature of real-w… ☆12 Jun 28, 2025 Updated 8 months ago
- Cheating a little to solve the halting problem at scale ☆32 Nov 17, 2025 Updated 3 months ago
- quick playground to animate pippin ☆14 Nov 11, 2024 Updated last year
- [ICLR 2023] PyTorch code of Summarization Programs: Interpretable Abstractive Summarization with Neural Modular Trees ☆24 Jun 19, 2023 Updated 2 years ago
- A Text2SQL benchmark for evaluation of Large Language Models ☆41 Updated this week
- ☆37 Oct 15, 2024 Updated last year
- The original Shared Recurrent Memory Transformer implementation ☆33 Jul 11, 2025 Updated 7 months ago
- AAAI'24: Prompt to Transfer: Sim-to-Real Transfer for Traffic Signal Control with Prompt Learning ☆30 Sep 12, 2024 Updated last year
- Official Implementation for the paper "Integrative Decoding: Improving Factuality via Implicit Self-consistency" ☆32 Apr 12, 2025 Updated 10 months ago