This repository includes the official implementation of OpenScholar: Synthesizing Scientific Literature with Retrieval-augmented LMs.
☆1,375Aug 13, 2025Updated 7 months ago
Alternatives and similar repositories for OpenScholar
Users that are interested in OpenScholar are comparing it to the libraries listed below.
- This repository contains ScholarQABench data and evaluation pipeline.☆145Aug 13, 2025Updated 7 months ago
- This repository contains expert evaluation interface and data evaluation script for the OpenScholar project.☆39Nov 19, 2024Updated last year
- High accuracy RAG for answering questions from scientific documents with citations☆8,321Mar 20, 2026Updated last week
- OpenResearcher, an advanced Scientific Research Assistant☆498Oct 10, 2024Updated last year
- Official repository for "Scaling Retrieval-Based Language Models with a Trillion-Token Datastore".☆224Dec 16, 2025Updated 3 months ago
- The AI Scientist: Towards Fully Automated Open-Ended Scientific Discovery 🧑🔬☆12,804Dec 19, 2025Updated 3 months ago
- PaSa -- an advanced paper search agent powered by large language models. It can autonomously make a series of decisions, including invoki…☆1,543May 27, 2025Updated 10 months ago
- [EMNLP 2024] A Retrieval Benchmark for Scientific Literature Search☆104Dec 2, 2024Updated last year
- ☆596May 10, 2025Updated 10 months ago
- CycleResearcher: Improving Automated Research via Automated Review☆362Mar 5, 2026Updated 3 weeks ago
- Agent Laboratory is an end-to-end autonomous research workflow meant to assist you as the human researcher toward implementing your resea…☆5,455Aug 20, 2025Updated 7 months ago
- ☆381Aug 7, 2025Updated 7 months ago
- CodeScientist: An automated scientific discovery system for code-based experiments☆318Nov 27, 2025Updated 4 months ago
- Repo housing the open sourced code for the ai2 scholar qa app and also the corresponding library☆270Mar 19, 2026Updated last week
- [EMNLP 2025] OmniThink: Expanding Knowledge Boundaries in Machine Writing through Thinking☆487Aug 23, 2025Updated 7 months ago
- Repo for NAACL 2025 Paper "Unfolding the Headline: Iterative Self-Questioning for News Retrieval and Timeline Summarization"☆296Aug 4, 2025Updated 7 months ago
- ☆288Jul 19, 2024Updated last year
- An autonomous agent that conducts deep research on any data using any LLM providers☆26,061Mar 14, 2026Updated 2 weeks ago
- An LLM-powered knowledge curation system that researches a topic and generates a full-length report with citations.☆28,040Sep 30, 2025Updated 5 months ago
- [EMNLP 2024] Tree of Problems: Improving structured problem solving with compositionality☆19Mar 4, 2025Updated last year
- Python package for serving a local search engine. One command to download and serve a datastore---that's it 😎.☆25Jun 6, 2025Updated 9 months ago
- [SIGIR 2025] The official repo for "Scaling Sparse and Dense Retrieval in Decoder-Only LLMs"☆20Mar 31, 2025Updated 11 months ago
- [NeurIPS2025] "AI-Researcher: Autonomous Scientific Innovation" -- A production-ready version: https://novix.science/chat☆4,968Oct 16, 2025Updated 5 months ago
- 🔍 An LLM-based Multi-agent Framework of Web Search Engine (like Perplexity.ai Pro and SearchGPT)☆6,816Jul 4, 2025Updated 8 months ago
- AllenAI's post-training codebase☆3,643Mar 23, 2026Updated last week
- Official repository for the paper "ReasonIR: Training Retrievers for Reasoning Tasks".☆226Jun 24, 2025Updated 9 months ago
- [ICLR 2025] LongWriter: Unleashing 10,000+ Word Generation from Long Context LLMs☆1,850Jun 24, 2025Updated 9 months ago
- Analysis code for the NeurIPS 2025 paper "SciArena: An Open Evaluation Platform for Foundation Models in Scientific Literature Tasks"☆56Aug 6, 2025Updated 7 months ago
- ScholarCopilot: Training Large Language Models for Academic Writing with Accurate Citations [COLM 2025]☆251Jul 8, 2025Updated 8 months ago
- DATAGEN: AI-driven multi-agent research assistant automating hypothesis generation, data analysis, and report writing.☆1,661Feb 5, 2026Updated last month
- [ICLR 2025] BRIGHT: A Realistic and Challenging Benchmark for Reasoning-Intensive Retrieval☆195Sep 13, 2025Updated 6 months ago
- 🔍 Search-o1: Agentic Search-Enhanced Large Reasoning Models [EMNLP 2025]☆1,185Nov 17, 2025Updated 4 months ago
- An Open Large Reasoning Model for Real-World Solutions☆1,539Feb 13, 2026Updated last month
- Search-R1: An Efficient, Scalable RL Training Framework for Reasoning & Search Engine Calling interleaved LLM based on veRL☆4,309Nov 13, 2025Updated 4 months ago
- SOTA search powered LLM☆3,794Apr 4, 2025Updated 11 months ago
- Keeps searching, reading webpages, and reasoning until it finds the answer (or exceeds the token budget)☆5,135Dec 13, 2025Updated 3 months ago
- "Your Fully-Automated Personal AI Assistant"☆1,461Oct 16, 2025Updated 5 months ago
- S2ORC: The Semantic Scholar Open Research Corpus: https://www.aclweb.org/anthology/2020.acl-main.447/☆1,022Apr 26, 2024Updated last year
- DSPy: The framework for programming—not prompting—language models☆33,038Mar 22, 2026Updated last week