moaraio / SS-self-hostingLinks
This is a public repository to enable researchers to begin their journey of self-hosting data from Semantic Scholar.
☆42Updated 8 months ago
Alternatives and similar repositories for SS-self-hosting
Users that are interested in SS-self-hosting are comparing it to the libraries listed below
Sorting:
- ☆77Updated last year
- Public space for the user community of Semantic Scholar APIs to share scripts, report issues, and make suggestions.☆231Updated 5 months ago
- Automated Hypothesis Testing with Agentic Sequential Falsifications☆204Updated 2 months ago
- A proof of concept to scrape papers from journals☆285Updated last year
- SUQL: Conversational Search over Structured and Unstructured Data with LLMs☆271Updated last month
- Edu-ConvoKit: An Open-Source Framework for Education Conversation Data☆97Updated 2 months ago
- ☆94Updated last year
- Get answers to research questions from 200M+ papers. Link to demo -☆205Updated last year
- To automate the SLR process and write paper quickly using multi agents of AI☆45Updated last year
- Interact with the Deep Search platform for new knowledge explorations and discoveries☆209Updated 5 months ago
- This repository contains ScholarQABench data and evaluation pipeline.☆73Updated 3 months ago
- ☆42Updated 2 months ago
- Versatile framework designed to streamline the integration of your models, as well as those sourced from Hugging Face, into complex progr…☆32Updated 3 months ago
- Python PDF parser for scientific publications: content and figures☆418Updated last year
- Python SDK for running evaluations on LLM generated responses☆289Updated last month
- PDF parser powered by grobid☆28Updated 11 months ago
- Parsers for scientific papers (PDF2JSON, TEX2JSON, JATS2JSON)☆425Updated last year
- RAGElo is a set of tools that helps you selecting the best RAG-based LLM agents by using an Elo ranker☆113Updated this week
- GraphER: A Structure-aware Text-to-Graph Model for Entity and Relation Extraction☆75Updated 11 months ago
- A Lightweight Library for AI Observability☆246Updated 4 months ago
- In-Context Learning for eXtreme Multi-Label Classification (XMC) using only a handful of examples.☆431Updated last year
- XTR/WARP (SIGIR'25) is an extremely fast and accurate retrieval engine based on Stanford's ColBERTv2/PLAID and Google DeepMind's XTR.☆137Updated 2 months ago
- ☆144Updated 11 months ago
- [EMNLP 2024] A Retrieval Benchmark for Scientific Literature Search☆90Updated 7 months ago
- Baguetter is a flexible, efficient, and hackable search engine library implemented in Python. It's designed for quickly benchmarking, imp…☆184Updated 10 months ago
- Dataset and annotations for ASSETS 2022 publication☆12Updated 2 years ago
- A Python library to chunk/group your texts based on semantic similarity.☆97Updated last year
- 📄 ⚙️ ETL processes for medical and scientific papers☆393Updated this week
- Python API for https://vespa.ai, the open big data serving engine☆127Updated last week
- A DSPy-based implementation of the tree of thoughts method (Yao et al., 2023) for generating persuasive arguments☆84Updated 9 months ago