moaraio / SS-self-hosting
This is a public repository to enable researchers to begin their journey of self-hosting data from Semantic Scholar.
β41Updated 4 months ago
Alternatives and similar repositories for SS-self-hosting:
Users that are interested in SS-self-hosting are comparing it to the libraries listed below
- β85Updated 10 months ago
- π Reference-Free automatic summarization evaluation with potential hallucination detectionβ99Updated last year
- β65Updated last year
- Attribute (or cite) statements generated by LLMs back to in-context information.β218Updated 5 months ago
- Late Interaction Models Training & Retrievalβ263Updated this week
- Generalist and Lightweight Model for Relation Extraction (Extract any relationship types from text)β196Updated this week
- Versatile framework designed to streamline the integration of your models, as well as those sourced from Hugging Face, into complex progrβ¦β28Updated 3 months ago
- LLM prompt language based on Jinja. Banks provides tools and functions to build prompts text and chat messages from generic blueprints. Iβ¦β84Updated this week
- MedEmbed is a collection of embedding models fine-tuned specifically for medical and clinical data.β44Updated 5 months ago
- SciRepEval benchmark training and evaluation scriptsβ73Updated 10 months ago
- Public space for the user community of Semantic Scholar APIs to share scripts, report issues, and make suggestions.β220Updated 2 months ago
- πΊοΈ Data Cleaning and Textual Data Visualization πΊοΈβ165Updated 9 months ago
- β62Updated 8 months ago
- A DSPy-based implementation of the tree of thoughts method (Yao et al., 2023) for generating persuasive argumentsβ75Updated 5 months ago
- SUQL: Conversational Search over Structured and Unstructured Data with LLMsβ257Updated this week
- Completion After Prompt Probability. Make your LLM make a choiceβ75Updated 4 months ago
- An attribution library for LLMsβ37Updated 6 months ago
- Python API for https://vespa.ai, the open big data serving engineβ115Updated this week
- A proof of concept to scrape papers from journalsβ276Updated 9 months ago
- Edu-ConvoKit: An Open-Source Framework for Education Conversation Dataβ85Updated 7 months ago
- Official implementation of the ACL 2024: Scientific Inspiration Machines Optimized for Noveltyβ78Updated 11 months ago
- Automating meta-analysis of clinical trials (randomized controlled trials)β15Updated 6 months ago
- Notebooks for training universal 0-shot classifiers on many different tasksβ120Updated 2 months ago
- This is the repo for the LegalBench-RAG Paper: https://arxiv.org/abs/2408.10343.β76Updated 2 months ago
- GraphER: A Structure-aware Text-to-Graph Model for Entity and Relation Extractionβ68Updated 7 months ago
- High level library for batched embeddings generation, blazingly-fast web-based RAG and quantized indexes processing β‘β67Updated 4 months ago
- A Python pipeline tool and plugin ecosystem for processing technical documents. Process papers from arXiv, SemanticScholar, PDF, with GROβ¦β48Updated last week
- Code and data for "StructLM: Towards Building Generalist Models for Structured Knowledge Grounding" (COLM 2024)β76Updated 5 months ago
- Knowledge Graph Generator appβ30Updated 11 months ago
- Python SDK for running evaluations on LLM generated responsesβ272Updated last week