pqaidevteam / pqai-db
Document database service for PQAI
☆11Updated last year
Related projects: ⓘ
- Curated list of resources for processing patent data☆54Updated 3 months ago
- PQAI Search Server☆69Updated this week
- Hugging Face and Pyserini interoperability☆17Updated last year
- CORWA: A Citation-Oriented Related Work Annotation Dataset, NAACL 2022☆16Updated 2 months ago
- Scripts supporting the development and serving the Roots Search Tool - https://hf.co/spaces/bigscience-data/roots-search☆10Updated last year
- This is a demo project to compare two web scrapping frameworks, Playwright and Selenium and using the new Pipelining tool Dagster☆12Updated 3 years ago
- Tools for building SQLite databases from files and directories☆11Updated last year
- Experiments with Observable Framework☆12Updated 6 months ago
- A python library for the Semantic Scholar (S2) API with typed pydantic objects and various nifty functionalities.☆17Updated 3 years ago
- Dataset repository for SDPROC SHared Task: Context24: Contextualizing Scientific Figures and Tables☆18Updated 3 months ago
- Open Access PDF harvester☆35Updated 4 months ago
- Legal document similarity - Code, data, and models for the ICAIL 2021 paper "Evaluating Document Representations for Content-based Legal …☆31Updated 3 years ago
- Open Access PDF harvester, metadata aggregator and full-text ingester☆54Updated 4 months ago
- ☆14Updated last year
- A collection of open source tools and resources related to Wikibase knowledge graphs☆64Updated last year
- Summarize. is a Streamlit application that performs automatic text summarization using both extractive and abstractive models.☆15Updated 2 years ago
- 🛠 Self-hosted, fast, and consistent remote configuration for apps.☆12Updated last year
- ☆31Updated 8 months ago
- ☆15Updated 3 years ago
- A Python module for retrieving script types of writing systems including alphabets, abjads, abugidas, syllabaries, logographs, featurals …☆11Updated 2 months ago
- Search for and retrieve US Patent and Trademark Office Patent Data☆75Updated 4 years ago
- A simple library for training named entity recognition model from partially annotated data☆21Updated 10 months ago
- Metadata Extractor & Loader (MEL) ■ The NLP-NER Toolkit (TNNT)☆22Updated last year
- spaCy entry points for Curated Transformers☆23Updated 2 weeks ago
- Graph-and-node based workflows☆11Updated this week
- d3 plugin to create a temporal network visualization☆18Updated last year
- Apache Spark based framework for analysis A/B experiments☆11Updated 2 months ago
- ☆15Updated 2 years ago
- Make MP3 albums out of Academic PDFs. Works by gluing together Grobid and TTS offerings.☆12Updated 9 months ago
- ☆19Updated last year