mirandrom / PyS2
A python library for the Semantic Scholar (S2) API with typed pydantic objects and various nifty functionalities.
☆17Updated 3 years ago
Related projects ⓘ
Alternatives and complementary repositories for PyS2
- Finds linguistic patterns effortlessly☆33Updated last year
- ☆23Updated last year
- Transform a corpus of text documents (any kind) into a map with different zoom levels and topics names to summarise sub corpus of similar…☆28Updated 10 months ago
- ☆16Updated last year
- 🤗 Push your spaCy pipelines to the Hugging Face Hub☆43Updated 5 months ago
- Starbucks: Improved Training for 2D Matryoshka Embeddings☆17Updated last month
- The CleanCoNLL dataset from our EMNLP 2023 paper where we corrected annotation errors and inconsistencies in CoNLL-03.☆19Updated 4 months ago
- Weakly Supervised Text-to-SQL Parsing through Question Decomposition☆22Updated 11 months ago
- Metadata Extractor & Loader (MEL) ■ The NLP-NER Toolkit (TNNT)☆22Updated last year
- Code for SaGe subword tokenizer (EACL 2023)☆22Updated this week
- ☆53Updated 10 months ago
- Neighborhood Contrastive Learning for Scientific Document Representations with Citation Embeddings (EMNLP 2022 paper)☆64Updated 2 years ago
- SMASHED is a toolkit designed to apply transformations to samples in datasets, such as fields extraction, tokenization, prompting, batchi…☆31Updated 5 months ago
- Blue Brain text mining toolbox for semantic search and structured information extraction☆42Updated last year
- Open Access PDF harvester, metadata aggregator and full-text ingester☆55Updated 6 months ago
- Pytorch implementation of a BiLSTM model for the Wikification project.☆17Updated 4 years ago
- Source code and data for Like a Good Nearest Neighbor☆28Updated 9 months ago
- ☆32Updated 10 months ago
- ☆20Updated last year
- Tool for disambiguating acronyms and abbreviations in text for NLP applications☆20Updated 5 months ago
- End-to-end zero-shot entity and relation extraction☆58Updated 3 months ago
- Implementation of the Paper "Towards an Automated Argument Mining Pipeline to Transform Plain Text to Argument Graphs"☆22Updated 9 months ago
- Exploration using DSPy to optimize modules to maximize performance on the OpenToM dataset☆13Updated 8 months ago
- Download, parse, and filter data from Phil Papers. Data-ready for The-Pile.☆12Updated last year
- A proposed standard `NOCK` for a Parquet format that supports efficient distributed serialization of multiple kinds of graph technologies☆17Updated 2 years ago
- MultiCite code and data. Models are available on Huggingface.☆29Updated 2 years ago
- Poor man's simple harvester for arXiv resources☆11Updated last year
- Index of open source code for OpenAlex---an open, comprehensive catalog of scholarship, connecting papers, authors, institutions, and jou…☆19Updated 10 months ago
- A library of tools for dictionary-based Named Entity Recognition (NER), based on word vector representations to expand dictionary terms.☆24Updated last year
- ☆23Updated last year