neuml / paperetlLinks
π βοΈ ETL processes for medical and scientific papers
β397Updated 3 weeks ago
Alternatives and similar repositories for paperetl
Users that are interested in paperetl are comparing it to the libraries listed below
Sorting:
- Neural Searchβ333Updated last year
- Semantic search engine indexing 110 million academic publicationsβ90Updated last month
- Labelling platform for text using weak supervision.β264Updated 3 years ago
- π π€ AI for medical and scientific papersβ1,456Updated last month
- Software that makes labeling PDFs easy.β418Updated last year
- Gain clues from clustering!β318Updated last year
- Neural Searchβ363Updated 5 months ago
- π Datasets and models for instruction-tuningβ238Updated last year
- π Build autonomous agents, retrieval augmented generation (RAG) processes and language model powered chat applicationsβ295Updated 3 months ago
- β¨ Bootstrap annotation with zero- & few-shot learning via OpenAI GPT-3β323Updated 2 years ago
- Healthsea is a spaCy pipeline for analyzing user reviews of supplementary products for their effects on health.β92Updated 3 years ago
- Full text search that feels like a numpy arrayβ257Updated 4 months ago
- π Semantic search for headlines and story textβ360Updated last year
- Semantic search through a vectorized Wikipedia (SentenceBERT) with the Weaviate vector search engineβ242Updated 2 years ago
- OCR, Archive, Index and Search: Implementation agnostic OCR framework.β223Updated last year
- ποΈ Highlight text in documentsβ108Updated 4 months ago
- SpanMarker for Named Entity Recognitionβ447Updated 7 months ago
- Information extraction from English and German texts based on predicate logicβ138Updated 2 years ago
- This is a public repository to enable researchers to begin their journey of self-hosting data from Semantic Scholar.β42Updated 9 months ago
- A proof of concept to scrape papers from journalsβ288Updated last year
- π PDF text extraction pipeline: self-hosted, local-first, Docker-basedβ326Updated last year
- Natural language Pandas queries and data generation powered by GPT-3β197Updated last year
- Parsers for scientific papers (PDF2JSON, TEX2JSON, JATS2JSON)β430Updated last year
- A spaCy wrapper for GliNERβ119Updated 6 months ago
- Coreference resolution for English, French, German and Polish, optimised for limited training data and easily extensible for further langβ¦β126Updated last year
- The Semantic Scholar Search Rerankerβ108Updated 4 years ago
- Get answers to research questions from 200M+ papers. Link to demo -β205Updated last year
- A pattern to let you try several vector databases and change a little code as possibleβ38Updated 2 years ago
- β191Updated this week
- Python PDF parser for scientific publications: content and figuresβ422Updated last year