Shivanshu-Gupta / web-scrapersLinks
A repository of my web-scraping projects
☆33Updated 10 months ago
Alternatives and similar repositories for web-scrapers
Users that are interested in web-scrapers are comparing it to the libraries listed below
Sorting:
- Finds out symptoms similar to a given symptom, from a symptom-disease data set.☆51Updated 7 years ago
- Using PubMed to find out how a gene contributes to addiction.☆21Updated 2 years ago
- CodeSwitch is a NLP tool, can use for language identification, pos tagging, name entity recognition, sentiment analysis of code mixed dat…☆35Updated 4 years ago
- A curated list of ML awesome frameworks & libraries for text data☆16Updated 2 years ago
- Sentence tokenizer for clinical/medical text.☆27Updated last year
- A PyPI package for easy text annotation in a Jupyter Notebook.☆28Updated 4 years ago
- Clean personally identifiable information from dirty dirty text using spaCy.☆41Updated 2 years ago
- MozoLM: A language model (LM) serving library☆45Updated last month
- Visualization Tool for Mapping Out Researchers using Natural Language Processing☆58Updated last year
- An NLP pipeline for COVID-19 surveillance used in the Department of Veterans Affairs Biosurveillance.☆16Updated 2 years ago
- [WIP] Behold, semantic-search, built over sentence-transformers to make it easy for search engineers to evaluate, optimise and deploy mod…☆15Updated 2 years ago
- Alternate Implementation for Zero Shot Text Classification: Instead of reframing NLI/XNLI, this reframes the text backbone of CLIP models…☆36Updated 3 years ago
- Implementation of Z-BERT-A: a zero-shot pipeline for unknown intent detection.☆41Updated 2 years ago
- A work in progress library that fuses the HL7 FHIR standard with scikit-learn☆21Updated 2 years ago
- Shoonya - Platform to Annotate and label data at scale.☆57Updated 11 months ago
- Text classification automl☆21Updated 4 years ago
- Aranizer: A Custom Tokenizer based on SentencePiece and BPE tailored for Arabic Language Modeling☆20Updated last year
- Replication materials for "Identifying the Development and Application of Artificial Intelligence in Scientific Text"☆12Updated 5 years ago
- ☆33Updated 6 years ago
- Analysis of the human symptoms–disease network☆21Updated 8 years ago
- AfriBERTa: Exploring the Viability of Pretrained Multilingual Language Models for Low-resourced Languages☆75Updated 3 years ago
- A collection of textual datasets in Hausa language and the corresponding translation in English language.☆15Updated 4 years ago
- http://icd10data.com/ data scraping☆21Updated 7 years ago
- BERT Probe: A python package for probing attention based robustness to character and word based adversarial evaluation. Also, with recipe…☆18Updated 3 years ago
- Intelligence Task Ontology (ITO)☆74Updated 2 years ago
- GenieNLP: A versatile codebase for any NLP task☆89Updated last year
- Tooling to play around with multilingual machine translation for Indian Languages.☆22Updated 3 years ago
- Machine learning utilities for model conversion, serialization, loading etc☆27Updated 2 years ago
- Text processing library for sentiment analysis and related tasks☆27Updated 6 years ago
- A corpus of textual data corresponding to synthetic clinical encounters, including each encounters’ dialogue transcript and clinical note…☆36Updated last year