Shivanshu-Gupta / web-scrapers
A repository of my web-scraping projects
☆33 Updated last year
Alternatives and similar repositories for web-scrapers
Users who are interested in web-scrapers are comparing it to the repositories listed below.
- A large-scale humor dataset, containing more than 550k rated English jokes (LREC'20) ☆69 Updated 2 years ago
- BERT Probe: A Python package for probing attention-based robustness to character- and word-based adversarial evaluation. Also, with recipe… ☆18 Updated 3 years ago
- A curated list of awesome ML frameworks & libraries for text data ☆16 Updated 2 years ago
- Sentence tokenizer for clinical/medical text. ☆28 Updated last year
- [WIP] Behold, semantic-search, built over sentence-transformers to make it easy for search engineers to evaluate, optimise and deploy mod… ☆15 Updated 2 years ago
- COVID-19 Open Research Dataset (CORD-19) Analysis ☆57 Updated 2 years ago
- MozoLM: A language model (LM) serving library ☆45 Updated last month
- CodeSwitch is an NLP tool that can be used for language identification, POS tagging, named entity recognition, and sentiment analysis of code-mixed dat… ☆36 Updated 5 years ago
- Master's thesis project in collaboration with Rasa, focusing on knowledge distillation from BERT into different very small networks and a… ☆13 Updated 3 years ago
- Visualization Tool for Mapping Out Researchers using Natural Language Processing ☆58 Updated last year
- 🤖 An automated machine learning framework for audio, text, image, video, or .CSV files (50+ featurizers and 15+ model trainers). Python … ☆146 Updated 7 months ago
- ☆33 Updated 6 years ago
- GenieNLP: A versatile codebase for any NLP task ☆88 Updated last year
- Using PubMed to find out how a gene contributes to addiction. ☆20 Updated 2 years ago
- Clean personally identifiable information from dirty dirty text using spaCy. ☆41 Updated 2 years ago
- Finds out symptoms similar to a given symptom, from a symptom-disease data set. ☆51 Updated 7 years ago
- Shoonya - Platform to Annotate and label data at scale. ☆59 Updated last week
- A curated list of awesome resources at the intersection of healthcare and AI ☆69 Updated 2 years ago
- Analysis of the human symptoms–disease network ☆21 Updated 8 years ago
- An NLP pipeline for COVID-19 surveillance used in the Department of Veterans Affairs Biosurveillance. ☆16 Updated 3 years ago
- Simple rule-based named entity recognition ☆42 Updated 3 years ago
- Machine learning utilities for model conversion, serialization, loading etc. ☆27 Updated 2 years ago
- Tooling to play around with multilingual machine translation for Indian Languages. ☆22 Updated 3 years ago
- Tokenization across languages. Useful as preprocessing for subword tokenization. ☆21 Updated 2 years ago
- ☆70 Updated 2 years ago
- XAI-based human-in-the-loop framework for automatic rule-learning. ☆49 Updated last year
- NLP and CV Data Engineering Framework ☆46 Updated 2 years ago
- Transforming textual descriptions into process models using deep learning ☆15 Updated 6 years ago
- ☆16 Updated 5 years ago
- Intelligence Task Ontology (ITO) ☆74 Updated 3 years ago