Shivanshu-Gupta / web-scrapers
A repository of my web-scraping projects
☆34 Updated last year
Alternatives and similar repositories for web-scrapers
Users who are interested in web-scrapers are comparing it to the libraries listed below.
- BERT Probe: A python package for probing attention based robustness to character and word based adversarial evaluation. Also, with recipe… ☆18 Updated 3 years ago
- MozoLM: A language model (LM) serving library ☆47 Updated last month
- Sentence tokenizer for clinical/medical text. ☆28 Updated last year
- Visualization Tool for Mapping Out Researchers using Natural Language Processing ☆59 Updated last year
- [WIP] Behold, semantic-search, built over sentence-transformers to make it easy for search engineers to evaluate, optimise and deploy mod… ☆15 Updated 2 years ago
- Scripts to parse arxiv documents for NLP tasks ☆19 Updated 2 years ago
- A curated list of ML awesome frameworks & libraries for text data ☆16 Updated 2 years ago
- This project develops compact transformer models tailored for clinical text analysis, balancing efficiency and performance for healthcare… ☆18 Updated last year
- ☆25 Updated 3 years ago
- The repository for the paper "When Do You Need Billions of Words of Pretraining Data?" ☆21 Updated 5 years ago
- A large scale Humor Dataset, containing more than 550k rated English jokes (LREC'20) ☆74 Updated 2 years ago
- Utility for cui2vec in Go ☆13 Updated 2 years ago
- Clean personally identifiable information from dirty dirty text using spaCy. ☆41 Updated 2 years ago
- Code for "Automated and Intelligent Synthesis of Oxygen-Producing Catalysts from Martian Meteorites by Robotic AI-Chemist" ☆12 Updated 2 years ago
- A work in progress library that fuses the HL7 FHIR standard with scikit-learn ☆21 Updated 2 years ago
- An asynchronous concurrent pipeline for classifying Common Crawl based on fastText's pipeline. ☆86 Updated 4 years ago
- Transforming textual descriptions into process models using deep learning ☆15 Updated 6 years ago
- Scripts to convert datasets from various sources to Hugging Face Datasets. ☆57 Updated 3 years ago
- Fastai implementation of @karpathy's miniGPT library ☆15 Updated 5 years ago
- Supervised instruction finetuning for LLM with HF trainer and Deepspeed ☆36 Updated 2 years ago
- A simple visualization of Martin Zinkevich's article ☆30 Updated 2 months ago
- GenieNLP: A versatile codebase for any NLP task ☆89 Updated last year
- Experiments with generating GPT-2 fanfiction on specified topics. ☆11 Updated 6 years ago
- Robust de-identification of medical notes using transformer architectures ☆57 Updated 3 years ago
- Implementation of Z-BERT-A: a zero-shot pipeline for unknown intent detection. ☆44 Updated 2 years ago
- Dataset for training machine learning model for automatically generating psychiatric case notes from doctor-patient conversations. ☆68 Updated 3 years ago
- pharmpy is an umbrella library for searching the FDA NDC directory, Established Pharmacologic Class (EPC), Anatomical Therapeutic Chemica… ☆17 Updated 5 years ago
- A utility for labeling clusters of text data. ☆28 Updated 4 years ago
- Automatically check mismatch between code and comments using AI and ML ☆54 Updated 4 years ago
- http://icd10data.com/ data scraping ☆23 Updated 7 years ago