clulab / gentlenlpLinks
Software introduced in the Deep Learning for NLP: A Gentle Introduction book
☆25Updated 2 months ago
Alternatives and similar repositories for gentlenlp
Users that are interested in gentlenlp are comparing it to the libraries listed below
Sorting:
- ☆12Updated 4 years ago
- Analysis of gutenberg dataset☆44Updated 7 years ago
- Fastlaw's purpose is to replace generic word embeddings for work on supervised machine learning NLP-tasks with legal texts.☆40Updated 6 years ago
- TopicScan: Visualization and validation interface for NMF Topic Modeling☆23Updated 5 years ago
- Documentation effort for the BookCorpus dataset☆34Updated 4 years ago
- spaCy match and replace, maintaining conjugation☆36Updated 3 years ago
- Clean personally identifiable information from dirty dirty text using spaCy.☆41Updated 2 years ago
- MoodCat😼 classifies the mood of English sentences.☆14Updated 3 years ago
- ☆55Updated 2 years ago
- Supervised instruction finetuning for LLM with HF trainer and Deepspeed☆36Updated 2 years ago
- MinHash implementation in Python☆12Updated last year
- spaCy entry points for Curated Transformers☆32Updated 8 months ago
- Tokenization across languages. Useful as preprocessing for subword tokenization.☆21Updated 3 years ago
- SpaCyEx allows the creation of spaCy Matcher patterns with RegEx like syntax.☆59Updated last year
- Minimalist BERT implementation assignment for CS11-711☆83Updated 3 years ago
- 🤗 HuggingFace Inference Toolkit for Google Cloud Vertex AI (similar to SageMaker's Inference Toolkit, but for Vertex AI and unofficial)☆17Updated last year
- A Streamlit application to visualize sentence embeddings☆18Updated 3 years ago
- Seahorse is a dataset for multilingual, multi-faceted summarization evaluation. It consists of 96K summaries with human ratings along 6 q…☆89Updated last year
- Dutch abusive language data☆11Updated 2 years ago
- Quick cheat sheet to time series models using NYC Taxi Data☆17Updated 6 years ago
- ☆22Updated 3 years ago
- An example of how to use spaCy for extremely large files without running into memory issues☆36Updated 3 years ago
- Named entity recognition for the legal domain☆43Updated 4 years ago
- 💥 Use Hugging Face text and token classification pipelines directly in spaCy☆63Updated last year
- Odinson is a powerful and highly optimized open-source framework for rule-based information extraction. Odinson couples a simple, yet pow…☆72Updated last year
- Finds linguistic patterns effortlessly☆39Updated 2 years ago
- Next-generation Punkt sentence boundary detection with zero dependencies☆27Updated 2 months ago
- AfriSenti-SemEval Shared Task 12: Sentiment Analysis for African languages : https://afrisenti-semeval.github.io/☆49Updated 2 years ago
- ☆30Updated 3 years ago
- BERT Probe: A python package for probing attention based robustness to character and word based adversarial evaluation. Also, with recipe…☆18Updated 3 years ago