Text Preprocessing Package includes cleaning, tokenization, dataset preparation ...etc
☆18Aug 16, 2020Updated 5 years ago
Alternatives and similar repositories for nlp_preprocessing
Users that are interested in nlp_preprocessing are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Dense Passage Retrieval using tensorflow-keras on TPU☆17Jun 27, 2021Updated 4 years ago
- ☆12Jul 21, 2025Updated 9 months ago
- A helper to compare and identify similar keywords using PHP.☆10May 28, 2023Updated 2 years ago
- Ability to quickly group tabs by their attributes☆15Jan 17, 2026Updated 3 months ago
- Refer to paper "Embedding-based News Recommendation for Millions of Users" & "Article De-duplication Using Distributed Representations" p…☆31Mar 24, 2023Updated 3 years ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- user-editable, acronym-only dictionary☆24Jan 7, 2017Updated 9 years ago
- Chrome extension to sync the clipboard between computers☆27Jul 26, 2014Updated 11 years ago
- Grepify the GUI Regex Text Scanner for Code Reviewers☆23Apr 15, 2013Updated 13 years ago
- Outline to Story: Fine-grained Controllable Story Generation from Cascaded Events☆18Jun 16, 2022Updated 3 years ago
- Collection of useful tools to analyse Google Analytics☆10Dec 11, 2015Updated 10 years ago
- Use Python with the Twitter API and Alchemy API to create personas quickly.☆19Dec 1, 2015Updated 10 years ago
- BERT semantic search engine for searching literature research papers for coronavirus covid-19 in google colab☆31Apr 13, 2020Updated 6 years ago
- Markdown-based task manager☆15May 7, 2024Updated last year
- A Python module to extract personality insights, sentiment & keywords from reddit accounts. pip install reddit_persona☆24Jul 19, 2017Updated 8 years ago
- End-to-end encrypted email - Proton Mail • AdSpecial offer: 40% Off Yearly / 80% Off First Month. All Proton services are open source and independently audited for security.
- Fluent dataset operations, compatible with your favorite libraries☆11Sep 4, 2025Updated 8 months ago
- Code and resources for the paper "BERT-QE: Contextualized Query Expansion for Document Re-ranking".☆51Oct 10, 2021Updated 4 years ago
- Code for "A Hierarchical End-to-End Model for Jointly Improving Text Summarization and Sentiment Classification" (IJCAI 2018)☆23Jul 14, 2018Updated 7 years ago
- 🚁 Airtable assist☆16Jul 8, 2023Updated 2 years ago
- Didactic Web crawler for Web Search Engines (CS 6913) course at NYU☆10Dec 8, 2022Updated 3 years ago
- Tensorflow, Pytorch, Huggingface Transformer, Fastai, etc. tutorial Colab Notebooks.☆79Dec 20, 2022Updated 3 years ago
- Repository for the online book "Guide to Effect Sizes and Confidence Intervals"☆18Jan 16, 2024Updated 2 years ago
- During the presentation at Google Meet, multiple participant's screens are passed to Picture-in-Picture are automatically changed. Other…☆14Sep 3, 2020Updated 5 years ago
- Manage chrome extensions from the toolbar☆12Jun 24, 2018Updated 7 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- Free Live News JSON REST API for Global News & Blog Articles☆24Mar 13, 2024Updated 2 years ago
- This is a nice sample-web-stack.☆15Oct 24, 2025Updated 6 months ago
- classify crime into different categories using PySpark☆21May 20, 2019Updated 6 years ago
- ☆20Jan 16, 2020Updated 6 years ago
- This repository contains the complete source code of the MedTAG annotation tool. MedTAG is a biomedical annotation tool for tagging biome…☆12Jan 1, 2023Updated 3 years ago
- Small, minimalistic graphics for powerful ideas in a few words.☆14Jul 1, 2022Updated 3 years ago
- Chrome Shortcuts 👨🏻💻☆11Mar 7, 2025Updated last year
- Google Tag Manager Multisite Admin is an add-on to DuracellTomi's Google Tag Manager for WordPress plugin☆11Sep 11, 2020Updated 5 years ago
- URLs previewer browser extension for https://workflowy.com/☆14Jan 7, 2023Updated 3 years ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- 🖇️ GitHub Action to list repositories in a README☆17Jan 31, 2025Updated last year
- Keyphrase Extraction based on Scientific Text, Semeval 2017, Task 10☆109Sep 13, 2022Updated 3 years ago
- Difference-based Contrastive Learning for Korean Sentence Embeddings☆23Mar 11, 2026Updated last month
- User script which adds search and navigation features to WorkFlowy☆11Oct 1, 2021Updated 4 years ago
- A online map of The Legend of Zelda: Breath of the Wild☆18Apr 19, 2019Updated 7 years ago
- ☆13Sep 2, 2021Updated 4 years ago
- Semantic Web database☆19Sep 1, 2022Updated 3 years ago