yehudagale / fuzzyJoiner
☆13 Updated 2 years ago
Alternatives and similar repositories for fuzzyJoiner:
Users that are interested in fuzzyJoiner are comparing it to the libraries listed below
- Abydos NLP/IR library for Python ☆184 Updated 2 years ago
- Hunspell extension for spaCy 2.0. ☆94 Updated 6 months ago
- 🧬 A JupyterLab extension for annotating data with Prodigy ☆189 Updated last year
- Language detection using Spacy and Fasttext ☆55 Updated last year
- Automatically labeling training data ☆105 Updated 6 years ago
- German lemmatization with IWNLP as extension for spaCy ☆24 Updated last year
- Polyglot is a language identifier for detecting text documents containing text written in more than one language, and for identifying the… ☆33 Updated 8 years ago
- Prodigy thing(z) ☆13 Updated 6 years ago
- Healthsea is a spaCy pipeline for analyzing user reviews of supplementary products for their effects on health. ☆91 Updated 3 years ago
- 🎯 kettle is a CLI tool for creating and deploying cloud functions & docker containers for machine learning ☆32 Updated 2 years ago
- Excel Integration with spaCy. Training NER using Excel/XLSX from PDF, DOCX, PPT, PNG or JPG. ☆105 Updated 2 years ago
- The weights for the embedding layer of Scandinavian ULMFiT language models ☆32 Updated 5 years ago
- Conversational text Analysis using various NLP techniques ☆181 Updated last year
- Accelerated NLP pipelines for fast inference on CPU. Built with Transformers and ONNX runtime. ☆126 Updated 4 years ago
- In the wild extraction of entities that are found using Flair and displayed using a very elegant front-end. ☆71 Updated 2 years ago
- ☆70 Updated 2 years ago
- NER tagger for English, Spanish, Dutch, Italian and German and French. ☆35 Updated 9 years ago
- Use ML-Annotate to label data for machine learning purposes ☆107 Updated 4 years ago
- Example using Polyaxon to experiment with pre-training spaCy ☆65 Updated 3 years ago
- Sentence transformers models for SpaCy ☆107 Updated last year
- Anonymization of legal cases (Fr) based on Flair embeddings ☆88 Updated 4 years ago
- Language detection extension for spaCy 2.0+ ☆112 Updated 6 years ago
- AI apps/benchmark for legaltech ☆111 Updated 3 years ago
- Information extraction from English and German texts based on predicate logic ☆135 Updated last year
- Language independent truecaser in Python. ☆160 Updated 3 years ago
- Code and data for segmentation experiments. ☆22 Updated 9 years ago
- A way to do annotations for NER. TALEN: Tool for Annotation of Low-resource ENtities ☆113 Updated 2 years ago
- ULMFiT + Siamese Network for Sentence Vectors ☆34 Updated 6 years ago
- Recon NER, Debug and correct annotated Named Entity Recognition (NER) data for inconsistencies and get insights on improving the quality … ☆106 Updated 11 months ago
- The Data Linter identifies potential issues (lints) in your ML training data. ☆87 Updated 7 years ago