genomoncology / entitykb
β22Updated this week
Related projects: β
- 𧬠A VS Code extension for annotating data with Prodigyβ30Updated 2 years ago
- Finds linguistic patterns effortlesslyβ31Updated last year
- β29Updated 2 years ago
- A Domain Specific Language (DSL) for building language patterns. These can be later compiled into spaCy patterns, pure regex, or any otheβ¦β65Updated last year
- Python based Wikidata framework for easy dataframe extractionβ39Updated 9 months ago
- Python package that offers text scrubbing functionality, providing building blocks for string cleaning as well as normalizing geographicaβ¦β22Updated 3 weeks ago
- A library to extract a publication date from a web page, along with a measure of the accuracy.β42Updated 5 years ago
- A proposed standard `NOCK` for a Parquet format that supports efficient distributed serialization of multiple kinds of graph technologiesβ15Updated last year
- β70Updated last year
- πΎ PdpCLI is a pandas DataFrame processing CLI tool which enables you to build a pandas pipeline from a configuration file.β15Updated 11 months ago
- Generate Pandas frames, load and extract data, based on JSON Table Schema descriptors.β52Updated 3 years ago
- Set-oriented Operations in Pandasβ24Updated 4 years ago
- Metadata Extractor & Loader (MEL) β The NLP-NER Toolkit (TNNT)β22Updated last year
- Language detection using Spacy and Fasttextβ53Updated 9 months ago
- spaCy match and replace, maintaining conjugationβ34Updated last year
- A Datasette plugin providing an MLOps platform to train, eval and predict machine learning modelsβ15Updated last week
- Versatile Metrics Collection for Pythonβ18Updated 9 months ago
- A utility for labeling clusters of text data.β28Updated 3 years ago
- International Address formatter which considers the standard formatting rules of the countryβ25Updated 3 years ago
- The NLP Bias Identification Toolkitβ35Updated last year
- πΈ Train floret vectorsβ18Updated last year
- An index data structure for approximate string search.β23Updated 5 years ago
- Scalable String Similarity Joins in Pythonβ39Updated 2 months ago
- Python Data Collection Libraryβ46Updated 3 years ago
- asyncbolt Bolt client/server protocol for Python asyncioβ13Updated 6 years ago
- python package for performing deduplication using flexible text matching and cleaning in pandas dataframeβ25Updated 3 years ago
- Python package for deduplication/entity resolution using active learningβ77Updated 3 weeks ago
- Lossless in-memory compression of pandas DataFrames and Series powered by the visions type system. Up to 10x less RAM needed for the sameβ¦β28Updated last year
- A simple web application for searching Word2Vec embeddings derived from approximately 2,000 law reports published by the The Incorporatedβ¦β25Updated last year
- Generate reports for spaCy models.β28Updated 2 years ago