DistrictDataLabs / baleenLinks
An automated ingestion service for blogs to construct a corpus for NLP research.
β86Updated 7 years ago
Alternatives and similar repositories for baleen
Users that are interested in baleen are comparing it to the libraries listed below
Sorting:
- π« Scripts, tools and resources for developing spaCyβ126Updated 6 years ago
- π₯ Browser-based slides or PDFs of our talks and presentationsβ94Updated 6 years ago
- Language detection extension for spaCy 2.0+β113Updated 6 years ago
- Relatively simple text classification powered by spaCyβ41Updated 9 years ago
- A visualisation tool for Spacy using Hierplane.β65Updated 2 years ago
- A Topic Modeling toolboxβ92Updated 9 years ago
- π€ΉββοΈ Query spaCy's linguistic annotations using GraphQLβ86Updated 7 years ago
- π Emoji handling and meta data for spaCy with custom extension attributesβ182Updated 2 years ago
- Excel Integration with spaCy. Training NER using Excel/XLSX from PDF, DOCX, PPT, PNG or JPG.β104Updated 2 years ago
- π« Jupyter notebooks for spaCy examples and tutorialsβ288Updated 6 years ago
- displaCy-ent.js: An open-source named entity visualiser for the modern webβ198Updated 7 years ago
- Supervised learning for novelty detection in textβ78Updated 8 years ago
- π« REST microservices for various spaCy-related tasksβ241Updated 3 years ago
- spaCy pipeline component for adding text readability meta data to Doc objects.β56Updated 6 years ago
- Multidimensional data explorer and visualization tool.β56Updated 8 years ago
- Server/Client around Spacy to load spacy only onceβ46Updated 7 years ago
- Memory-based shallow parser for Pythonβ74Updated 6 years ago
- Tools, wrappers, etc... for data science with a concentration on text processingβ207Updated 2 years ago
- A repository for the "Combining DBpedia and Topic Modeling" GSoC 2016 ideaβ13Updated 9 years ago
- Textpipe: clean and extract metadata from textβ302Updated 4 years ago
- High-coverage and high-precision lexica of terms annotated with emotion scores for English and Italian.β154Updated 10 months ago
- A Domain Specific Language (DSL) for building language patterns. These can be later compiled into spaCy patterns, pure regex, or any otheβ¦β68Updated 2 years ago
- Search 'from' and 'to' strings to learn a text cleaning mappingβ17Updated 10 years ago
- Refinery - A locally deployable open-source web platform for analysis of large document collectionsβ101Updated 9 years ago
- Python port for IWNLP.Lemmatizerβ17Updated last year
- Similarity search on Wikipedia using gensim in Python.β60Updated 6 years ago
- β70Updated 2 years ago
- Experiment, Storage and Visualization Framework for Machine Learning research.β31Updated 4 years ago
- A collection of simple tutorials for using Fonduerβ100Updated 4 years ago
- Record Linkage ToolKit (Find and link entities)β110Updated 2 years ago