iterative / aita_dataset
AITA dataset based on r/AmItheAsshole/
β33Updated 4 years ago
Related projects β
Alternatives and complementary repositories for aita_dataset
- MoodCatπΌ classifies the mood of English sentences.β14Updated 2 years ago
- MinHash implementation in Pythonβ11Updated 2 months ago
- BERT Probe: A python package for probing attention based robustness to character and word based adversarial evaluation. Also, with recipeβ¦β18Updated 2 years ago
- Finds linguistic patterns effortlesslyβ33Updated last year
- Fastlaw's purpose is to replace generic word embeddings for work on supervised machine learning NLP-tasks with legal texts.β37Updated 5 years ago
- A conda-smithy repository for spacy.β14Updated last week
- [WIP] Behold, semantic-search, built over sentence-transformers to make it easy for search engineers to evaluate, optimise and deploy modβ¦β15Updated last year
- β30Updated 3 years ago
- Analysis of gutenberg datasetβ40Updated 5 years ago
- A comprehensive tool for linguistic analysis of communitiesβ48Updated 3 years ago
- The RadioTalk dataset of talk radio transcriptsβ56Updated 3 years ago
- spaCy match and replace, maintaining conjugationβ34Updated last year
- Markdown template for Dataseets for Datasetsβ60Updated 2 years ago
- A set of tools for leveraging pre-trained embeddings, active learning and model explainability for effecient document classificationβ29Updated 2 years ago
- An example of how to use spaCy for extremely large files without running into memory issuesβ36Updated 2 years ago
- The Algoneer Python library.β16Updated 3 years ago
- TopicScan: Visualization and validation interface for NMF Topic Modelingβ23Updated 4 years ago
- This is a document concerning Data Readiness in the context of machine learning and Natural Language Processing.β11Updated 3 years ago
- The News Landscape Toolkit (NELA)β15Updated 4 years ago
- This Python package can be used to systematically extract multiple data elements (e.g., title, keywords, text) from news sources around tβ¦β32Updated last year
- Preprocessing and analysis for training SNOMED-CT concept embeddings from CORD-19 corpusβ14Updated last year
- β24Updated 5 months ago
- Topic Inference with Zeroshot modelsβ61Updated last year
- A text similarity computation using minhashing and Jaccard distance on reuters datasetβ16Updated 6 years ago
- A clean and easy interface for performing nearest-neighbor lookupsβ50Updated 4 years ago
- 𧬠A VS Code extension for annotating data with Prodigyβ30Updated 2 years ago
- πΈ Train floret vectorsβ18Updated last year
- β13Updated 3 years ago