dohliam / more-stoplists
stoplists for African languages generated from the ASP corpus
☆14Updated 9 years ago
Alternatives and similar repositories for more-stoplists
Users who are interested in more-stoplists are comparing it to the repositories listed below
- A Python package for cleaning Gutenberg books and dataset☆34Updated 8 months ago
- Use spaCy for NLP and output to the FoLiA XML format.☆12Updated last year
- Wiktionary parser tool for many language editions.☆54Updated 3 years ago
- FoLiA Linguistic Annotation Tool -- Flat is a web-based linguistic annotation environment based around the FoLiA format (http://proycon.g…☆113Updated 11 months ago
- Web hub based on Wikidata☆38Updated last month
- Python package for stylometry☆64Updated 4 years ago
- SerendipSlim is a visualization tool for exploring topic models built on large collections of text documents.☆39Updated 7 years ago
- command-line tool to extract taxonomies from Wikidata☆129Updated 6 years ago
- A modular annotation system that supports complex, interactive annotation graphs embedded on top of sequences of text.☆98Updated 4 years ago
- The RadioTalk dataset of talk radio transcripts☆61Updated 4 years ago
- A list of resources for conservation, development, and documentation of endangered, minority, and low or under-resourced human languages.☆35Updated 2 years ago
- An online annotation platform for teaching and learning in the humanities.☆108Updated last month
- Experiments to help discussion on Wikipedia talk pages☆68Updated this week
- A cloud-based, open-source system for writing and publishing dictionaries.☆98Updated 2 years ago
- Topic Words in Context (TWiC) is a highly-interactive, browser-based visualization for MALLET topic models☆51Updated 8 years ago
- Lexicons for the Multilingual UCREL Semantic Analysis System☆47Updated 3 weeks ago
- A temporal ordering system for events and time expressions in written text.☆42Updated 3 years ago
- Wikidata embedding☆51Updated last year
- WordWanderer – take your text for a walk☆12Updated 6 years ago
- An implementation of latent Dirichlet allocation in JavaScript☆185Updated 3 years ago
- Command-line tool to extract a ranked list of relevant keywords from a corpus with the option of using either topic modeling or tf-idf sc…☆41Updated 8 years ago
- Miscellaneous scripts to gather and process data of wikis.☆20Updated 2 years ago
- Source stories from the African Storybook Project in Markdown format☆22Updated last month
- Tools for tracking stories on news homepages☆48Updated 6 years ago
- A glossary for the United States.☆42Updated 10 years ago
- List of emoji rated for valence☆123Updated 3 years ago
- Fetch and parse the American Presidency Project's press-briefing and presidential-news-conference transcripts.☆11Updated 9 years ago
- Manifests of the public domain images uploaded to Flickr Commons, with descriptive information about the books they were taken from.☆75Updated 11 years ago
- Metaphor detection using NLP techniques, made in Python using NLTK☆18Updated 12 years ago
- linguistics backend☆42Updated 2 years ago