Annotated data set consisting of user comments posted to a German-language newspaper website
☆17Jun 28, 2018Updated 7 years ago
Alternatives and similar repositories for million-post-corpus
Users that are interested in million-post-corpus are comparing it to the libraries listed below
Sorting:
- Plan and train German transformer models.☆23Feb 22, 2021Updated 5 years ago
- Python deep learning framework including [Convolutional] Restricted Boltzmann Machines (RBMs), [Convolutional] Neural Networks and Auto-E…☆14Jan 10, 2017Updated 9 years ago
- Hubness analysis and removal functions☆19Apr 11, 2023Updated 2 years ago
- This is a german ELMo deep contextualized word representation. It is trained on a special German Wikipedia Text Corpus.☆28Dec 15, 2019Updated 6 years ago
- Code and data for Teddy https://arxiv.org/abs/2001.05171.☆15Jun 21, 2022Updated 3 years ago
- NER tagger for English, Spanish, Dutch, Italian and German and French.☆35Nov 20, 2015Updated 10 years ago
- Python code to automatically produce a summary of a piece of text.☆12Sep 8, 2016Updated 9 years ago
- ☆10Jul 6, 2023Updated 2 years ago
- ☆11Jan 27, 2026Updated last month
- ATC-Anno is an annotation tool for Air Traffic Control data that offers automatic semantic and concept annotation.☆12Nov 17, 2023Updated 2 years ago
- Gretchen - An Open-Source Humanoid Robot Development Platform☆11Jul 8, 2019Updated 6 years ago
- Crisis Event Extraction Service (CREES)☆15Feb 4, 2019Updated 7 years ago
- Ten Thousand German News Articles Dataset for Topic Classification☆87Nov 7, 2022Updated 3 years ago
- Command-line corpus tools☆12May 15, 2017Updated 8 years ago
- ☆10Dec 16, 2022Updated 3 years ago
- Compute the most likely permutation of a lattice given an LM☆10Jan 3, 2013Updated 13 years ago
- Poetry Corpora Annotated on Aesthetic Emotions☆12Aug 2, 2022Updated 3 years ago
- JAX notebook showing how to LoRA + GPTQ arbitrary models☆10Aug 8, 2023Updated 2 years ago
- Sosia: Automatic author matching in Scopus on-line☆12Jun 21, 2025Updated 8 months ago
- Replaces occurrences of the word 'literally' with 'figuratively'. That's literally all it does.☆45Nov 7, 2014Updated 11 years ago
- A rolling version of the Latent Dirichlet Allocation.☆13Nov 27, 2023Updated 2 years ago
- Ukrainian ELECTRA model☆12Mar 11, 2023Updated 2 years ago
- brat rapid annotation tool (brat) - for all your textual annotation needs☆10Feb 3, 2018Updated 8 years ago
- HFST optimized-lookup standalone library and command line tool☆13Feb 27, 2018Updated 8 years ago
- Official code for AAAI'20 paper "Merging Weak and Active Supervision for Semantic Parsing"☆11Dec 8, 2022Updated 3 years ago
- Prototype implementation of an architecture suggested in Robot Dream paper (http://arxiv.org/abs/1603.03007)☆12Jul 3, 2019Updated 6 years ago
- ☆14Sep 11, 2025Updated 5 months ago
- Qt for Python workshop☆11Nov 23, 2021Updated 4 years ago
- ☆10Nov 10, 2021Updated 4 years ago
- An agent-based model for scientific inquiry based on abstract argumentation☆13Jan 17, 2022Updated 4 years ago
- OSX menu bar controlled Tor relay server☆17Oct 28, 2014Updated 11 years ago
- Multilingual acoustic word embedding approaches applied and evaluated on GlobalPhone data.☆11Nov 3, 2020Updated 5 years ago
- AI agents playing Clash Royale autonomously. Claude Code + multi-agent architecture reached 1000+ trophies live on Twitch.☆16Jan 25, 2026Updated last month
- FEERCI: A Package for Fast non-parametric confidence intervals for Equal Error Rates☆12Mar 13, 2024Updated last year
- A super simple html + css + js ollama chat interface for hacking on.☆16May 1, 2025Updated 10 months ago
- Yelp Restaurant Photo Classification - Kaggle competition☆12Apr 19, 2019Updated 6 years ago
- Extract plain text from Arabic Wikipedia dumps.☆13Jun 15, 2014Updated 11 years ago
- Open-source AI for voice control, rivaling Alexa and Siri☆13Mar 9, 2024Updated last year
- TTS for Singlish using Tacotron2, the IMDA corpus, and Pachyderm.☆11Jan 11, 2020Updated 6 years ago