Ten Thousand German News Articles Dataset for Topic Classification
☆87Nov 7, 2022Updated 3 years ago
Alternatives and similar repositories for 10kGNAD
Users that are interested in 10kGNAD are comparing it to the libraries listed below
Sorting:
- ULMFiT Method for German Language☆15May 10, 2019Updated 6 years ago
- Language Model and Text Classification for German Language using Deep Learning☆18Jun 15, 2018Updated 7 years ago
- Plan and train German transformer models.☆23Feb 22, 2021Updated 5 years ago
- Annotated data set consisting of user comments posted to a German-language newspaper website☆17Jun 28, 2018Updated 7 years ago
- Toolkit to obtain and preprocess German text corpora, train models and evaluate them with generated testsets. Built with Gensim and Tenso…☆242Aug 21, 2024Updated last year
- ☆10Jul 15, 2024Updated last year
- A tokenizer and sentence splitter for German and English web and social media texts.☆153Dec 9, 2024Updated last year
- DBMDZ BERT, DistilBERT, ELECTRA, GPT-2 and ConvBERT models☆159Dec 6, 2022Updated 3 years ago
- ☆15Nov 11, 2023Updated 2 years ago
- GC4LM: A Colossal (Biased) language model for German☆13May 2, 2021Updated 4 years ago
- Implementation of https://arxiv.org/abs/1904.00962☆15Aug 30, 2019Updated 6 years ago
- Example project for running LensKit experiments☆13Apr 24, 2025Updated 10 months ago
- To be a next-generation DL-based phenotype prediction from genome mutations.☆18May 17, 2021Updated 4 years ago
- Combine two wikipedia pages to make new facts. Tweets @brand_new_facts☆18Sep 18, 2018Updated 7 years ago
- Repository for "Towards Robust Named Entity Recognition for Historic German"☆18Dec 11, 2020Updated 5 years ago
- Scripts for "Deploy ML to production" workshop☆23Apr 25, 2018Updated 7 years ago
- The first, open access evaluation dataset for methods to identify bias by word choice and labeling☆26Oct 30, 2025Updated 4 months ago
- This is a prototype of a multi-lingual suite for named-entity recognition in Python.☆21Apr 25, 2024Updated last year
- ☆11Aug 2, 2024Updated last year
- Exploring Few-Shot Adaptation of Language Models with Tables☆24Aug 22, 2022Updated 3 years ago
- An entity linking prototype, developed using the datasets from the TAC-KBP sub-task.☆28Apr 5, 2017Updated 8 years ago
- A part-of-speech tagger with support for domain adaptation and external resources.☆24Oct 26, 2022Updated 3 years ago
- German lemmatization with IWNLP as extension for spaCy☆26Jul 28, 2023Updated 2 years ago
- German Parliamentary Corpus (GerParCor)☆30Jan 14, 2026Updated last month
- ☆31Nov 14, 2024Updated last year
- Fast & easy transfer learning for NLP. Harvesting language models for the industry. Focus on Question Answering.☆1,752Dec 20, 2023Updated 2 years ago
- Code and data for Teddy https://arxiv.org/abs/2001.05171.☆15Jun 21, 2022Updated 3 years ago
- ☆35Dec 26, 2022Updated 3 years ago
- framework for doing NER and other types of entity recognition, in Python☆68Jun 21, 2022Updated 3 years ago
- General-Purpose Neural Networks for Sentence Boundary Detection☆73Mar 27, 2023Updated 2 years ago
- OffeneRegister.de – Offene Daten für das Handelsregister☆34Feb 2, 2026Updated last month
- A python script to add Songs from Spotify Playlists to YouTube Playlists.☆10Mar 1, 2023Updated 3 years ago
- Juery is a tiny Java library to manage search and filter query from user to database.☆12Jan 27, 2026Updated last month
- A simple UI client for Aerospike DB☆11Aug 3, 2023Updated 2 years ago
- An FFmpeg Wrapper with focus on Complex Filter☆11Jul 7, 2023Updated 2 years ago
- Deep Learning Part 2, 2019 edition - transcriptions, screenshots and notebooks☆11Jul 19, 2019Updated 6 years ago
- This tool emulates the modbus registers of a SDM630 (Single/Three Phase Power Meter) from Eastron (pymodbus, Raspberry). Useful for Growa…☆11May 17, 2025Updated 9 months ago
- This is check50, a command-line program with which you can check the correctness of your programs.☆12Nov 13, 2021Updated 4 years ago
- Smart Meter Data Collector☆12Jul 17, 2024Updated last year