stefan-it/gc4lm

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/stefan-it/gc4lm)

stefan-it / gc4lm

GC4LM: A Colossal (Biased) language model for German

☆13

Alternatives and similar repositories for gc4lm

Users that are interested in gc4lm are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

dbmdz / historic-ner
View on GitHub
Repository for "Towards Robust Named Entity Recognition for Historic German"
☆18Dec 11, 2020Updated 5 years ago
rsling / texrex
View on GitHub
texrex web page cleaning & ClaraX random walk crawler
☆11Dec 13, 2021Updated 4 years ago
stefan-it / europeana-bert
View on GitHub
BERT and ELECTRA models trained on Europeana Newspapers
☆39Dec 14, 2021Updated 4 years ago
NorskRegnesentral / NeuralTextSanitizer
View on GitHub
Neural models for detecting and masking personal information from texts
☆16Nov 25, 2022Updated 3 years ago
German-NLP-Group / german-transformer-training
View on GitHub
Plan and train German transformer models.
☆23Feb 22, 2021Updated 5 years ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
NbAiLab / nostram
View on GitHub
Norwegian Speech Transformer Models
☆19Mar 26, 2026Updated 4 months ago
stefan-it / german-gpt2
View on GitHub
German GPT-2 model
☆32Aug 17, 2021Updated 4 years ago
CITlabRostock / citlab-article-separation-new
View on GitHub
Modules used for separating articles in (historical) newspapers and similar documents. This repository is part of the European Union's Ho…
☆22Sep 2, 2022Updated 3 years ago
ArneBinder / GlomImpl
View on GitHub
Implementation of the GLOM model for text
☆11Mar 4, 2021Updated 5 years ago
AI21Labs / pmi-masking
View on GitHub
This repository includes the masking vocabulary used in the ICLR 2021 spotlight PMI-Masking paper
☆14Aug 9, 2021Updated 4 years ago
impresso / CLEF-HIPE-2020
View on GitHub
Identifying Historical People, Places and other Entities: Shared Task on Named Entity Recognition and Linking on Historical Newspapers at…
☆21Aug 1, 2024Updated last year
ejmichaud / precision-ml
View on GitHub
☆13Feb 12, 2023Updated 3 years ago
yamac-kurtulus / Windows-Docker-Images
View on GitHub
Some Windows images for tool images that I had to use in a Windows Environment.
☆10Sep 27, 2020Updated 5 years ago
NationalLibraryOfNorway / DHLAB
View on GitHub
DHLAB is a library of python modules for accessing text and pictures at the National Library of Norway.
☆26Apr 21, 2026Updated 3 months ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
copenlu / cite-worth
View on GitHub
Data and code for the paper "CiteWorth: Cite-Worthiness Detection for Improved Scientific Document Understanding"
☆14Sep 8, 2022Updated 3 years ago
harvard-lil / WARC-diff-tools
View on GitHub
Comparing warc files
☆17Feb 21, 2019Updated 7 years ago
DFKI-NLP / REval
View on GitHub
[ACL 20] Probing Linguistic Features of Sentence-level Representations in Neural Relation Extraction
☆13Apr 21, 2020Updated 6 years ago
qurator-spk / neat
View on GitHub
Named entity annotation tool
☆28Jul 6, 2023Updated 3 years ago
seuretm / printed-vs-handwritten
View on GitHub
☆30Jul 17, 2019Updated 7 years ago
ncg-task / training-data
View on GitHub
Training data for the NLPContributionGraph Shared Task 11 at SemEval-2021
☆14Jan 11, 2021Updated 5 years ago
openredact / nerwhal
View on GitHub
This is a prototype of a multi-lingual suite for named-entity recognition in Python. ➡️ The project has moved to: https://gitlab.opencode…
☆21Mar 20, 2026Updated 4 months ago
Kungbib / kblab
View on GitHub
KB data lab
☆10Dec 8, 2020Updated 5 years ago
pdufter / staticlama
View on GitHub
☆13Apr 16, 2021Updated 5 years ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
ubbdst / elasticsearch-rdf-river
View on GitHub
RDF river plugin for harvesting metadata from Jena TDB, SPARQL endpoints or plain RDF files into Elasticsearch
☆10May 20, 2022Updated 4 years ago
nrkno / samnorsk
View on GitHub
Elastic support for Bokmål/Nynorsk
☆32Mar 30, 2017Updated 9 years ago
UB-Mannheim / GTCheck
View on GitHub
Check your modified Ground Truth files with visual support!
☆10Jan 31, 2024Updated 2 years ago
wbstack / deploy
View on GitHub
Cloud and Kubernetes configuration for deployment for wbstack.com. You'll want to look at the wikibase.cloud deploy repository soon!
☆12Feb 9, 2024Updated 2 years ago
pd3f / dehyphen
View on GitHub
📜 Dehyphenation of broken text (mainly German), i.e., extracted from a PDF
☆39Mar 8, 2022Updated 4 years ago
openredact / openredact-app
View on GitHub
This is a prototype of a semi-automatic data anonymization app for German documents. ➡️ The project has moved to: https://gitlab.opencode…
☆24Mar 20, 2026Updated 4 months ago
johnnovak / twyg
View on GitHub
Generative tree visualiser for Python
☆16Sep 15, 2020Updated 5 years ago
johnsamuelwrites / ShExStatements
View on GitHub
generate shape expressions from CSV
☆11Jun 19, 2026Updated last month
alexa / ramen
View on GitHub
A software for transferring pre-trained English models to foreign languages
☆20Mar 20, 2023Updated 3 years ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
dbmdz / berts
View on GitHub
DBMDZ BERT, DistilBERT, ELECTRA, GPT-2 and ConvBERT models
☆158Dec 6, 2022Updated 3 years ago
stefan-it / fine-tuned-berts-seq
View on GitHub
Fine-tuned Transformers compatible BERT models for Sequence Tagging
☆40Jul 17, 2020Updated 6 years ago
altoxml / schema
View on GitHub
ALTO XML schema - latest and all former versions
☆55Jul 8, 2026Updated 2 weeks ago
julien-nc / integration_suitecrm
View on GitHub
Integration of SuiteCRM into Nextcloud
☆19Nov 12, 2021Updated 4 years ago
cisnlp / semi-markov-crf
View on GitHub
Code for paper "Neural Semi-Markov Conditional Random Fields for Robust Character-Based Part-of-Speech Tagging"
☆16May 31, 2019Updated 7 years ago
KorAP / Krill
View on GitHub
A Corpus Data Retrieval Index using Lucene for Look-Ups
☆20Updated this week
ICASSP2021-tutorial9 / Distant_conversational_ASR_and_analysis
View on GitHub
☆12Jun 10, 2021Updated 5 years ago