Rijgersberg / GEITje
GEITje 7B: een groot open Nederlands taalmodel
☆124Updated 3 weeks ago
Alternatives and similar repositories for GEITje:
Users that are interested in GEITje are comparing it to the libraries listed below
- An open, efficient LLM for Dutch☆44Updated last month
- A Scandinavian Benchmark for sentence embeddings☆33Updated last week
- Evaluation of language models on mono- or multilingual tasks.☆81Updated this week
- Norwegian Transformer Model☆115Updated 2 months ago
- A project for training foundational Danish language model☆71Updated last week
- DaCy: The State of the Art Danish NLP pipeline using SpaCy☆95Updated last month
- A repository of instructions in French to fine-tune LLMs☆17Updated last year
- An EUR-Lex parser for Python.☆29Updated 7 months ago
- Various training, inference and validation code and results related to Open LLM's that were pretrained (full or partially) on the Dutch l…☆28Updated 10 months ago
- A BERT-based application for reusable text classification at scale☆37Updated last year
- The official repository for Toxic Commons and Celadon. Toxicity Classification for public domain data.☆13Updated 3 months ago
- spaCy-wrap is a wrapper library for spaCy for including fine-tuned transformers from Huggingface in your spaCy pipeline allowing you to i…☆46Updated 10 months ago
- ☆67Updated 11 months ago
- ☆45Updated 2 weeks ago
- Sentiment Corpus for Swedish 🇸🇪 Norwegian 🇳🇴 Danish 🇩🇰 Finnish 🇫🇮 (and English 🏴)☆15Updated 3 years ago
- Norwegian Speech Transformer Models☆18Updated 3 months ago
- Using embeddings compressed by Product Quantization, in Javascript☆31Updated last year
- Alpino parser and related tools for Dutch☆23Updated this week
- This repo provides a python module to work with Open Dutch WordNet. It was created using python 3.4.☆65Updated 3 years ago
- Repository for the EM German Model☆106Updated last year
- A collection of datasets for language model pretraining including scripts for downloading, preprocesssing, and sampling.☆55Updated 6 months ago
- LLM plugin providing access to Mistral models using the Mistral API☆167Updated 3 weeks ago
- Robust and fast topic models with sentence-transformers.☆42Updated this week
- 🗺️ Data Cleaning and Textual Data Visualization 🗺️☆163Updated 8 months ago
- Information extraction from English and German texts based on predicate logic☆135Updated last year
- An easy way to chunk spaCy docs.☆19Updated 6 months ago
- A spaCy wrapper for GliNER☆108Updated 3 weeks ago