Rijgersberg / GEITje
GEITje 7B: a large open Dutch language model
☆128 Updated 7 months ago
Alternatives and similar repositories for GEITje
Users who are interested in GEITje are comparing it to the repositories listed below
- Norwegian Transformer Model ☆116 Updated 9 months ago
- The robust European language model benchmark. ☆120 Updated this week
- A Scandinavian Benchmark for sentence embeddings ☆40 Updated 3 months ago
- An open, efficient LLM for Dutch ☆54 Updated 2 months ago
- Use machine learning to make your institutional communication more understandable and inclusive. ☆47 Updated last month
- Repository for the EM German Model ☆112 Updated last year
- The website for Danish Foundation Models, a project for training foundational Danish language models. ☆74 Updated last week
- A spaCy wrapper for GliNER ☆119 Updated 7 months ago
- Various training, inference and validation code and results related to open LLMs that were pretrained (fully or partially) on the Dutch l… ☆32 Updated last year
- A repository of instructions in French to fine-tune LLMs ☆17 Updated 2 years ago
- Generalist and Lightweight Model for Relation Extraction (Extract any relationship types from text) ☆236 Updated 2 months ago
- 🗺️ Data Cleaning and Textual Data Visualization 🗺️ ☆185 Updated 3 months ago
- A spaCy wrapper of Entity-Fishing (component) for named entity disambiguation and linking on Wikidata ☆164 Updated 2 years ago
- Sentiment Corpus for Swedish 🇸🇪 Norwegian 🇳🇴 Danish 🇩🇰 Finnish 🇫🇮 (and English 🏴) ☆15 Updated 4 years ago
- Large-scale language models for Norwegian ☆39 Updated last month
- A BERT-based application for reusable text classification at scale ☆38 Updated 2 years ago
- This repository contains an easy and intuitive approach to using SetFit in combination with spaCy. ☆80 Updated 2 years ago
- Neural Search ☆363 Updated 5 months ago
- German Dataset for Legal Information Retrieval ☆20 Updated last year
- spaCy-wrap is a wrapper library for spaCy for including fine-tuned transformers from Huggingface in your spaCy pipeline allowing you to i… ☆46 Updated last year
- The official repository for Toxic Commons and Celadon. Toxicity Classification for public domain data. ☆18 Updated 9 months ago
- A spaCy wrapper of OpenTapioca for named entity linking on Wikidata ☆94 Updated 2 years ago
- A simple tool for splitting up an ebook into its chapters. Works well with Project Gutenberg texts. May also be used to clean up books fo… ☆110 Updated 6 years ago
- SpanMarker for Named Entity Recognition ☆451 Updated 7 months ago
- Norwegian Speech Transformer Models ☆18 Updated 9 months ago
- ☆67 Updated last year
- German Text Embedding Clustering Benchmark ☆18 Updated last year
- Mining Legal Arguments in Court Decisions - Data and software ☆69 Updated 2 years ago
- 📜 Dehyphenation of broken text (mainly German), i.e., extracted from a PDF ☆39 Updated 3 years ago
- ☆168 Updated last year