BramVanroy / fietje-2
An open, efficient LLM for Dutch
☆54Updated 2 months ago
Alternatives and similar repositories for fietje-2
Users who are interested in fietje-2 are comparing it to the repositories listed below.
- The robust European language model benchmark.☆114Updated this week
- GEITje 7B: a large open Dutch language model☆128Updated 6 months ago
- The website for Danish Foundation Models, a project for training foundational Danish language models.☆74Updated this week
- Norwegian Transformer Model☆117Updated 8 months ago
- Camoscio: An Italian instruction-tuned language model based on LLaMA☆127Updated last year
- ☆110Updated last year
- Code for WECHSEL: Effective initialization of subword embeddings for cross-lingual transfer of monolingual language models.☆82Updated 10 months ago
- A spaCy custom component that extracts and normalizes temporal expressions☆55Updated 2 years ago
- LTG-Bert☆33Updated last year
- Bagpipes spaCy is a collection of custom spaCy pipeline components designed to enhance text processing capabilities.☆18Updated 11 months ago
- This repo provides a Python module to work with Open Dutch WordNet. It was created using Python 3.4.☆67Updated 4 years ago
- A Python library aimed at dissecting and augmenting NER training data.☆58Updated 2 years ago
- German Language Understanding Evaluation Benchmark @NAACL24☆12Updated 2 weeks ago
- A collection of datasets for language model pretraining, including scripts for downloading, preprocessing, and sampling.☆59Updated last year
- SpaCyEx allows the creation of spaCy Matcher patterns with RegEx like syntax.☆59Updated last year
- AfroLID, a powerful neural toolkit for African language identification that covers 517 African languages.☆31Updated 5 months ago
- This repository contains a demonstrative implementation for pooling-based models, e.g., DeepPyramidion complementing our paper "Sparsifyi…☆14Updated 3 years ago
- Lightweight self-hosted span annotation tool☆34Updated 2 months ago
- A Scandinavian Benchmark for sentence embeddings☆40Updated 2 months ago
- A Python package for running inference with HuggingFace language and vision-language checkpoints, wrapping many convenient features.☆28Updated 10 months ago
- DaCy: The State of the Art Danish NLP pipeline using SpaCy☆97Updated 7 months ago
- REMERGE - Multi-Word Expression discovery algorithm☆14Updated 2 years ago
- BERTje is a Dutch pre-trained BERT model developed at the University of Groningen. (EMNLP Findings 2020) "What’s so special about BERT’s …☆139Updated 2 years ago
- NTREX -- News Test References for MT Evaluation☆85Updated last year
- Code for SaGe subword tokenizer (EACL 2023)☆25Updated 8 months ago
- German small and large versions of GPT2.☆20Updated 3 years ago
- A PyTorch Lightning Callback for pushing models to the Hugging Face Hub 🤗⚡️☆36Updated 3 years ago
- ☆22Updated last year
- A survey of corpora for Germanic low-resource languages and dialects☆25Updated 8 months ago
- Augmenty is an augmentation library based on spaCy for augmenting texts.☆156Updated last year