BramVanroy / fietje-2Links
An open, efficient LLM for Dutch
☆57Updated 4 months ago
Alternatives and similar repositories for fietje-2
Users that are interested in fietje-2 are comparing it to the libraries listed below
Sorting:
- GEITje 7B: een groot open Nederlands taalmodel☆127Updated 8 months ago
- The website for Danish Foundation Models, a project for training foundational Danish language model.☆75Updated last month
- The robust European language model benchmark.☆129Updated this week
- Get ready to meet Fauno - the Italian language model crafted by the RSTLess Research Group from the Sapienza University of Rome.☆85Updated 2 years ago
- Repository for the EM German Model☆112Updated last year
- A Scandinavian Benchmark for sentence embeddings☆41Updated 4 months ago
- Camoscio: An Italian instruction-tuned language model based on LLaMA☆127Updated last year
- German Language Understanding Evaluation Benchmark @NAACL24☆17Updated last month
- A collection of datasets for language model pretraining including scripts for downloading, preprocesssing, and sampling.☆61Updated last year
- A repository containing the code for translating popular LLM benchmarks to German.☆30Updated 2 years ago
- Lightweight self-hosted span annotation tool☆34Updated 3 weeks ago
- A spaCy wrapper for GliNER☆122Updated 8 months ago
- Page de préconfiguration de la communauté OpenLLM-France☆48Updated last year
- SDLG is an efficient method to accurately estimate aleatoric semantic uncertainty in LLMs☆27Updated last year
- SpaCyEx allows the creation of spaCy Matcher patterns with RegEx like syntax.☆59Updated last year
- Repository containing the code for training the CroissantLLM☆21Updated last year
- REMERGE - Multi-Word Expression discovery algorithm☆14Updated 3 years ago
- 🔢 Work with static vector models☆30Updated 5 months ago
- Norwegian Transformer Model☆116Updated 10 months ago
- Full named-entity (i.e., not tag/token) evaluation metrics based on SemEval’13☆193Updated last month
- negate_sentence(A Python module that doesn't negate sentences.)☆31Updated last year
- Plug-and-play, zero-shot document processing pipelines.☆107Updated this week
- TextComplexityDE dataset consists of 1000 sentences in the German language with subjective complexity rating, collected from German learn…☆13Updated 3 years ago
- BERTje is a Dutch pre-trained BERT model developed at the University of Groningen. (EMNLP Findings 2020) "What’s so special about BERT’s …☆139Updated 2 years ago
- benchmarks for LLM tokenizers☆14Updated last month
- Code to create the dataset from "A New Aligned Simple German Corpus☆11Updated last year
- AfroLID, a powerful neural toolkit for African languages identification which covers 517 African languages.☆32Updated 7 months ago
- Semantically Structured Sentence Embeddings☆67Updated last year
- A Python library aimed at dissecting and augmenting NER training data.☆58Updated 2 years ago
- Control the quality of your labeled data with the Python tools you already know.☆233Updated this week