BramVanroy / fietje-2
An open, efficient LLM for Dutch
☆49Updated this week
Alternatives and similar repositories for fietje-2
Users who are interested in fietje-2 are comparing it to the repositories listed below.
- The robust European language model benchmark.☆104Updated this week
- GEITje 7B: a large open Dutch language model☆128Updated 4 months ago
- A collection of datasets for language model pretraining, including scripts for downloading, preprocessing, and sampling.☆59Updated 10 months ago
- A repository containing the code for translating popular LLM benchmarks to German.☆25Updated last year
- A Scandinavian Benchmark for sentence embeddings☆38Updated 2 weeks ago
- A Python package for running inference with HuggingFace language and vision-language checkpoints, wrapping many convenient features.☆27Updated 8 months ago
- Materials for "IT5: Large-scale Text-to-text Pretraining for Italian Language Understanding and Generation" 🇮🇹☆30Updated 11 months ago
- German Text Embedding Clustering Benchmark☆17Updated last year
- The website for Danish Foundation Models, a project for training foundational Danish language models.☆74Updated 2 weeks ago
- Framework for unified summarisation and evaluation of English documents using state-of-the-art models and measures.☆32Updated last year
- Lightweight self-hosted span annotation tool☆33Updated last week
- Generalist and Lightweight Model for Text Classification☆128Updated 2 weeks ago
- Repository for the EM German Model☆109Updated last year
- German Alpaca Dataset (Cleaned + Translated)☆24Updated 2 years ago
- Find informative examples to efficiently (human)-evaluate NLG models.☆11Updated last week
- Repository collecting resources and best practices to improve experimental rigour in deep learning research.☆27Updated 2 years ago
- A spaCy custom component that extracts and normalizes temporal expressions☆54Updated 2 years ago
- Multilingual entity linking with the BELA model☆12Updated last year
- Code to create the dataset from "A New Aligned Simple German Corpus"☆10Updated last year
- REMERGE - Multi-Word Expression discovery algorithm☆14Updated 2 years ago
- SpaCyEx allows the creation of spaCy Matcher patterns with RegEx-like syntax.☆59Updated last year
- A framework for few-shot evaluation of autoregressive language models.☆13Updated last year
- Fact checking baseline combining dense retrieval and textual entailment☆29Updated 4 months ago
- Repository containing the code for training the CroissantLLM☆21Updated last year
- I.PHI dataset generation☆25Updated last year
- A survey of corpora for Germanic low-resource languages and dialects☆25Updated 6 months ago
- A High-level Library for Named Entity Recognition in Python.☆23Updated last year
- [EMNLP'23] Official Code for "FOCUS: Effective Embedding Initialization for Monolingual Specialization of Multilingual Models"☆31Updated 7 months ago
- BERT and ELECTRA models trained on Europeana Newspapers☆38Updated 3 years ago
- ☆27Updated 3 months ago