Rijgersberg / GEITje
GEITje 7B: een groot open Nederlands taalmodel
☆116Updated last week
Related projects ⓘ
Alternatives and complementary repositories for GEITje
- An open, efficient LLM for Dutch☆35Updated 2 months ago
- Repository for the EM German Model☆104Updated 11 months ago
- A Scandinavian Benchmark for sentence embeddings☆27Updated last month
- A project for training foundational Danish language model☆68Updated this week
- ☆68Updated 8 months ago
- Evaluation of language models on mono- or multilingual tasks.☆73Updated this week
- Norwegian Transformer Model☆114Updated 7 months ago
- An easy way to chunk spaCy docs.☆15Updated 2 months ago
- 🗺️ Data Cleaning and Textual Data Visualization 🗺️☆145Updated 4 months ago
- Small python package to measure OCR quality and other related metrics.☆20Updated 8 months ago
- Using embeddings compressed by Product Quantization, in Javascript☆30Updated last year
- A spaCy wrapper for GliNER☆87Updated 3 months ago
- Tools for interactive visual exploration of semantic embeddings.☆28Updated 2 months ago
- An integration of Qdrant ANN vector database backend with txtai☆23Updated 2 months ago
- End-to-end zero-shot entity and relation extraction☆56Updated 3 months ago
- 📝 Reference-Free automatic summarization evaluation with potential hallucination detection☆99Updated 9 months ago
- Open source, local, and self-hosted highly optimized language inference server supporting ASR/STT, TTS, and LLM across WebRTC, REST, and …☆384Updated 5 months ago
- A collection of datasets for language model pretraining including scripts for downloading, preprocesssing, and sampling.☆52Updated 3 months ago
- 💥 Use Hugging Face text and token classification pipelines directly in spaCy☆62Updated 7 months ago
- A BERT-based application for reusable text classification at scale☆37Updated last year
- ☆47Updated last year
- 🚀 Template Haystack Search Application with Streamlit☆23Updated last year
- This repository contains an easy and intuitive approach to use SetFit in combination with spaCy.☆72Updated last year
- A multi-lingual approach to AllenNLP CoReference Resolution along with a wrapper for spaCy.☆103Updated 6 months ago
- Repository for "Towards Robust Named Entity Recognition for Historic German"☆18Updated 3 years ago
- Search through Facebook Research's PyTorch BigGraph Wikidata-dataset with the Weaviate vector search engine☆31Updated 2 years ago
- A spaCy wrapper of Entity-Fishing (component) for named entity disambiguation and linking on Wikidata☆152Updated 2 years ago
- A Dutch RoBERTa-based language model☆197Updated 7 months ago
- SpaCyEx allows the creation of spaCy Matcher patterns with RegEx like syntax.☆57Updated 6 months ago
- [EMNLP 2023 Demo] fabricator - annotating and generating datasets with large language models.☆101Updated 5 months ago