danish-foundation-models / site
A project for training foundational Danish language models
☆72Updated this week
Alternatives and similar repositories for site:
Users who are interested in site are comparing it to the libraries listed below:
- A Scandinavian Benchmark for sentence embeddings☆36Updated last month
- The robust European language model benchmark.☆93Updated this week
- DaCy: The State of the Art Danish NLP pipeline using SpaCy☆95Updated 3 months ago
- A Danish-speaking language model with entity-aware self-attention☆9Updated 3 years ago
- Prompt templating and versioning using jinja2 and litellm 🔥☆16Updated last year
- A collection of Danish Transformers☆30Updated 3 years ago
- A visual labeling system implemented in Jupyter widgets.☆150Updated 4 months ago
- Late Interaction Models Training & Retrieval☆264Updated last week
- Ælæctra was created as part of a Cognitive Science bachelor thesis, in the attempt to enhance the Danish NLP community with a more effici…☆28Updated 2 years ago
- Robust and fast topic models with sentence-transformers.☆48Updated last week
- just a bunch of useful embeddings for scikit-learn pipelines☆493Updated last week
- Fine-tuning of transformers for Sentiment Analysis☆19Updated 3 years ago
- SpanMarker for Named Entity Recognition☆424Updated 2 months ago
- spaCy-wrap is a wrapper library for spaCy for including fine-tuned transformers from Huggingface in your spaCy pipeline allowing you to i…☆46Updated 11 months ago
- Neural Search☆352Updated 3 weeks ago
- Notebooks for training universal 0-shot classifiers on many different tasks☆120Updated 3 months ago
- Chance-corrected Agreement Coefficients☆21Updated 4 months ago
- Pre-trained Nordic models for BERT☆168Updated 3 years ago
- A python package for benchmarking interpretability techniques on Transformers.☆213Updated 6 months ago
- Augmenty is an augmentation library based on spaCy for augmenting texts.☆153Updated 10 months ago
- 🤘Lemmy is a lemmatizer for Danish 🇩🇰 and Swedish 🇸🇪☆76Updated 3 years ago
- Gain clues from clustering!☆313Updated 8 months ago
- FastFit ⚡ When LLMs are Unfit Use FastFit ⚡ Fast and Effective Text Classification with Many Classes☆190Updated 5 months ago
- A python package to run inference with HuggingFace language and vision-language checkpoints wrapping many convenient features.☆27Updated 6 months ago
- The Batched API provides a flexible and efficient way to process multiple requests in a batch, with a primary focus on dynamic batching o…☆126Updated 3 months ago
- ☆22Updated last year
- Danish Data Science Community's guide to sustainable data science☆19Updated 2 years ago
- Embedding Vector Oriented Clustering☆133Updated last month
- 🗺️ Data Cleaning and Textual Data Visualization 🗺️☆166Updated 9 months ago
- A collection of datasets for language model pretraining including scripts for downloading, preprocessing, and sampling.☆56Updated 8 months ago