centre-for-humanities-computing / danish-foundation-models
A project for training foundational Danish language models
☆68Updated 3 weeks ago
Related projects:
- A Scandinavian Benchmark for sentence embeddings☆27Updated 3 weeks ago
- Evaluation of language models on mono- or multilingual tasks.☆71Updated last month
- DaCy: The State of the Art Danish NLP pipeline using SpaCy☆91Updated 7 months ago
- 🗺️ Data Cleaning and Textual Data Visualization 🗺️☆131Updated 3 months ago
- Fine-tuning of transformers for Sentiment Analysis☆19Updated 3 years ago
- A collection of Danish Transformers☆30Updated 3 years ago
- Robust and fast topic models with sentence-transformers.☆13Updated last week
- A Danish-speaking language model with entity-aware self-attention☆9Updated 2 years ago
- Prompt templating and versioning using jinja2 and litellm 🔥☆15Updated 7 months ago
- Gain clues from clustering!☆302Updated 2 months ago
- just a bunch of useful embeddings☆458Updated last week
- Notebooks for training universal 0-shot classifiers on many different tasks☆100Updated 5 months ago
- Danish Data Science Community's guide to sustainable data science☆16Updated last year
- FastFit ⚡ When LLMs are Unfit Use FastFit ⚡ Fast and Effective Text Classification with Many Classes☆180Updated last month
- SpanMarker for Named Entity Recognition☆384Updated last month
- Chance-corrected Agreement Coefficients☆19Updated 3 weeks ago
- Neural Search☆333Updated 3 months ago
- Generalist and Lightweight Model for Relation Extraction (Extract any relationship types from text)☆77Updated this week
- Embedding Vector Oriented Clustering☆96Updated 3 months ago
- spaCy-wrap is a wrapper library for spaCy for including fine-tuned transformers from Huggingface in your spaCy pipeline allowing you to i…☆46Updated 5 months ago
- A list of awesome open source projects in the machine learning field, whose developers are mainly based in Germany☆36Updated last week
- Let's build better datasets, together!☆195Updated last month
- 💫 SpaCy wrapper for ConceptNet 💫☆88Updated last year
- A visual labeling system implemented in Jupyter widgets.☆140Updated last month
- This repository contains an easy and intuitive approach to use SetFit in combination with spaCy.☆71Updated last year
- Topic modeling helpers using managed language models from Cohere. Name text clusters using large GPT models.☆214Updated last year
- A Python package for benchmarking interpretability techniques on Transformers.☆207Updated 2 months ago
- A spaCy wrapper for GliNER☆77Updated 2 months ago
- Sentiment Corpus for Swedish 🇸🇪 Norwegian 🇳🇴 Danish 🇩🇰 Finnish 🇫🇮 (and English 🏴)☆15Updated 3 years ago