centre-for-humanities-computing / danish-foundation-models
A project for training foundational Danish language model
☆68Updated this week
Related projects ⓘ
Alternatives and complementary repositories for danish-foundation-models
- A Scandinavian Benchmark for sentence embeddings☆28Updated last week
- Evaluation of language models on mono- or multilingual tasks.☆75Updated last week
- DaCy: The State of the Art Danish NLP pipeline using SpaCy☆93Updated 3 weeks ago
- A collection of Danish Transformers☆30Updated 3 years ago
- just a bunch of useful embeddings☆467Updated 2 months ago
- Fine-tuning of transformers for Sentiment Analysis☆19Updated 3 years ago
- A Danish-speaking language model with entity-aware self-attention☆9Updated 2 years ago
- The Fastest State-of-the-Art Static Embeddings in the World☆473Updated this week
- Ælæctra was created as part of a Cognitive Science bachelor thesis, in the attempt to enhance the Danish NLP community with a more effici…☆25Updated 2 years ago
- Prompt templating and versioning using jinja2 and litellm 🔥☆15Updated 10 months ago
- spaCy-wrap is a wrapper library for spaCy for including fine-tuned transformers from Huggingface in your spaCy pipeline allowing you to i…☆46Updated 7 months ago
- 📚 Process PDFs, Word documents and more with spaCy☆75Updated this week
- Sentiment Corpus for Swedish 🇸🇪 Norwegian 🇳🇴 Danish 🇩🇰 Finnish 🇫🇮 (and English 🏴)☆15Updated 3 years ago
- ☆66Updated this week
- Embedding Vector Oriented Clustering☆116Updated last week
- The Danish Gigaword project☆15Updated 3 years ago
- Danish Data Science Community's guide to sustainable data science☆17Updated last year
- Neural Search☆344Updated 5 months ago
- Topic modeling helpers using managed language models from Cohere. Name text clusters using large GPT models.☆217Updated last year
- FastFit ⚡ When LLMs are Unfit Use FastFit ⚡ Fast and Effective Text Classification with Many Classes☆183Updated last month
- Tag grants with MeSH and other tags☆14Updated 9 months ago
- 🗺️ Data Cleaning and Textual Data Visualization 🗺️☆146Updated 5 months ago
- Chance-corrected Agreement Coefficients☆21Updated last week
- DaNLP is a repository for Natural Language Processing resources for the Danish Language.☆198Updated 11 months ago
- A visual labeling system implemented in Jupyter widgets.☆146Updated last week
- A curated list of awesome resources for Danish language technology☆165Updated 2 weeks ago
- SpanMarker for Named Entity Recognition☆401Updated 3 months ago
- In-Context Learning for eXtreme Multi-Label Classification (XMC) using only a handful of examples.☆386Updated 9 months ago
- Late Interaction Models Training & Retrieval☆165Updated this week
- SciRepEval benchmark training and evaluation scripts☆67Updated 6 months ago