johko / awesome-german-open-source-mlLinks
A list of awesome open source projects in the machine learning field, who's developers are mainly based in Germany
☆46Updated last year
Alternatives and similar repositories for awesome-german-open-source-ml
Users that are interested in awesome-german-open-source-ml are comparing it to the libraries listed below
Sorting:
- Efficiently find the best-suited language model (LM) for your NLP task☆127Updated last month
- synthetic data for ml☆24Updated 7 months ago
- Plug-and-play document processing pipelines with zero-shot models.☆99Updated last month
- Toolkit for attaching, training, saving and loading of new heads for transformer models☆287Updated 6 months ago
- Low latency, High Accuracy, Custom Query routers for Humans and Agents. Built by Prithivi Da☆116Updated 5 months ago
- Generalist and Lightweight Model for Text Classification☆157Updated 3 months ago
- A template to kick-start your Python project ✨🚀☆52Updated 2 months ago
- Let's build better datasets, together!☆263Updated 8 months ago
- Minimal example scripts of the Hugging Face Trainer, focused on staying under 150 lines☆197Updated last year
- Mistral + Haystack: build RAG pipelines that rock 🤘☆105Updated last year
- The robust European language model benchmark.☆122Updated last week
- ☆210Updated 2 months ago
- Notebooks for training universal 0-shot classifiers on many different tasks☆137Updated 8 months ago
- SpanMarker for Named Entity Recognition☆451Updated 8 months ago
- Late Interaction Models Training & Retrieval☆584Updated this week
- 🔢 Work with static vector models☆29Updated 4 months ago
- Deliver safe & effective language models☆538Updated this week
- A guide book on data science for busy and equally lazy Data Scientists 😄☆133Updated 2 months ago
- just a bunch of useful embeddings for scikit-learn pipelines☆517Updated last month
- Chunk your text using gpt4o-mini more accurately☆44Updated last year
- A framework for evaluating semantic search across custom datasets, metrics, and embedding backends.☆35Updated 3 months ago
- This repository contains the code for dataset curation and finetuning of instruct variant of the Bilingual OpenHathi model. The resultin…☆23Updated last year
- A comprehensive guide to LLM evaluation methods designed to assist in identifying the most suitable evaluation techniques for various use…☆139Updated 3 weeks ago
- ☆124Updated 10 months ago
- A small library of LLM judges☆282Updated last month
- Datamodels for hugging face tokenizers☆47Updated last week
- The Batched API provides a flexible and efficient way to process multiple requests in a batch, with a primary focus on dynamic batching o…☆147Updated 2 months ago
- This is the reproduction repository for my 🤗 Hugging Face blog post on synthetic data☆68Updated last year
- FastFit ⚡ When LLMs are Unfit Use FastFit ⚡ Fast and Effective Text Classification with Many Classes☆212Updated 4 months ago
- awesome synthetic (text) datasets☆296Updated 2 months ago