johko / awesome-german-open-source-ml
A list of awesome open source projects in the machine learning field whose developers are mainly based in Germany
☆43Updated 8 months ago
Alternatives and similar repositories for awesome-german-open-source-ml
Users that are interested in awesome-german-open-source-ml are comparing it to the libraries listed below
- Efficiently find the best-suited language model (LM) for your NLP task☆122Updated 2 weeks ago
- Generalist and Lightweight Model for Text Classification☆128Updated 2 weeks ago
- Mistral + Haystack: build RAG pipelines that rock 🤘☆103Updated last year
- synthetic data for ml☆23Updated 3 months ago
- The robust European language model benchmark.☆101Updated this week
- This is the reproduction repository for my 🤗 Hugging Face blog post on synthetic data☆68Updated last year
- A collection of datasets for language model pretraining, including scripts for downloading, preprocessing, and sampling.☆59Updated 9 months ago
- A comprehensive guide to LLM evaluation methods designed to assist in identifying the most suitable evaluation techniques for various use…☆116Updated last week
- Minimal example scripts of the Hugging Face Trainer, focused on staying under 150 lines☆198Updated last year
- A notebook-based tutorial series on building an LLM from scratch☆24Updated 7 months ago
- A BERT-based application for reusable text classification at scale☆38Updated last year
- Let's build better datasets, together!☆259Updated 4 months ago
- ☆123Updated 6 months ago
- Low latency, High Accuracy, Custom Query routers for Humans and Agents. Built by Prithivi Da☆103Updated last month
- Toolkit for attaching, training, saving and loading of new heads for transformer models☆276Updated 2 months ago
- [EMNLP 2023 Demo] fabricator - annotating and generating datasets with large language models.☆108Updated last year
- A Python library aimed at dissecting and augmenting NER training data.☆58Updated 2 years ago
- Plug-and-play document processing pipelines with zero-shot models.☆61Updated this week
- Notebooks for training universal 0-shot classifiers on many different tasks☆125Updated 4 months ago
- A template to kick-start your Python project ✨🚀☆51Updated 4 months ago
- Chunk your text more accurately using gpt4o-mini☆44Updated 9 months ago
- Generalist and Lightweight Model for Relation Extraction (Extract any relationship types from text)☆207Updated last month
- A guide book on data science for busy and equally lazy Data Scientists 😄☆131Updated 3 weeks ago
- This repository contains the code for dataset curation and finetuning of instruct variant of the Bilingual OpenHathi model. The resultin…☆23Updated last year
- ☆90Updated 5 months ago
- Starter pack for NeurIPS LLM Efficiency Challenge 2023.☆124Updated last year
- Pre-train Static Word Embeddings☆60Updated last month
- Sample notebooks and prompts for LLM evaluation☆126Updated last week
- FastFit ⚡ When LLMs are Unfit Use FastFit ⚡ Fast and Effective Text Classification with Many Classes☆204Updated last week
- Materials for workshops on the Hugging Face ecosystem☆150Updated 2 years ago