German Alpaca Dataset (Cleaned + Translated)
☆26Apr 6, 2023Updated 2 years ago
Alternatives and similar repositories for GerAlpacaDataCleaned
Users who are interested in GerAlpacaDataCleaned are comparing it to the repositories listed below.
- German dataset for DPR model training☆19Jul 21, 2024Updated last year
- Machine Learning Toolbox 2☆13Nov 22, 2025Updated 3 months ago
- A framework for few-shot evaluation of autoregressive language models.☆13Feb 14, 2024Updated 2 years ago
- Codebase, data and models for hallucination of pruned models☆16Jan 11, 2025Updated last year
- Tools for Optuna, MLflow and the integration of both.☆17May 28, 2023Updated 2 years ago
- Efficient Language Model Training through Cross-Lingual and Progressive Transfer Learning☆30Jan 25, 2023Updated 3 years ago
- Plan and train German transformer models.☆23Feb 22, 2021Updated 5 years ago
- A repository containing the code for translating popular LLM benchmarks to German.☆32Aug 20, 2023Updated 2 years ago
- Parkiet is a 1.6B parameter Dutch text-to-speech model (TTS)☆66Sep 30, 2025Updated 5 months ago
- A Python wrapper around HuggingFace's TGI (text-generation-inference) and TEI (text-embedding-inference) servers.☆32Sep 19, 2025Updated 5 months ago
- Speech synthesis from scratch☆11Nov 28, 2023Updated 2 years ago
- BLOOM+1: Adapting BLOOM model to support a new unseen language☆74Mar 2, 2024Updated 2 years ago
- Multilingual Large Language Models Evaluation Benchmark☆132Aug 21, 2024Updated last year
- EMNLP 2021 - Frustratingly Simple Pretraining Alternatives to Masked Language Modeling☆34Nov 21, 2021Updated 4 years ago
- An SDK and Library that is used in several Deutsche Telekom mobile apps☆12Sep 23, 2024Updated last year
- Extendable Scratch3 Programming Environment☆10Jan 24, 2026Updated last month
- A tokenizer and sentence splitter for German and English web and social media texts.☆153Dec 9, 2024Updated last year
- ☆11Nov 10, 2020Updated 5 years ago
- An example project that demonstrates the brand new and upcoming physics related features of the plugin.☆12Feb 7, 2026Updated 3 weeks ago
- Guide for the slp group on how to use the Grnet cluster☆11Apr 16, 2020Updated 5 years ago
- Speech-to-text GUI for different models (mostly Whisper, also Voxtral) and backends, including whisper.cpp, mlx-whisper, faster-whisper, …☆11Dec 7, 2025Updated 2 months ago
- XWikisCorpus, cross-lingual summarisation, multi-lingual summarisation, pre-trained language models, zero-shot and few-shot summarisation…☆10Nov 4, 2022Updated 3 years ago
- ☆12Feb 2, 2020Updated 6 years ago
- ☆10Oct 2, 2024Updated last year
- A shader that can run on the Minecraft Java Edition For Phone project, which uses GL4ES. This repository contains source code for iOS/i…☆14Aug 13, 2023Updated 2 years ago
- A web application for studying Ancient Greek texts with integrated lexical, syntactic, and morphological analysis tools.☆20Dec 1, 2025Updated 3 months ago
- competitive programming library☆10Feb 25, 2026Updated last week
- Using deep research workflow to generate datasets for finetuning LLMs.☆38Oct 9, 2025Updated 4 months ago
- distilled Self-Critique refines the outputs of an LLM with only synthetic data☆11Apr 11, 2024Updated last year
- ☆11May 5, 2022Updated 3 years ago
- PITS (Chinese, Japanese, English, Korean)☆12Mar 14, 2023Updated 2 years ago
- 💬📝 A small dictation app using OpenAI's Whisper speech recognition model.☆11Sep 13, 2024Updated last year
- This project is distributed as a free Unreal Engine plugin. It consists of a single C++ actor component that handles the playback of anim…☆12Mar 10, 2024Updated last year
- GraphOfDocs: Representing multiple documents as a single graph☆21Jun 22, 2022Updated 3 years ago
- ☆11Apr 6, 2021Updated 4 years ago
- Is BERT Robust to Label Noise? A Study on Learning with Noisy Labels in Text Classification☆10May 31, 2022Updated 3 years ago
- A .NET MAUI sample app implementing biometric login☆13Jan 11, 2025Updated last year
- Revamped: Hugo+LoveIt☆10Updated this week
- benchmarks for evaluating MT models☆11Jun 26, 2024Updated last year