LEL-A/GerAlpacaDataCleaned

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/LEL-A/GerAlpacaDataCleaned)

LEL-A / GerAlpacaDataCleaned

German Alpaca Dataset (Cleaned + Translated)

☆26

Alternatives and similar repositories for GerAlpacaDataCleaned

Users that are interested in GerAlpacaDataCleaned are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

casszhao / PruneHall
View on GitHub
Codebase, data and models for hallucination of pruned models
☆16Jan 11, 2025Updated last year
bjoernpl / GermanBenchmark
View on GitHub
A repository containing the code for translating popular LLM benchmarks to German.
☆32Aug 20, 2023Updated 2 years ago
yamac-kurtulus / Windows-Docker-Images
View on GitHub
Some Windows images for tool images that I had to use in a Windows Environment.
☆10Sep 27, 2020Updated 5 years ago
stefan-it / europeana-bert
View on GitHub
BERT and ELECTRA models trained on Europeana Newspapers
☆39Dec 14, 2021Updated 4 years ago
bjoernpl / lm-evaluation-harness-de
View on GitHub
A framework for few-shot evaluation of autoregressive language models.
☆13Feb 14, 2024Updated 2 years ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
stefan-it / gc4lm
View on GitHub
GC4LM: A Colossal (Biased) language model for German
☆13May 2, 2021Updated 5 years ago
wangyu-ustc / LVChat
View on GitHub
The official implementation of the paper **LVChat: Facilitating Long Video Comprehension**
☆14Apr 15, 2024Updated 2 years ago
telekom / SmartCredentials-SDK-android
View on GitHub
An SDK and Library that is used in several Deutsche Telekom mobile apps
☆12Sep 23, 2024Updated last year
georgepar / grnet_guide
View on GitHub
Guide for the slp group on how to use the Grnet cluster
☆11Apr 16, 2020Updated 6 years ago
telekom / HPOflow
View on GitHub
Tools for Optuna, MLflow and the integration of both.
☆17May 28, 2023Updated 3 years ago
bigscience-workshop / multilingual-modeling
View on GitHub
BLOOM+1: Adapting BLOOM model to support a new unseen language
☆75Mar 2, 2024Updated 2 years ago
julien-nc / integration_suitecrm
View on GitHub
Integration of SuiteCRM into Nextcloud
☆19Nov 12, 2021Updated 4 years ago
mourga / transformer-uncertainty
View on GitHub
Code for evaluating uncertainty estimation methods for Transformer-based architectures in natural language understanding tasks.
☆44Aug 16, 2021Updated 4 years ago
proycon / spacy2folia
View on GitHub
Use spaCy for NLP and output to the FoLiA XML format.
☆12Feb 27, 2024Updated 2 years ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
IlyasMoutawwakil / py-txi
View on GitHub
A Python wrapper around HuggingFace's TGI (text-generation-inference) and TEI (text-embedding-inference) servers.
☆32Sep 19, 2025Updated 10 months ago
GermanT5 / wikipedia2corpus
View on GitHub
Wikipedia text corpus for self-supervised NLP model training
☆47Jul 17, 2022Updated 4 years ago
LaSTUS-TALN-UPF / TSAR-2022-Shared-Task
View on GitHub
TSAR2022 Shared Task on Lexical Simplification - Datasets and Evaluation scripts
☆10Oct 27, 2022Updated 3 years ago
masakhane-io / masakhanePreprocessor
View on GitHub
Building an effective preprocessing tool for African languages
☆13Jan 24, 2024Updated 2 years ago
nlp-stat-test / nlp-stat-test
View on GitHub
The NLPStatTest project
☆12Mar 12, 2022Updated 4 years ago
julmaxi / Abstractive-Timeline-Summarization
View on GitHub
☆11Dec 8, 2022Updated 3 years ago
BayBenj / english-syllabifier
View on GitHub
Tool for parsing English phonemes into syllables.
☆10Jan 15, 2018Updated 8 years ago
buschmo / Simple-German-Corpus
View on GitHub
Code to create the dataset from "A New Aligned Simple German Corpus
☆11Jan 8, 2024Updated 2 years ago
armbues / SiLLM-examples
View on GitHub
Examples for using the SiLLM framework for training and running Large Language Models (LLMs) on Apple Silicon
☆16May 8, 2025Updated last year
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
ad-si / adriansieber-com
View on GitHub
My website & blog with articles about coding, tech, functional programming, …
☆10Jun 12, 2026Updated last month
lt3 / nfr
View on GitHub
Neural Fuzzy Repair (NFR) is a data augmentation pipeline, which integrates fuzzy matches (i.e. similar translations) into neural machine…
☆12Aug 14, 2024Updated last year
abvijaykumar / rag-llamaindex-blog
View on GitHub
Source code used in the blog
☆12Feb 6, 2024Updated 2 years ago
NC0DER / GraphOfDocs
View on GitHub
GraphOfDocs: Representing multiple documents as a single graph
☆21Jun 22, 2022Updated 4 years ago
xcratch / xcratch.github.io
View on GitHub
Extendable Scratch3 Programming Environment
☆10Jul 6, 2026Updated 2 weeks ago
LibrAIResearch / libra-eval
View on GitHub
☆23May 20, 2025Updated last year
dennlinger / klexikon
View on GitHub
Klexikon: A German Dataset for Joint Summarization and Simplification
☆17Oct 5, 2022Updated 3 years ago
Reason-Wang / NAT
View on GitHub
[NAACL 2025] The official implementation of paper "Learning From Failure: Integrating Negative Examples when Fine-tuning Large Language M…
☆28Mar 14, 2024Updated 2 years ago
LuisaMaerz / KnowMAN
View on GitHub
KnowMAN: Weakly Supervised Multinomial Adversarial Networks
☆12Nov 9, 2021Updated 4 years ago
Serverless GPU API endpoints on Runpod - Get Bonus Credits • Ad
Skip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
Pleias / Pleias-Rag
View on GitHub
☆17Feb 25, 2025Updated last year
BramVanroy / spacy_download
View on GitHub
Download and load spaCy models on-the-fly
☆15Feb 9, 2023Updated 3 years ago
krangelie / bias-in-german-nlg
View on GitHub
Master thesis: Exploring bias in German NLG (GPT-3 & GerPT-2). Applies regard classification and bias mitigation triggers.
☆16Sep 25, 2024Updated last year
mithrendal / boostanista
View on GitHub
alternative remote for Lego Boost with Pythonista and iOS
☆10Aug 27, 2017Updated 8 years ago
informagi / GEEER
View on GitHub
Code supporting the paper Graph-Embedding Empowered Entity Retrieval
☆24Apr 11, 2025Updated last year
valentinhofmann / flota
View on GitHub
☆18Feb 1, 2023Updated 3 years ago
daniel-furman / polyglot-or-not
View on GitHub
Are foundation LMs multilingual knowledge bases? (EMNLP 2023)
☆18Dec 8, 2023Updated 2 years ago