bjoernpl / GermanBenchmarkLinks

A repository containing the code for translating popular LLM benchmarks to German.

☆26

Alternatives and similar repositories for GermanBenchmark

Users that are interested in GermanBenchmark are comparing it to the libraries listed below

Sorting:

bminixhofer / zett
Code for Zero-Shot Tokenizer Transfer
☆133Updated 6 months ago
EleutherAI / concept-erasure
Erasing concepts from neural representations with provable guarantees
☆230Updated 5 months ago
bjoernpl / lm-evaluation-harness-de
A framework for few-shot evaluation of autoregressive language models.
☆13Updated last year
sileod / tasksource
Datasets collection and preprocessings framework for NLP extreme multitask learning
☆184Updated last week
ZurichNLP / mbr
Minimum Bayes Risk Decoding for Hugging Face Transformers
☆58Updated last year
catie-aq / flashT5
A fast implementation of T5/UL2 in PyTorch using Flash Attention
☆105Updated 4 months ago
llm-efficiency-challenge / neurips_llm_efficiency_challenge
NeurIPS Large Language Model Efficiency Challenge: 1 LLM + 1GPU + 1Day
☆256Updated last year
huggingface / llm-swarm
Manage scalable open LLM inference endpoints in Slurm clusters
☆265Updated last year
nostalgebraist / transformer-utils
Utilities for the HuggingFace transformers library
☆69Updated 2 years ago
hadasah / btm
☆75Updated last year
booydar / babilong
BABILong is a benchmark for LLM evaluation using the needle-in-a-haystack approach.
☆206Updated 2 months ago
huggingface / olm-training
Repo for training MLMs, CLMs, or T5-type models on the OLM pretraining data, but it should work with any hugging face text dataset.
☆93Updated 2 years ago
huggingface / data-is-better-together
Let's build better datasets, together!
☆260Updated 6 months ago
KoyenaPal / future-lens
Code and Data Repo for the CoNLL Paper -- Future Lens: Anticipating Subsequent Tokens from a Single Hidden State
☆18Updated last year
babylm / evaluation-pipeline-2023
Evaluation pipeline for the BabyLM Challenge 2023.
☆76Updated last year
huggingface / that_is_good_data
☆66Updated last year
ARBORproject / arborproject.github.io
☆79Updated 4 months ago
HazyResearch / zoology
Understand and test language model architectures on synthetic tasks.
☆219Updated last month
epfml / llm-baselines
nanoGPT-like codebase for LLM training
☆100Updated 2 months ago
kernelmachine / cbtm
Code repository for the c-BTM paper
☆106Updated last year
guy-dar / embedding-space
☆54Updated 2 years ago
evandez / relations
How do transformer LMs encode relations?
☆50Updated last year
jonhue / activeft
PyTorch library for Active Fine-Tuning
☆87Updated 5 months ago
EleutherAI / delphi
Delphi was the home of a temple to Phoebus Apollo, which famously had the inscription, 'Know Thyself.' This library lets language models …
☆193Updated this week
imoneoi / multipack_sampler
Multipack distributed sampler for fast padding-free training of LLMs
☆195Updated 11 months ago
evandez / REMEDI
Inspecting and Editing Knowledge Representations in Language Models
☆116Updated last year
SALT-NLP / demonstrated-feedback
☆124Updated 9 months ago
huggingface / datablations
Scaling Data-Constrained Language Models
☆338Updated 2 weeks ago
melisa-writer / short-transformers
Prune transformer layers
☆69Updated last year
allenai / bff
☆38Updated last year