bjoernpl / GermanBenchmark
A repository containing the code for translating popular LLM benchmarks to German.
☆25Updated last year
Alternatives and similar repositories for GermanBenchmark
Users that are interested in GermanBenchmark are comparing it to the libraries listed below
Sorting:
- A framework for few-shot evaluation of autoregressive language models.☆13Updated last year
- Minimum Bayes Risk Decoding for Hugging Face Transformers☆58Updated 11 months ago
- Code for Zero-Shot Tokenizer Transfer☆127Updated 4 months ago
- ☆38Updated last year
- Evaluation pipeline for the BabyLM Challenge 2023.☆76Updated last year
- ☆72Updated last year
- ☆72Updated last year
- How do transformer LMs encode relations?☆48Updated last year
- Spherical Merge Pytorch/HF format Language Models with minimal feature loss.☆121Updated last year
- Official Code for M-RᴇᴡᴀʀᴅBᴇɴᴄʜ: Evaluating Reward Models in Multilingual Settings (ACL 2025 Main)☆28Updated this week
- [Data + code] ExpertQA : Expert-Curated Questions and Attributed Answers☆128Updated last year
- Repository for "I am a Strange Dataset: Metalinguistic Tests for Language Models"☆44Updated last year
- Manage scalable open LLM inference endpoints in Slurm clusters☆256Updated 10 months ago
- Code and Data Repo for the CoNLL Paper -- Future Lens: Anticipating Subsequent Tokens from a Single Hidden State☆18Updated last year
- [ICLR 2023] Guess the Instruction! Flipped Learning Makes Language Models Stronger Zero-Shot Learners☆115Updated 8 months ago
- A collection of datasets for language model pretraining including scripts for downloading, preprocesssing, and sampling.☆59Updated 9 months ago
- Repo accompanying our paper "Do Llamas Work in English? On the Latent Language of Multilingual Transformers".☆73Updated last year
- Language models scale reliably with over-training and on downstream tasks☆97Updated last year
- ☆54Updated last year
- Exploring finetuning public checkpoints on filter 8K sequences on Pile☆115Updated 2 years ago
- Multilingual Large Language Models Evaluation Benchmark☆123Updated 8 months ago
- ☆65Updated last year
- PyTorch library for Active Fine-Tuning☆72Updated 3 months ago
- ☆97Updated 2 years ago
- Code repository for the c-BTM paper☆106Updated last year
- ☆120Updated 7 months ago
- Repo for training MLMs, CLMs, or T5-type models on the OLM pretraining data, but it should work with any hugging face text dataset.☆93Updated 2 years ago
- Inspecting and Editing Knowledge Representations in Language Models☆116Updated last year
- ☆57Updated this week
- Code for NeurIPS LLM Efficiency Challenge☆58Updated last year