babylm/evaluation-pipeline-2025

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/babylm/evaluation-pipeline-2025)

babylm / evaluation-pipeline-2025

☆26

Alternatives and similar repositories for evaluation-pipeline-2025

Users that are interested in evaluation-pipeline-2025 are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

babylm / evaluation-pipeline-2024
View on GitHub
The evaluation pipeline for the 2024 BabyLM Challenge.
☆34Nov 13, 2024Updated last year
babylm / babylm.github.io
View on GitHub
☆16Updated this week
jannik-brinkmann / multilingual-features
View on GitHub
Code for the paper "Large Language Models Share Representations of Latent Grammatical Concepts Across Typologically Diverse Languages" (N…
☆17Apr 13, 2025Updated last year
tommccoy1 / inductive-bias-distillation
View on GitHub
☆22Apr 5, 2026Updated 3 months ago
catherinearnett / morphscore
View on GitHub
This is the repository for MorphScore, a tokenizer evaluation framework for morphological alignment.
☆17Jul 10, 2025Updated last year
Bare Metal GPUs on DigitalOcean Gradient AI • Ad
Purpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
michahu / pre-pretraining
View on GitHub
Accelerate pretraining by pre-pretraining on formal languages!
☆20Feb 13, 2026Updated 5 months ago
yangyuan / brown-clustering
View on GitHub
Brown clustering in Python
☆22Dec 12, 2017Updated 8 years ago
google-deepmind / agent_debugger
View on GitHub
Causal Analysis of Agent Behavior for AI Safety
☆21Jun 27, 2023Updated 3 years ago
rycolab / prefix-parsing
View on GitHub
☆14Feb 1, 2024Updated 2 years ago
IBM / learn-vector-symbolic-architectures-rule-formulations
View on GitHub
PyTorch Implementation of the paper "Probabilistic Abduction for Visual Abstract Reasoning via Learning Rules in Vector-symbolic Architec…
☆10Sep 18, 2025Updated 10 months ago
FAIRNS / Number_and_syntax_units_in_LSTM_LMs
View on GitHub
☆10Jun 19, 2019Updated 7 years ago
verypluming / HELP
View on GitHub
HELP: a dataset for Handling Entailments with Lexical and logical Phenomena (Ver.1.0)
☆15Jul 20, 2023Updated 3 years ago
kr-ramesh / synthtexteval
View on GitHub
SynthTextEval: A Toolkit for Generating and Evaluating Synthetic Data For High-Stakes Domains (EMNLP 2025 System Demonstration)
☆27Nov 3, 2025Updated 8 months ago
UKPLab / AdaSent
View on GitHub
This repository contains the code for the EMNLP'23 paper "AdaSent: Efficient Domain-Adapted Sentence Embeddings for Few-Shot Classificati…
☆16Jun 3, 2024Updated 2 years ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
jkallini / mission-impossible-language-models
View on GitHub
Code repository for the paper "Mission: Impossible Language Models."
☆56Sep 25, 2025Updated 9 months ago
facebookresearch / multiloko
View on GitHub
A benchmark with locally sourced multilingual questions for 31 languages.
☆18May 13, 2026Updated 2 months ago
DIPSAS / DockerBuildManagement
View on GitHub
Build Management is a python application, installed with pip. The application makes it easy to manage a build system based on Docker by c…
☆14Sep 22, 2021Updated 4 years ago
netique / corona
View on GitHub
☆10Sep 11, 2020Updated 5 years ago
SrishtiGautam / ProtoVAE
View on GitHub
☆16Jun 8, 2023Updated 3 years ago
ltgoslo / factorizer
View on GitHub
☆16May 14, 2024Updated 2 years ago
i-machine-think / awesome-compositionality
View on GitHub
A list of resources dedicated to compositionality
☆14Feb 21, 2019Updated 7 years ago
LAAC-LSCP / ChildProject
View on GitHub
Python package for the management of day-long recordings of children.
☆16Jun 29, 2026Updated 3 weeks ago
srush / tangent
View on GitHub
Source-to-Source Debuggable Derivatives in Pure Python
☆15Jan 23, 2024Updated 2 years ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
lhoangan / template-uva-thesis
View on GitHub
PhD thesis template with title page according to the University of Amsterdam.
☆16Sep 12, 2021Updated 4 years ago
aaronmueller / clams
View on GitHub
Syntactic evaluation sets, attribute-varying grammars, and code for replicating the CLAMS paper. ACL 2020.
☆17Nov 26, 2024Updated last year
babylm / evaluation-pipeline-2023
View on GitHub
Evaluation pipeline for the BabyLM Challenge 2023.
☆77Oct 18, 2023Updated 2 years ago
ChenghaoMou / embeddings
View on GitHub
zero-vocab or low-vocab embeddings
☆18Jul 17, 2022Updated 4 years ago
beinborn / brain-lang
View on GitHub
Code for processing brain data
☆12Apr 5, 2019Updated 7 years ago
jennhu / lm-pragmatics
View on GitHub
Code and data for "A fine-grained comparison of pragmatic language understanding in humans and language models"
☆11Dec 14, 2022Updated 3 years ago
0xnurl / mdlrnn-torch
View on GitHub
Minimum Description Length Recurrent Neural Networks (MDLRNNs) in PyTorch
☆22May 6, 2025Updated last year
aryamanarora / causalgym
View on GitHub
CausalGym: Benchmarking causal interpretability methods on linguistic tasks
☆54Nov 30, 2024Updated last year
ufal / multilexnorm2021
View on GitHub
MultiLexNorm 2021 competition system from ÚFAL
☆16Dec 30, 2021Updated 4 years ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
edchengg / easyproject
View on GitHub
ACL 2023 (Findings) End-to-end Cross-lingual Label Project
☆15Nov 24, 2023Updated 2 years ago
Wickstrom / RELAX
View on GitHub
Code for RELAX, a framework for explaining representations.
☆12Jan 7, 2024Updated 2 years ago
mattynaz / latex-notes
View on GitHub
A LaTeX document class for notes 📝 and textbooks 📚
☆14Jul 14, 2021Updated 5 years ago
ltgoslo / NorQuAD
View on GitHub
Norwegian question answering dataset
☆15Feb 3, 2024Updated 2 years ago
utilForever / HellSolver
View on GitHub
Helltaker simulator using C++ with some reinforcement learning
☆16Jan 5, 2021Updated 5 years ago
epierson9 / print_google_calendar_availability
View on GitHub
Python script which prints out a summary of your free slots from your Google calendar(s) so you can paste into a scheduling email.
☆42Oct 28, 2022Updated 3 years ago
YuchenJin / llm.c
View on GitHub
LLM training in simple, raw C/CUDA
☆15Dec 5, 2024Updated last year