babylm / evaluation-pipeline-2023View external linksLinks
Evaluation pipeline for the BabyLM Challenge 2023.
☆77Oct 18, 2023Updated 2 years ago
Alternatives and similar repositories for evaluation-pipeline-2023
Users that are interested in evaluation-pipeline-2023 are comparing it to the libraries listed below
Sorting:
- LTG-Bert☆34Jan 8, 2024Updated 2 years ago
- Source code for CoNLL 2021 paper by Huebner et al. 2021☆20Jul 13, 2023Updated 2 years ago
- several algorithms for converting dependency structures into constituency structures.☆10Feb 7, 2022Updated 4 years ago
- ☆17Mar 17, 2023Updated 2 years ago
- PyTorch implementation for PaLM: A Hybrid Parser and Language Model.☆10Jan 7, 2020Updated 6 years ago
- Distillation of Ensemble Dependency Parsers into a Single Graph-Based Parser☆11Oct 14, 2016Updated 9 years ago
- Embroid: Unsupervised Prediction Smoothing Can Improve Few-Shot Classification☆11Aug 12, 2023Updated 2 years ago
- Implementation of Hyena Hierarchy in JAX☆10Apr 30, 2023Updated 2 years ago
- Advanced Formal Language Theory (263-5352-00L; Frühjahr 2023)☆10Feb 21, 2023Updated 2 years ago
- Triton Implementation of HyperAttention Algorithm☆48Dec 11, 2023Updated 2 years ago
- Apps that run on modal.com☆13Sep 14, 2025Updated 5 months ago
- ☆11Mar 18, 2024Updated last year
- A 20M RWKV v6 can do nonogram☆14Oct 18, 2024Updated last year
- Utilities for Training Very Large Models☆58Sep 25, 2024Updated last year
- Code for the paper "Stack Attention: Improving the Ability of Transformers to Model Hierarchical Patterns"☆18Mar 15, 2024Updated last year
- ☆12Mar 7, 2022Updated 3 years ago
- ☆15Aug 18, 2022Updated 3 years ago
- ☆13Apr 15, 2024Updated last year
- The substitution of qsub.☆12Jan 25, 2019Updated 7 years ago
- We investigated corruption robustness across different architectures including Convolutional Neural Networks, Vision Transformers, and th…☆16Oct 28, 2021Updated 4 years ago
- Forcing Diffuse Distributions out of Language Models☆18Sep 10, 2024Updated last year
- Scripts and applications for autonomously controlling the Crazyflie using camera/Kinect on a host☆12Feb 16, 2022Updated 4 years ago
- Parsing only with Pretraining Networks☆16Jul 25, 2024Updated last year
- Curse-of-memory phenomenon of RNNs in sequence modelling☆19May 8, 2025Updated 9 months ago
- 方便扩展的Cuda算子理解和优化框架,仅用在学习使用☆18Jun 13, 2024Updated last year
- ☆20Jan 23, 2024Updated 2 years ago
- Parallel Associative Scan for Language Models☆18Jan 8, 2024Updated 2 years ago
- Self-Distillation with weighted ground-truth targets; ResNet and Kernel Ridge Regression☆19Oct 12, 2021Updated 4 years ago
- A Kernel-Based View of Language Model Fine-Tuning https://arxiv.org/abs/2210.05643☆78Sep 4, 2023Updated 2 years ago
- ☆20May 30, 2024Updated last year
- Learning to Model Editing Processes☆26Aug 3, 2025Updated 6 months ago
- Python implementation of paper "AntisymmetricRNN: A Dynamical System View on Recurrent Neural Networks"☆15Aug 2, 2019Updated 6 years ago
- Efficient PScan implementation in PyTorch☆17Jan 2, 2024Updated 2 years ago
- AGaLiTe: Approximate Gated Linear Transformers for Online Reinforcement Learning (Published in TMLR)☆23Oct 15, 2024Updated last year
- Code for paper: End-to-end Stochastic Optimization with Energy-based Model☆16Feb 14, 2023Updated 3 years ago
- Official Repository for Efficient Linear-Time Attention Transformers.☆18Jun 2, 2024Updated last year
- SRL4ORL: Improving Opinion Role Labeling Using Multi-Task Learning With Semantic Role Labeling☆14Oct 10, 2018Updated 7 years ago
- ☆82Apr 16, 2024Updated last year
- [CVPR 2024] This repository includes the official implementation our paper "Revisiting Adversarial Training at Scale"☆20Apr 21, 2024Updated last year