Evaluation pipeline for the BabyLM Challenge 2023.
☆78Oct 18, 2023Updated 2 years ago
Alternatives and similar repositories for evaluation-pipeline-2023
Users that are interested in evaluation-pipeline-2023 are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- LTG-Bert☆34Jan 8, 2024Updated 2 years ago
- The evaluation pipeline for the 2024 BabyLM Challenge.☆33Nov 13, 2024Updated last year
- 어느 고등학생 의 심플한 확률론적 앵무새 만들기☆19Sep 2, 2023Updated 2 years ago
- MG top-down beam parsing☆13Jul 2, 2018Updated 7 years ago
- ☆12Mar 7, 2022Updated 4 years ago
- GPUs on demand by Runpod - Special Offer Available • AdRun AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
- Source-to-Source Debuggable Derivatives in Pure Python☆15Jan 23, 2024Updated 2 years ago
- several algorithms for converting dependency structures into constituency structures.☆10Feb 7, 2022Updated 4 years ago
- Source code for CoNLL 2021 paper by Huebner et al. 2021☆21Jul 13, 2023Updated 2 years ago
- Unsupervised Cross-lingual Sentiment Analysis (CoNLL 2019)☆10Nov 4, 2019Updated 6 years ago
- Embroid: Unsupervised Prediction Smoothing Can Improve Few-Shot Classification☆11Aug 12, 2023Updated 2 years ago
- Triton Implementation of HyperAttention Algorithm☆48Dec 11, 2023Updated 2 years ago
- Code for the paper "Stack Attention: Improving the Ability of Transformers to Model Hierarchical Patterns"☆18Mar 15, 2024Updated 2 years ago
- Code for the paper "A Multi-lingual Multi-task Architecture for Low-resource Sequence Labeling" (ACL2018)☆29Nov 6, 2019Updated 6 years ago
- Distillation of Ensemble Dependency Parsers into a Single Graph-Based Parser☆11Oct 14, 2016Updated 9 years ago
- Bare Metal GPUs on DigitalOcean Gradient AI • AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- Forcing Diffuse Distributions out of Language Models☆18Sep 10, 2024Updated last year
- Dependency Parsing as Sequence Labeling with Python3+ and PyTorch1+ and MTL☆10Nov 21, 2019Updated 6 years ago
- Parsing only with Pretraining Networks☆16Jul 25, 2024Updated last year
- ☆14Aug 18, 2022Updated 3 years ago
- Implementation of Hyena Hierarchy in JAX☆10Apr 30, 2023Updated 2 years ago
- Code for pre-training BabyLM baseline models.☆16Jun 19, 2023Updated 2 years ago
- ☆26Nov 23, 2023Updated 2 years ago
- Official repository for the paper "Exploring the Promise and Limits of Real-Time Recurrent Learning" (ICLR 2024)☆13Jun 11, 2025Updated 10 months ago
- Code for "Theoretical Foundations of Deep Selective State-Space Models" (NeurIPS 2024)☆16Jan 7, 2025Updated last year
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Advanced Formal Language Theory (263-5352-00L; Frühjahr 2023)☆10Feb 21, 2023Updated 3 years ago
- Universal Neurons in GPT2 Language Models☆30May 28, 2024Updated last year
- Learning to Model Editing Processes☆26Aug 3, 2025Updated 8 months ago
- Behavioral probing of language acquisition models at the lexical and syntactic level☆19Jul 17, 2023Updated 2 years ago
- ☆13Apr 15, 2024Updated 2 years ago
- We investigated corruption robustness across different architectures including Convolutional Neural Networks, Vision Transformers, and th…☆16Oct 28, 2021Updated 4 years ago
- ☆20May 30, 2024Updated last year
- Will send the same request to one or more sources to exchange cost for reduced latency for inference☆11Dec 17, 2024Updated last year
- Blog post☆17Feb 16, 2024Updated 2 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Source code for the paper "Prefix Language Models are Unified Modal Learners"☆44Apr 30, 2023Updated 2 years ago
- ☆21Dec 9, 2016Updated 9 years ago
- ☆33Dec 9, 2022Updated 3 years ago
- ☆83Apr 16, 2024Updated 2 years ago
- Code and data related to "Efficient, Compositional, Order-Sensitive n-gram Embeddings" (EACL 2017)☆15Apr 6, 2017Updated 9 years ago
- Code for the paper "Multi-Task Learning for Domain-General Spoken Disfluency Detection in Dialogue Systems" (Igor Shalyminov, Arash Eshgh…☆23Dec 8, 2022Updated 3 years ago
- 32 times longer context window than vanilla Transformers and up to 4 times longer than memory efficient Transformers.☆50Jun 16, 2023Updated 2 years ago