Training and evaluation code for the paper "Headless Language Models: Learning without Predicting with Contrastive Weight Tying" (https://arxiv.org/abs/2309.08351)
★29 · Apr 17, 2024 · Updated 2 years ago
Alternatives and similar repositories for headless-lm
Users who are interested in headless-lm are comparing it to the libraries listed below.
- A collection of templates for Hugging Face Spaces ★34 · Oct 9, 2023 · Updated 2 years ago
- ★16 · Jun 14, 2024 · Updated 2 years ago
- A fully fledged PyTorch package for Morphological Analysis, tailored to morphologically rich and historical languages. ★25 · Oct 27, 2023 · Updated 2 years ago
- ★10 · Oct 2, 2024 · Updated last year
- [ICASSP 2025] AnCoGen: Analysis, Control and Generation of Speech with a Masked Autoencoder ★14 · Mar 11, 2025 · Updated last year
- ★20 · Apr 26, 2026 · Updated last month
- Software for transferring pre-trained English models to foreign languages ★20 · Mar 20, 2023 · Updated 3 years ago
- mPLM-Sim: Better Cross-Lingual Similarity and Transfer in Multilingual Pretrained Language Models ★11 · Jan 19, 2024 · Updated 2 years ago
- Evaluate your models with A/B test experiments ★14 · Jan 5, 2023 · Updated 3 years ago
- Goldfish: Monolingual language models for 350 languages. ★26 · Mar 4, 2026 · Updated 3 months ago
- ★10 · Oct 15, 2019 · Updated 6 years ago
- An open source platform for browser based speech and audio subjective quality tests. ★39 · Updated this week
- [ACL 2025] Multilingual Evaluation of English-Centric LLMs via Cross-Lingual Alignment ★11 · Apr 6, 2025 · Updated last year
- [WACV 2024] Official implementation of the paper: Semantic Generative Augmentations for Few-shot Counting ★13 · May 1, 2024 · Updated 2 years ago
- PyTorch implementation of the Flash Spectral Transform Unit. ★22 · Sep 19, 2024 · Updated last year
- Extension for pie to include taggers with their models and pre/postprocessors ★11 · May 30, 2024 · Updated 2 years ago
- Material for a course on Advanced NLP ★17 · Jul 22, 2025 · Updated 10 months ago
- ★12 · Mar 15, 2024 · Updated 2 years ago
- Resources related to EMNLP 2021 paper "FAME: Feature-Based Adversarial Meta-Embeddings for Robust Input Representations" ★13 · Dec 14, 2021 · Updated 4 years ago
- This repository contains the sample code to benchmark popular time series forecast algorithms using Gluonts in AWS Sagemaker Notebook Ins… ★13 · Jul 26, 2021 · Updated 4 years ago
- An opinionated NLP research template ★10 · Aug 29, 2024 · Updated last year
- Set-Equivariant Deep Learning Models ★22 · Dec 23, 2021 · Updated 4 years ago
- Use sync mode Playwright interactively, inside a Jupyter notebook ★19 · May 20, 2026 · Updated 3 weeks ago
- Exploring Few-Shot Adaptation of Language Models with Tables ★24 · Aug 22, 2022 · Updated 3 years ago
- A neural network that jointly part-of-speech tags and lemmatizes sentences, boosting accuracy for morphologically-rich languages (Czech, … ★34 · Apr 5, 2019 · Updated 7 years ago
- Repo for training MLMs, CLMs, or T5-type models on the OLM pretraining data, but it should work with any Hugging Face text dataset. ★97 · Feb 9, 2023 · Updated 3 years ago
- Experiments for XLM-V Transformers Integration ★13 · Feb 8, 2023 · Updated 3 years ago
- Lowering PyTorch's Memory Consumption for Selective Differentiation ★12 · Aug 29, 2024 · Updated last year
- The source code for the TIRA Shared Task Platform ★17 · Updated this week
- A Smalltalk Web Browser for Squeak/Smalltalk ★18 · Apr 18, 2022 · Updated 4 years ago
- Truly flash implementation of DeBERTa disentangled attention mechanism. ★89 · Feb 10, 2026 · Updated 4 months ago
- HuCit KB: a knowledge base of classical texts and citable text units. ★11 · Nov 17, 2021 · Updated 4 years ago
- Small Python package to measure OCR quality and other related metrics. ★27 · Feb 19, 2024 · Updated 2 years ago
- Repo of the Turing's Humanities & Data Science Discussion Group ★13 · Jul 21, 2022 · Updated 3 years ago
- ★19 · Oct 7, 2021 · Updated 4 years ago
- An extension of the Transformers library to include a T5ForSequenceClassification class. ★40 · Apr 17, 2023 · Updated 3 years ago
- Sentence segmentation with wtpsplit's state-of-the-art Segment any Text (SaT) models ★39 · May 2, 2026 · Updated last month
- DPO, but faster ★52 · Dec 6, 2024 · Updated last year
- Repo for ICML23 "Why do Nearest Neighbor Language Models Work?" ★59 · Jan 12, 2023 · Updated 3 years ago