Training and evaluation code for the paper "Headless Language Models: Learning without Predicting with Contrastive Weight Tying" (https://arxiv.org/abs/2309.08351)
☆29Apr 17, 2024Updated 2 years ago
Alternatives and similar repositories for headless-lm
Users that are interested in headless-lm are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- 🚀🤗 A collection of templates for Hugging Face Spaces☆34Oct 9, 2023Updated 2 years ago
- Python source code for EMNLP 2021 Findings paper: "Subword Mapping and Anchoring Across Languages".☆13Sep 17, 2021Updated 4 years ago
- ☆16Jun 14, 2024Updated 2 years ago
- A fully-fledge PyTorch package for Morphological Analysis, tailored to morphologically rich and historical languages.☆25Oct 27, 2023Updated 2 years ago
- Terminal UI for monitoring SLURM jobs☆15Mar 29, 2026Updated 3 months ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- ☆10Oct 2, 2024Updated last year
- A software for transferring pre-trained English models to foreign languages☆20Mar 20, 2023Updated 3 years ago
- mPLM-Sim: Better Cross-Lingual Similarity and Transfer in Multilingual Pretrained Language Models☆11Jan 19, 2024Updated 2 years ago
- Goldfish: Monolingual language models for 350 languages.☆27Mar 4, 2026Updated 4 months ago
- ☆10Oct 15, 2019Updated 6 years ago
- [ACL 2025] 🔍 Multilingual Evaluation of English-Centric LLMs via Cross-Lingual Alignment☆11Apr 6, 2025Updated last year
- T-Projection is a method to perform high-quality Annotation Projection of Sequence Labeling datasets.☆13Nov 21, 2023Updated 2 years ago
- PyTorch implementation of the Flash Spectral Transform Unit.☆22Sep 19, 2024Updated last year
- Minimal code to train ELMo models in recent versions of TensorFlow☆14Jun 16, 2026Updated 2 weeks ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- ☆12Mar 15, 2024Updated 2 years ago
- Resources related to EMNLP 2021 paper "FAME: Feature-Based Adversarial Meta-Embeddings for Robust Input Representations"☆13Dec 14, 2021Updated 4 years ago
- This repository contains the sample code to benchmark popular time series forecast algorithms using Gluonts in AWS Sagemaker Notebook Ins…☆13Jul 26, 2021Updated 4 years ago
- An opinionated NLP research template☆10Aug 29, 2024Updated last year
- Set-Equivariant Deep Learning Models☆22Dec 23, 2021Updated 4 years ago
- A Python database interface for eXist-db☆15May 2, 2026Updated 2 months ago
- Use sync mode Playwright interactively, inside a Jupyter notebook☆19May 20, 2026Updated last month
- Exploring Few-Shot Adaptation of Language Models with Tables☆25Aug 22, 2022Updated 3 years ago
- A neural network that jointly part-of-speech tags and lemmatizes sentences, boosting accuracy for morphologically-rich languages (Czech, …☆34Apr 5, 2019Updated 7 years ago
- GPUs on demand by Runpod - Special Offer Available • AdRun AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
- Repo for training MLMs, CLMs, or T5-type models on the OLM pretraining data, but it should work with any hugging face text dataset.☆98Feb 9, 2023Updated 3 years ago
- Experiments for XLM-V Transformers Integeration☆13Feb 8, 2023Updated 3 years ago
- Lowering PyTorch's Memory Consumption for Selective Differentiation☆12Aug 29, 2024Updated last year
- Code for AAAI 2023 Paper : “Alignment-Enriched Tuning for Patch-Level Pre-trained Document Image Models”☆18Dec 6, 2022Updated 3 years ago
- Trully flash implementation of DeBERTa disentangled attention mechanism.☆89Feb 10, 2026Updated 4 months ago
- HuCit KB: a knowledge base of classical texts and citable text units.☆11Nov 17, 2021Updated 4 years ago
- Repo of the Turing's Humanities & Data Science Discussion Group☆13Jul 21, 2022Updated 3 years ago
- Small python package to measure OCR quality and other related metrics.☆27Feb 19, 2024Updated 2 years ago
- Generate BERT vocabularies and pretraining examples from Wikipedias☆17May 11, 2020Updated 6 years ago
- Open source password manager - Proton Pass • AdSecurely store, share, and autofill your credentials with Proton Pass, the end-to-end encrypted password manager trusted by millions.
- ✂️ Sentence segmentation with wtpsplit's state-of-the-art Segment any Text (SaT) models☆39May 2, 2026Updated 2 months ago
- DPO, but faster 🚀☆52Dec 6, 2024Updated last year
- ☆24Jan 30, 2020Updated 6 years ago
- Repo for ICML23 "Why do Nearest Neighbor Language Models Work?"☆59Jan 12, 2023Updated 3 years ago
- Overview of corpora/datasets for Germanic low-resource languages and dialects. Accompanies "A Survey of Corpora for Germanic Low-Resource…☆28Feb 16, 2026Updated 4 months ago
- Scripts to convert datasets from various sources to Hugging Face Datasets.☆57Oct 26, 2022Updated 3 years ago
- ☆22Dec 15, 2023Updated 2 years ago