Training and evaluation code for the paper "Headless Language Models: Learning without Predicting with Contrastive Weight Tying" (https://arxiv.org/abs/2309.08351)
β29Apr 17, 2024Updated 2 years ago
Alternatives and similar repositories for headless-lm
Users that are interested in headless-lm are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ππ€ A collection of templates for Hugging Face Spacesβ34Oct 9, 2023Updated 2 years ago
- Python source code for EMNLP 2021 Findings paper: "Subword Mapping and Anchoring Across Languages".β13Sep 17, 2021Updated 4 years ago
- β16Jun 14, 2024Updated last year
- A fully-fledge PyTorch package for Morphological Analysis, tailored to morphologically rich and historical languages.β24Oct 27, 2023Updated 2 years ago
- β10Oct 2, 2024Updated last year
- 1-Click AI Models by DigitalOcean Gradient β’ AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- [ICASSP 2025] AnCoGen: Analysis, Control and Generation of Speech with a Masked Autoencoderβ14Mar 11, 2025Updated last year
- β19Apr 26, 2026Updated last week
- A software for transferring pre-trained English models to foreign languagesβ19Mar 20, 2023Updated 3 years ago
- Lyrics crawling, pre-processing, embedding generation, model training, and lyrics generation - all in one toolβ14Nov 4, 2018Updated 7 years ago
- Goldfish: Monolingual language models for 350 languages.β24Mar 4, 2026Updated 2 months ago
- β10Oct 15, 2019Updated 6 years ago
- [ACL 2025] π Multilingual Evaluation of English-Centric LLMs via Cross-Lingual Alignmentβ11Apr 6, 2025Updated last year
- PyTorch implementation of the Flash Spectral Transform Unit.β22Sep 19, 2024Updated last year
- Minimal code to train ELMo models in recent versions of TensorFlowβ14Apr 30, 2023Updated 3 years ago
- Wordpress hosting with auto-scaling - Free Trial Offer β’ AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- Material for a course on Advanced NLPβ16Jul 22, 2025Updated 9 months ago
- β11Mar 15, 2024Updated 2 years ago
- Resources related to EMNLP 2021 paper "FAME: Feature-Based Adversarial Meta-Embeddings for Robust Input Representations"β13Dec 14, 2021Updated 4 years ago
- This repository contains the sample code to benchmark popular time series forecast algorithms using Gluonts in AWS Sagemaker Notebook Insβ¦β13Jul 26, 2021Updated 4 years ago
- An opinionated NLP research templateβ10Aug 29, 2024Updated last year
- Set-Equivariant Deep Learning Modelsβ22Dec 23, 2021Updated 4 years ago
- A Python database interface for eXist-dbβ15Updated this week
- A neural network that jointly part-of-speech tags and lemmatizes sentences, boosting accuracy for morphologically-rich languages (Czech, β¦β34Apr 5, 2019Updated 7 years ago
- Repo for training MLMs, CLMs, or T5-type models on the OLM pretraining data, but it should work with any hugging face text dataset.β97Feb 9, 2023Updated 3 years ago
- Serverless GPU API endpoints on Runpod - Get Bonus Credits β’ AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- Experiments for XLM-V Transformers Integerationβ13Feb 8, 2023Updated 3 years ago
- Lowering PyTorch's Memory Consumption for Selective Differentiationβ12Aug 29, 2024Updated last year
- The source code for the TIRA Shared Task Platformβ17Updated this week
- Code for AAAI 2023 Paper : βAlignment-Enriched Tuning for Patch-Level Pre-trained Document Image Modelsββ18Dec 6, 2022Updated 3 years ago
- A Smalltalk Web Browser for Squeak/Smalltalkβ17Apr 18, 2022Updated 4 years ago
- Trully flash implementation of DeBERTa disentangled attention mechanism.β87Feb 10, 2026Updated 2 months ago
- Small python package to measure OCR quality and other related metrics.β27Feb 19, 2024Updated 2 years ago
- Generate BERT vocabularies and pretraining examples from Wikipediasβ17May 11, 2020Updated 5 years ago
- Generate audio datasets for training Text-To-Speech models, through smart audio splitting with silence detection, and transcription usingβ¦β30May 27, 2023Updated 2 years ago
- Open source password manager - Proton Pass β’ AdSecurely store, share, and autofill your credentials with Proton Pass, the end-to-end encrypted password manager trusted by millions.
- βοΈ Sentence segmentation with wtpsplit's state-of-the-art Segment any Text (SaT) modelsβ38Updated this week
- β24Jan 30, 2020Updated 6 years ago
- β75Jul 2, 2021Updated 4 years ago
- Overview of corpora/datasets for Germanic low-resource languages and dialects. Accompanies "A Survey of Corpora for Germanic Low-Resourceβ¦β27Feb 16, 2026Updated 2 months ago
- Cog wrapper for collabora/WhisperSpeechβ25Mar 5, 2024Updated 2 years ago
- Code for the paper "Getting the most out of your tokenizer for pre-training and domain adaptation"β22Feb 14, 2024Updated 2 years ago
- β22Dec 15, 2023Updated 2 years ago