HendrikStrobelt / LMdiff
A diff tool for language models
β42Updated 10 months ago
Related projects β
Alternatives and complementary repositories for LMdiff
- Google's BigBird (Jax/Flax & PyTorch) @ π€Transformersβ47Updated last year
- β67Updated 2 years ago
- β73Updated 3 years ago
- Repo for training MLMs, CLMs, or T5-type models on the OLM pretraining data, but it should work with any hugging face text dataset.β92Updated last year
- Embedding Recycling for Language modelsβ38Updated last year
- Helper scripts and notes that were used while porting various nlp modelsβ44Updated 2 years ago
- β46Updated this week
- Implementation of Marge, Pre-training via Paraphrasing, in Pytorchβ75Updated 3 years ago
- Shared code for training sentence embeddings with Flax / JAXβ27Updated 3 years ago
- Open source library for few shot NLPβ77Updated last year
- Simple-to-use scoring function for arbitrarily tokenized texts.β32Updated 3 weeks ago
- Apps built using Inspired Cognition's Critique.β58Updated last year
- β16Updated last year
- Official code and model checkpoints for our EMNLP 2022 paper "RankGen - Improving Text Generation with Large Ranking Models" (https://arxβ¦β136Updated last year
- Official repository with code and data accompanying the NAACL 2021 paper "Hurdles to Progress in Long-form Question Answering" (https://aβ¦β46Updated 2 years ago
- Code and Data for Evaluation WGβ41Updated 2 years ago
- The Python library with command line tools to interact with Dynabench(https://dynabench.org/), such as uploading models.β55Updated 2 years ago
- β21Updated 3 years ago
- Our open source implementation of MiniLMv2 (https://aclanthology.org/2021.findings-acl.188)β60Updated last year
- β44Updated last year
- Ranking of fine-tuned HF models as base models.β35Updated last year
- Generate BERT vocabularies and pretraining examples from Wikipediasβ18Updated 4 years ago
- β95Updated last year
- Minimum Bayes Risk Decoding for Hugging Face Transformersβ56Updated 5 months ago
- QAmeleon introduces synthetic multilingual QA data using PaLM, a 540B large language model. This dataset was generated by prompt tuning Pβ¦β34Updated last year
- β97Updated 2 years ago
- A library to create and manage configuration files, especially for machine learning projects.β77Updated 2 years ago
- A highly sophisticated sequence-to-sequence model for code generationβ40Updated 3 years ago
- A case study of efficient training of large language models using commodity hardware.β68Updated 2 years ago
- β76Updated 11 months ago