calum-bird / howmanyparams.com

Compute-optimal LLMs

☆11

Related projects

Alternatives and complementary repositories for howmanyparams.com

EleutherAI / mdl
Minimum Description Length probing for neural network representations
☆16 Updated last week
allenai / EmbeddingRecycling
Embedding Recycling for Language models
☆38 Updated last year
allenai / smashed
SMASHED is a toolkit designed to apply transformations to samples in datasets, such as fields extraction, tokenization, prompting, batchi…
☆31 Updated 5 months ago
peterbhase / SLAG-Belief-Updating
Code for paper "Do Language Models Have Beliefs? Methods for Detecting, Updating, and Visualizing Model Beliefs"
☆28 Updated 2 years ago
LauraRuis / do-pigs-fly
☆16 Updated last year
castorini / hf-spacerini
Plug-and-play Search Interfaces with Pyserini and Hugging Face
☆32 Updated last year
facebookresearch / lss_eval
This is a new metric that can be used to evaluate faithfulness of text generated by LLMs. The work behind this repository can be found he…
☆31 Updated last year
NathanGodey / headless-lm
Training and evaluation code for the paper "Headless Language Models: Learning without Predicting with Contrastive Weight Tying" (https:/…
☆23 Updated 7 months ago
gonglinyuan / metro_t0
Code repo for "Model-Generated Pretraining Signals Improves Zero-Shot Generalization of Text-to-Text Transformers" (ACL 2023)
☆22 Updated last year
ielab / Starbucks
Starbucks: Improved Training for 2D Matryoshka Embeddings
☆17 Updated last month
salesforce / simplification
☆20 Updated last year
allenai / sso
Repository for Skill Set Optimization
☆12 Updated 3 months ago
UKPLab / lagonn
Source code and data for Like a Good Nearest Neighbor
☆28 Updated 9 months ago
HazyResearch / embroid
Embroid: Unsupervised Prediction Smoothing Can Improve Few-Shot Classification
☆11 Updated last year
stanfordnlp / multi-distribution-retrieval
Code for our paper Resources and Evaluations for Multi-Distribution Dense Information Retrieval
☆14 Updated 10 months ago
luohongyin / EntST
Entailment self-training
☆25 Updated last year
google-research-datasets / swim-ir
SWIM-IR is a Synthetic Wikipedia-based Multilingual Information Retrieval training set with 28 million query-passage pairs spanning 33 la…
☆44 Updated last year
allenai / c4-documentation
☆24 Updated 3 years ago
trisongz / pylines
Simplifying parsing of large JSON Lines files in NLP workflows
☆12 Updated 2 years ago
google-research-datasets / QAmeleon
QAmeleon introduces synthetic multilingual QA data using PaLM, a 540B large language model. This dataset was generated by prompt tuning P…
☆34 Updated last year
stanfordnlp / ColBERT-QA
Code for Relevance-guided Supervision for OpenQA with ColBERT (TACL'21)
☆40 Updated 3 years ago
rycolab / probing-via-prompting
☆11 Updated 2 years ago
iantbutler01 / ditty
A library for simplifying fine-tuning with multi-GPU setups in the Hugging Face ecosystem.
☆15 Updated 3 weeks ago
r-three / RAD
Reference implementation for Reward-Augmented Decoding: Efficient Controlled Text Generation With a Unidirectional Reward Model
☆41 Updated 10 months ago
nlpie-research / Lightweight-Clinical-Transformers
This project develops compact transformer models tailored for clinical text analysis, balancing efficiency and performance for healthcare…
☆18 Updated 7 months ago
wangskyGit / passage-sieve
Official repo of the AAAI 2024 paper "Mitigating the Impact of False Negatives in Dense Retrieval with Contrastive Confidence Regularization"
☆13 Updated 10 months ago
aviaefrat / lmentry
☆11 Updated last year
CarperAI / squeakily
A library for squeakily cleaning and filtering language datasets.
☆45 Updated last year