krandiash / quinineLinks

A library to create and manage configuration files, especially for machine learning projects.

☆79

Alternatives and similar repositories for quinine

Users that are interested in quinine are comparing it to the libraries listed below

Sorting:

guy-dar / embedding-space
☆54Updated 2 years ago
zphang / minimal-opt
☆67Updated 2 years ago
nostalgebraist / transformer-utils
Utilities for the HuggingFace transformers library
☆70Updated 2 years ago
HomebrewML / HomebrewNLP-torch
A case study of efficient training of large language models using commodity hardware.
☆68Updated 3 years ago
ethancaballero / broken_neural_scaling_laws
Code Release for "Broken Neural Scaling Laws" (BNSL) paper
☆59Updated last year
google-research / cascades
Python library which enables complex compositions of language models such as scratchpads, chain of thought, tool use, selection-inference…
☆207Updated 2 months ago
hadasah / btm
☆75Updated last year
RobertCsordas / transformer_generalization
The official repository for our paper "The Devil is in the Detail: Simple Tricks Improve Systematic Generalization of Transformers". We s…
☆67Updated 2 years ago
TomFrederik / unseal
Mechanistic Interpretability for Transformer Models
☆51Updated 3 years ago
McGill-NLP / length-generalization
Code for the paper "The Impact of Positional Encoding on Length Generalization in Transformers", NeurIPS 2023
☆136Updated last year
EleutherAI / semantic-memorization
☆44Updated 8 months ago
lucidrains / memory-editable-transformer
My explorations into editing the knowledge and memories of an attention network
☆35Updated 2 years ago
peterbhase / SLAG-Belief-Updating
Code for paper "Do Language Models Have Beliefs? Methods for Detecting, Updating, and Visualizing Model Beliefs"
☆28Updated 3 years ago
allenai / bff
☆39Updated last year
princeton-nlp / TransformerPrograms
[NeurIPS 2023] Learning Transformer Programs
☆162Updated last year
mlfoundations / scaling
Language models scale reliably with over-training and on downstream tasks
☆97Updated last year
google-deepmind / transformer_grammars
Transformer Grammars: Augmenting Transformer Language Models with Syntactic Inductive Biases at Scale, TACL (2022)
☆127Updated last month
google-research / jestimator
Amos optimizer with JEstimator lib.
☆82Updated last year
tau-nlp / scrolls
The official code of EMNLP 2022, "SCROLLS: Standardized CompaRison Over Long Language Sequences".
☆70Updated last year
rovle / gpt3-in-context-fitting
Experiments on GPT-3's ability to fit numerical models in-context.
☆14Updated 2 years ago
r2llab / wrangl
Parallel data preprocessing for NLP and ML.
☆34Updated 9 months ago
HomebrewML / Olmax
HomebrewNLP in JAX flavour for maintable TPU-Training
☆50Updated last year
ClashLuke / tpucare
Automatically take good care of your preemptible TPUs
☆36Updated 2 years ago
google-deepmind / emergent_in_context_learning
☆84Updated last year
CarperAI / autocrit
A repository for transformer critique learning and generation
☆90Updated last year
TristanThrush / perplexity-correlations
Simple and scalable tools for data-driven pretraining data selection.
☆24Updated last month
NohTow / PPL-MCTS
Repository for the code of the "PPL-MCTS: Constrained Textual Generation Through Discriminator-Guided Decoding" paper, NAACL'22
☆66Updated 2 years ago
HendrikStrobelt / LMdiff
A diff tool for language models
☆43Updated last year
jxiw / BiGS
Official Repository of Pretraining Without Attention (BiGS), BiGS is the first model to achieve BERT-level transfer learning on the GLUE …
☆114Updated last year
srush / do-we-need-attention
☆166Updated 2 years ago