EleutherAI / polyapproxLinks

Closed-form polynomial approximations to neural networks

☆13

Alternatives and similar repositories for polyapprox

Users that are interested in polyapprox are comparing it to the libraries listed below

Sorting:

EleutherAI / mdl
Minimum Description Length probing for neural network representations
☆18Updated 5 months ago
alon-albalak / FLAD
Few-shot Learning with Auxiliary Data
☆28Updated last year
AndyShih12 / LongHorizonTemperatureScaling
PyTorch implementation for "Long Horizon Temperature Scaling", ICML 2023
☆20Updated 2 years ago
UKPLab / on-emergence
Codes and files for the paper Are Emergent Abilities in Large Language Models just In-Context Learning
☆33Updated 6 months ago
rovle / gpt3-in-context-fitting
Experiments on GPT-3's ability to fit numerical models in-context.
☆14Updated 2 years ago
allenai / EmbeddingRecycling
Embedding Recycling for Language models
☆38Updated 2 years ago
HazyResearch / aioli
Aioli: A unified optimization framework for language model data mixing
☆27Updated 5 months ago
EleutherAI / rnngineering
Engineering the state of RNN language models (Mamba, RWKV, etc.)
☆32Updated last year
EleutherAI / features-across-time
Understanding how features learned by neural networks evolve throughout training
☆36Updated 8 months ago
IBM / ColPret
Efficient Scaling laws and collaborative pretraining.
☆16Updated 5 months ago
jmerullo / lm_vector_arithmetic
☆35Updated 2 years ago
orevaahia / magnet-tokenization
☆12Updated 7 months ago
sfeucht / footprints
https://footprints.baulab.info
☆17Updated 9 months ago
r-three / RAD
Reference implementation for Reward-Augmented Decoding: Efficient Controlled Text Generation With a Unidirectional Reward Model
☆43Updated last year
ssokota / mec
Code for minimum-entropy coupling.
☆32Updated last year
harish-kamath / rqae
Residual Quantization Autoencoder, used for interpreting LLMs
☆12Updated 6 months ago
HazyResearch / embroid
Embroid: Unsupervised Prediction Smoothing Can Improve Few-Shot Classification
☆11Updated last year
SuReLI / NeurOps
Implementations of growing and pruning in neural networks
☆22Updated last year
google-research-datasets / QAmeleon
QAmeleon introduces synthetic multilingual QA data using PaLM, a 540B large language model. This dataset was generated by prompt tuning P…
☆34Updated last year
formll / resolving-scaling-law-discrepancies
☆20Updated last year
SamsungSAILMontreal / nino
Code for "Accelerating Training with Neuron Interaction and Nowcasting Networks" [to appear at ICLR 2025]
☆19Updated last month
EleutherAI / training-jacobian
☆23Updated 7 months ago
ielab / Starbucks
Starbucks: Improved Training for 2D Matryoshka Embeddings
☆21Updated 2 weeks ago
luohongyin / EntST
Entailment self-training
☆25Updated 2 years ago
mcleish7 / gemstone-scaling-laws
☆27Updated 5 months ago
MeLeLBGU / SaGe
Code for SaGe subword tokenizer (EACL 2023)
☆25Updated 7 months ago
stanfordnlp / multi-distribution-retrieval
Code for our paper Resources and Evaluations for Multi-Distribution Dense Information Retrieval
☆15Updated last year
peterbhase / SLAG-Belief-Updating
Code for paper "Do Language Models Have Beliefs? Methods for Detecting, Updating, and Visualizing Model Beliefs"
☆28Updated 3 years ago
srush / tangent
Source-to-Source Debuggable Derivatives in Pure Python
☆15Updated last year
Leukas / CUTE
☆14Updated last month