StonyBrookNLP / ireneLinks

[ACL 2021] IrEne: Interpretable Energy Prediction for Transformers

☆10

Alternatives and similar repositories for irene

Users that are interested in irene are comparing it to the libraries listed below

Sorting:

lucidrains / memory-editable-transformer
My explorations into editing the knowledge and memories of an attention network
☆35Updated 2 years ago
google-research-datasets / QAmeleon
QAmeleon introduces synthetic multilingual QA data using PaLM, a 540B large language model. This dataset was generated by prompt tuning P…
☆34Updated last year
lucidrains / tableformer-pytorch
Implementation of TableFormer, Robust Transformer Modeling for Table-Text Encoding, in Pytorch
☆39Updated 3 years ago
IST-DASLab / SparseFinetuning
Repository for Sparse Finetuning of LLMs via modified version of the MosaicML llmfoundry
☆42Updated last year
RAIVNLab / MatFormer-OLMo
Code repository for the public reproduction of the language modelling experiments on "MatFormer: Nested Transformer for Elastic Inference…
☆22Updated last year
amirzandieh / HyperAttention
Triton Implementation of HyperAttention Algorithm
☆48Updated last year
google-deepmind / asyncdiloco
☆44Updated last year
HazyResearch / embroid
Embroid: Unsupervised Prediction Smoothing Can Improve Few-Shot Classification
☆11Updated last year
HazyResearch / prefix-linear-attention
☆54Updated 10 months ago
RobertCsordas / moe
Official repository for the paper "Approximating Two-Layer Feedforward Networks for Efficient Transformers"
☆37Updated last year
philschmid / optimum-static-quantization
☆28Updated 2 years ago
frankxwang / dpo-prefix-sharing
DPO, but faster 🚀
☆42Updated 6 months ago
crypdick / timm-lr-scheduler-explorer
A dashboard for exploring timm learning rate schedulers
☆19Updated 6 months ago
microsoft / AutoMoE
AutoMoE: Neural Architecture Search for Efficient Sparsely Activated Transformers
☆46Updated 2 years ago
withmartian / leaderboard-backend
Open sourced backend for Martian's LLM Inference Provider Leaderboard
☆18Updated 9 months ago
reddragon / efficient-dl-survey-paper
Efficient Deep Learning Survey Paper
☆33Updated 2 years ago
srush / transformers-bet
☆12Updated 3 years ago
stanford-crfm / helm-efficiency
☆9Updated last year
IST-DASLab / QIGen
Repository for CPU Kernel Generation for LLM Inference
☆26Updated last year
locuslab / scaling_laws_data_filtering
☆64Updated last year
yidingjiang / ado
The repository contains code for Adaptive Data Optimization
☆24Updated 5 months ago
aniquetahir / JORA
JORA: JAX Tensor-Parallel LoRA Library (ACL 2024)
☆33Updated last year
allenai / sso
Repository for Skill Set Optimization
☆13Updated 10 months ago
kyo-takano / chinchilla
A toolkit for scaling law research ⚖
☆49Updated 4 months ago
mayank31398 / ladder-residual-inference
☆13Updated 3 weeks ago
allenai / bff
☆38Updated last year
siyan-zhao / prepacking
The source code of our work "Prepacking: A Simple Method for Fast Prefilling and Increased Throughput in Large Language Models" [AISTATS …
☆59Updated 7 months ago
google-research / precondition
☆31Updated last month
Upaya07 / NeurIPS-llm-efficiency-challenge
Code for NeurIPS LLM Efficiency Challenge
☆59Updated last year
renll / SeqBoat
[NeurIPS 2023] Sparse Modular Activation for Efficient Sequence Modeling
☆36Updated last year