zfgao66 / OPFLinks

Pytorch implementation of paper: Small Pre-trained Language Models Can be Fine-tuned as Large Models via Over-Parameterization.

☆12

Alternatives and similar repositories for OPF

Users that are interested in OPF are comparing it to the libraries listed below

Sorting:

ablghtianyi / ICL_Modular_Arithmetic
☆19Updated 3 months ago
NVlabs / PerAda
Repo for the paper: PerAda: Parameter-Efficient Federated Learning Personalization with Generalization Guarantees (CVPR 2024)
☆19Updated 10 months ago
VirtuosoResearch / Generalization-in-graph-neural-networks
Measuring generalization properties of graph neural networks
☆15Updated 2 years ago
Dousia / MetaPrompting
Code for COLING 2022 long paper: MetaPrompting: Learning to Learn Better Prompts
☆20Updated 2 years ago
yanyanSann / PromptTPP
PyTorch Implementation of Prompt-augmented Temporal Point Process for Streaming Event Sequence, NeurIPS 2023
☆14Updated last year
ml-energy / leaderboard
How much energy do GenAI models consume?
☆45Updated last month
Lingkai-Kong / so-ebm
Code for paper: End-to-end Stochastic Optimization with Energy-based Model
☆16Updated 2 years ago
hdong920 / GRIFFIN
☆37Updated 10 months ago
yifanycc / loretta
[NAACL 24 Oral] LoRETTA: Low-Rank Economic Tensor-Train Adaptation for Ultra-Low-Parameter Fine-Tuning of Large Language Models
☆35Updated 5 months ago
WANGXinyiLinda / concept-based-demonstration-selection
Offical code of the paper Large Language Models Are Implicitly Topic Models: Explaining and Finding Good Demonstrations for In-Context Le…
☆74Updated last year
andyjm3 / SLTrain
SLTrain: a sparse plus low-rank approach for parameter and memory efficient pretraining (NeurIPS 2024)
☆31Updated 7 months ago
amazon-science / street-reasoning
STREET: a multi-task and multi-step reasoning dataset
☆22Updated last year
SalesforceAIResearch / MobileAIBench
☆22Updated 5 months ago
algo-reasoning / algo-reasoning.github.io
Neural Algorithmic Reasoning Tutorial
☆12Updated 2 years ago
microsoft / reliableAI
☆46Updated last week
ma787639046 / bowdpr
[SIGIR24] Pre-training with Bag-of-Word Prediction for Dense Passage Retrieval
☆17Updated last year
danilonumeroso / conar
Official code for the paper `Neural Algorithmic Reasoning for Combinatorial Optimisation`
☆18Updated last year
thomasgauthier / LLM-self-play
Minimal implementation of the Self-Play Fine-Tuning Converts Weak Language Models to Strong Language Models paper (ArXiv 20232401.01335)
☆28Updated last year
casmlab / NPHardEval
Repository for NPHardEval, a quantified-dynamic benchmark of LLMs
☆54Updated last year
Scientific-Computing-Lab / MPI-rigen
MPI Code Generation through Domain-Specific Language Models
☆14Updated 7 months ago
nju-websoft / LKGE
Lifelong Embedding Learning and Transfer for Growing Knowledge Graphs, AAAI 2023
☆31Updated 2 years ago
Raiden-Zhu / ICML-2023-DSGD-and-SAM
[ICML 2023] Decentralized SGD and Average-direction SAM are Asymptotically Equivalent
☆19Updated last year
ExtensityAI / benchmark
Evaluation of neuro-symbolic engines
☆35Updated 10 months ago
zenrran4nlp / Awesome-LLM-Inference-Serving
☆36Updated last month
yecchen / MIRAI
Code and Data for "MIRAI: Evaluating LLM Agents for Event Forecasting"
☆65Updated 11 months ago
yanweiyue / GDesigner
☆39Updated 6 months ago
MurongYue / LLM_MoT_cascade
This is the implementation for the paper "LARGE LANGUAGE MODEL CASCADES WITH MIX- TURE OF THOUGHT REPRESENTATIONS FOR COST- EFFICIENT REA…
☆23Updated last year
antgroup / LLMOPT
The LLMOPT project offers a comprehensive set of resources, including the model, dataset, training framework, and inference code, enablin…
☆68Updated 2 months ago
dunzeng / MORE
Code for EMNLP'24 paper - On Diversified Preferences of Large Language Model Alignment
☆16Updated 10 months ago
awwang10 / llmpromptboosting
Accompanying code for "Boosted Prompt Ensembles for Large Language Models"
☆30Updated 2 years ago