zfgao66 / OPFLinks
Pytorch implementation of paper: Small Pre-trained Language Models Can be Fine-tuned as Large Models via Over-Parameterization.
☆12Updated 2 years ago
Alternatives and similar repositories for OPF
Users that are interested in OPF are comparing it to the libraries listed below
Sorting:
- ☆19Updated 3 months ago
- Repo for the paper: PerAda: Parameter-Efficient Federated Learning Personalization with Generalization Guarantees (CVPR 2024)☆19Updated 10 months ago
- Measuring generalization properties of graph neural networks☆15Updated 2 years ago
- Code for COLING 2022 long paper: MetaPrompting: Learning to Learn Better Prompts☆20Updated 2 years ago
- PyTorch Implementation of Prompt-augmented Temporal Point Process for Streaming Event Sequence, NeurIPS 2023☆14Updated last year
- How much energy do GenAI models consume?☆45Updated last month
- Code for paper: End-to-end Stochastic Optimization with Energy-based Model☆16Updated 2 years ago
- ☆37Updated 10 months ago
- [NAACL 24 Oral] LoRETTA: Low-Rank Economic Tensor-Train Adaptation for Ultra-Low-Parameter Fine-Tuning of Large Language Models☆35Updated 5 months ago
- Offical code of the paper Large Language Models Are Implicitly Topic Models: Explaining and Finding Good Demonstrations for In-Context Le…☆74Updated last year
- SLTrain: a sparse plus low-rank approach for parameter and memory efficient pretraining (NeurIPS 2024)☆31Updated 7 months ago
- STREET: a multi-task and multi-step reasoning dataset☆22Updated last year
- ☆22Updated 5 months ago
- Neural Algorithmic Reasoning Tutorial☆12Updated 2 years ago
- ☆46Updated last week
- [SIGIR24] Pre-training with Bag-of-Word Prediction for Dense Passage Retrieval☆17Updated last year
- Official code for the paper `Neural Algorithmic Reasoning for Combinatorial Optimisation`☆18Updated last year
- Minimal implementation of the Self-Play Fine-Tuning Converts Weak Language Models to Strong Language Models paper (ArXiv 20232401.01335)☆28Updated last year
- Repository for NPHardEval, a quantified-dynamic benchmark of LLMs☆54Updated last year
- MPI Code Generation through Domain-Specific Language Models☆14Updated 7 months ago
- Lifelong Embedding Learning and Transfer for Growing Knowledge Graphs, AAAI 2023☆31Updated 2 years ago
- [ICML 2023] Decentralized SGD and Average-direction SAM are Asymptotically Equivalent☆19Updated last year
- Evaluation of neuro-symbolic engines☆35Updated 10 months ago
- ☆36Updated last month
- Code and Data for "MIRAI: Evaluating LLM Agents for Event Forecasting"☆65Updated 11 months ago
- ☆39Updated 6 months ago
- This is the implementation for the paper "LARGE LANGUAGE MODEL CASCADES WITH MIX- TURE OF THOUGHT REPRESENTATIONS FOR COST- EFFICIENT REA…☆23Updated last year
- The LLMOPT project offers a comprehensive set of resources, including the model, dataset, training framework, and inference code, enablin…☆68Updated 2 months ago
- Code for EMNLP'24 paper - On Diversified Preferences of Large Language Model Alignment☆16Updated 10 months ago
- Accompanying code for "Boosted Prompt Ensembles for Large Language Models"☆30Updated 2 years ago