KentoNishi / awesome-all-you-need-papers
A list of all "all you need" papers. Updated daily using the arXiv API.
☆92 · Updated this week
Alternatives and similar repositories for awesome-all-you-need-papers:
Users who are interested in awesome-all-you-need-papers are comparing it to the libraries listed below:
- NeuMeta transforms neural networks by allowing a single model to adapt on the fly to different sizes, generating the right weights when n… ☆42 · Updated 5 months ago
- Official source code for "Graph Neural Networks for Learning Equivariant Representations of Neural Networks". In ICLR 2024 (oral). ☆79 · Updated 9 months ago
- Implementation of the general framework for AMIE, from the paper "Towards Conversational Diagnostic AI", out of Google DeepMind ☆60 · Updated 7 months ago
- ☆118 · Updated 8 months ago
- Summarizing Mean Review Score for All Submissions for a Conference hosted on OpenReview ☆22 · Updated last year
- Access LaTeX source of any arxiv.org paper directly on Overleaf ☆187 · Updated 10 months ago
- Plug in & Play PyTorch Implementation of the paper: "Evolutionary Optimization of Model Merging Recipes" by Sakana AI ☆30 · Updated 5 months ago
- Implementation of Mind Evolution, Evolving Deeper LLM Thinking, from DeepMind ☆48 · Updated 2 months ago
- Explorations into adversarial losses on top of autoregressive loss for language modeling ☆35 · Updated 2 months ago
- Description and applications of OpenAI's paper about DALL-E (2021) and implementation of other (CLIP-guided) zero-shot text-to-image gene… ☆33 · Updated 2 years ago
- Multi-Layer Key-Value sharing experiments on Pythia models ☆32 · Updated 10 months ago
- Video descriptions of research papers relating to foundation models and scaling ☆30 · Updated 2 years ago
- Revisiting Efficient Training Algorithms For Transformer-based Language Models (NeurIPS 2023) ☆80 · Updated last year
- Conference schedule, top papers, and analysis of the data for NeurIPS 2023! ☆119 · Updated last year
- Timm model explorer ☆39 · Updated last year
- This is a PyTorch implementation of the paper "ViP: A Differentially Private Foundation Model for Computer Vision" ☆36 · Updated last year
- Official code for the paper "Image generation with shortest path diffusion" accepted at ICML 2023. ☆23 · Updated last year
- The official repository for HyperZ⋅Z⋅W Operator Connects Slow-Fast Networks for Full Context Interaction. ☆36 · Updated 3 weeks ago
- Code for “Pretrained Language Models as Visual Planners for Human Assistance” ☆60 · Updated last year
- Awesome Learn From Model Beyond Fine-Tuning: A Survey ☆62 · Updated 4 months ago
- OpenReview Submission Visualization (ICLR 2024/2025) ☆152 · Updated 6 months ago
- [NeurIPS 2024] A Novel Rank-Based Metric for Evaluating Large Language Models ☆45 · Updated 5 months ago
- Code for "Don't trust your eyes: on the (un)reliability of feature visualizations" (ICML 2024) ☆32 · Updated last year
- A single repo with all scripts and utils to train / fine-tune the Mamba model with or without FIM ☆54 · Updated last year
- Challenge LLMs to Reason About Reasoning: A Benchmark to Unveil Cognitive Depth in LLMs ☆45 · Updated 9 months ago
- A lightweight implementation of the ICCV 2023 paper "Reinforce Data, Multiply Impact: Improved Model Accuracy and Robustness with Dataset Rei… ☆79 · Updated last year
- ☆22 · Updated 3 months ago
- Repository for the code and dataset for the paper: "Have LLMs Advanced enough? Towards Harder Problem Solving Benchmarks For Large Langu… ☆39 · Updated last year
- [NeurIPS 2024] Can LLMs Learn by Teaching for Better Reasoning? A Preliminary Study ☆49 · Updated 5 months ago
- Documentation, notes, links, etc. for streams. ☆78 · Updated last year