An open-source implementation of Scaling Laws for Neural Language Models using nanoGPT
☆52Dec 8, 2023Updated 2 years ago
Alternatives and similar repositories for scaling_laws
Users that are interested in scaling_laws are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Your fruity companion for transformers☆14May 25, 2022Updated 4 years ago
- Score LLM pretraining data with classifiers☆55Nov 2, 2023Updated 2 years ago
- [NeurIPS'24 Spotlight] Observational Scaling Laws☆61Oct 2, 2024Updated last year
- Error correction scheme for storing information on DNA☆13Jul 27, 2017Updated 8 years ago
- An implementation of the paper Brain2Qwerty that translates brain EEG data into text for reading people's brains. There was no code so we…☆23Feb 9, 2025Updated last year
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Gradient-Free Textual Inversion for Personalized Text-to-Image Generation☆43Jan 23, 2023Updated 3 years ago
- Official implementation of 'A Large-Scale Exploration of mu-Transfer'☆32Jun 5, 2025Updated last year
- Comparing attention-based convolutional and recurrent neural networks under adversarial attacks to investigate their success and limitati…☆10Aug 24, 2018Updated 7 years ago
- Simplistic Pytorch Implementation of the Dreamer-RL☆20May 7, 2025Updated last year
- A dataset of real DNA traces for benchmarking trace reconstruction algorithms☆23Nov 18, 2024Updated last year
- Experiments on the impact of depth in transformers and SSMs.☆41Oct 23, 2025Updated 8 months ago
- [Preprint] On the Effectiveness of Mitigating Data Poisoning Attacks with Gradient Shaping☆10Feb 27, 2020Updated 6 years ago
- ☆17Apr 23, 2026Updated 2 months ago
- ☆61Dec 6, 2024Updated last year
- Deploy open-source AI quickly and easily - Special Bonus Offer • AdRunpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
- ☆23Sep 6, 2021Updated 4 years ago
- A+つくばは大学の課題を効率よく十分な品質で提出することができない (A+が取れない!!)問題を解決したい 同じ講義に知り合いが少ない筑波大生向けの筑波大生専用の匿名学習支援SNSです。☆11Nov 23, 2025Updated 7 months ago
- SIMBL plugin to give windows some extra functionality on macOS☆10Aug 17, 2017Updated 8 years ago
- llms related stuff , including code, docs☆13Feb 25, 2025Updated last year
- This is an interactive mock-up of the SpaceX Dragon 2 spacecraft's user interface. It contains 5 panels and multiple amusing features. A …☆11Jul 28, 2022Updated 3 years ago
- A Citation Manager and Zotero Integration for RemNote! Cite research all within your knowledge base!☆29Jan 22, 2026Updated 5 months ago
- ☆10Dec 6, 2022Updated 3 years ago
- Implementation of paper "Transferring Robustness for Graph Neural Network Against Poisoning Attacks".☆20Feb 26, 2020Updated 6 years ago
- ☆18May 13, 2025Updated last year
- Serverless GPU API endpoints on Runpod - Get Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- ☆13Nov 28, 2018Updated 7 years ago
- ☆33Sep 10, 2024Updated last year
- Skill-It! A Data-Driven Skills Framework for Understanding and Training Language Models☆48Oct 31, 2023Updated 2 years ago
- FormulaOne: A dataset of algorithmic problems based on MSO formulas.☆25Mar 1, 2026Updated 3 months ago
- Benchmarking data and script used for LLM multi-agent collaboration systems from AWS Bedrock Agents Science team.☆18Dec 10, 2024Updated last year
- Code for paper: KNN-BERT: Fine-Tuning Pre-Trained Models with KNN Classifier☆26Dec 5, 2021Updated 4 years ago
- Notionに毎日新しいarXiv論文のアブストラクト日本語訳 + αを表示するスクリプト☆12Jan 22, 2023Updated 3 years ago
- Code Roberta version of RetroMAE: Pre-Training Retrieval-oriented Language Models Via Masked Auto-Encoder☆10Mar 16, 2023Updated 3 years ago
- Inverse Scaling in Test-Time Compute☆25Dec 3, 2025Updated 6 months ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- Implementation of PatchAIL in the ICLR 2023 paper <Visual Imitation with Patch Rewards>☆14Feb 15, 2023Updated 3 years ago
- https://robotmlcourse.github.io/SP20/index.html☆14Aug 27, 2020Updated 5 years ago
- Official PyTorch implementation of The Linear Attention Resurrection in Vision Transformer☆15Sep 7, 2024Updated last year
- ☆18Sep 29, 2020Updated 5 years ago
- ☆51Jan 18, 2024Updated 2 years ago
- ☆13Jul 4, 2020Updated 5 years ago
- Continual Memorization of Factoids in Large Language Models☆12Nov 20, 2024Updated last year