An open-source implementation of Scaling Laws for Neural Language Models using nanoGPT
☆52Dec 8, 2023Updated 2 years ago
Alternatives and similar repositories for scaling_laws
Users that are interested in scaling_laws are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Your fruity companion for transformers☆14May 25, 2022Updated 4 years ago
- A toolkit for interpreting and analyzing neural networks (vision)☆34Jul 28, 2020Updated 5 years ago
- Defeasible Natural Language Inference☆13Dec 4, 2020Updated 5 years ago
- Official repository of Graph RAG-Tool Fusion and ToolLinkOS dataset.☆24Feb 13, 2025Updated last year
- LDPC codes for Illumina sequencing-based DNA storage☆11Dec 2, 2020Updated 5 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- An implementation of the paper Brain2Qwerty that translates brain EEG data into text for reading people's brains. There was no code so we…☆23Feb 9, 2025Updated last year
- Hierarchical Image Representation☆10Dec 9, 2023Updated 2 years ago
- Official implementation of 'A Large-Scale Exploration of mu-Transfer'☆32Jun 5, 2025Updated last year
- The corresponding code from our paper "Social Commonsense Reasoning with Multi-Head Knowledge Attention (EMNLP 2020)". Do not hesitate to…☆11Jun 12, 2022Updated 3 years ago
- an attempt to beat the shit out of gnulib☆11Sep 19, 2011Updated 14 years ago
- Simplistic Pytorch Implementation of the Dreamer-RL☆20May 7, 2025Updated last year
- Cross platform C++ threading library☆12Dec 11, 2015Updated 10 years ago
- Deep Reinforcement Learning for Dialogue Generation using SEQ2SEQ model☆11Feb 23, 2021Updated 5 years ago
- Python library for finding English tenses in sentences☆12Jun 9, 2019Updated 7 years ago
- Serverless GPU API endpoints on Runpod - Get Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- The sorce code for the realtime video descriptors: HOG, HOF, MBH and HMG☆10Feb 6, 2017Updated 9 years ago
- 逻辑回归和单层softmax的解析解☆12Jul 29, 2021Updated 4 years ago
- The benchmark for LLMs designed to tackle one of the most knowledge-intensive tasks in data science: writing feature engineering code, wh…☆21Oct 28, 2024Updated last year
- A+つくばは大学の課題を効率よく十分な品質で提出することができない (A+が取れない!!)問題を解決したい 同じ講義に知り合いが少ない筑波大生向けの筑波大生専用の匿名学習支援SNSです。☆11Nov 23, 2025Updated 6 months ago
- Small notebook to preprocess and evaluate images.☆14Nov 11, 2022Updated 3 years ago
- Study notebooks made for learning machine learning for the Hawk team☆11Oct 10, 2018Updated 7 years ago
- Semi-supervised spoken language understanding (SLU) via self-supervised speech and language model pretraining☆12Mar 23, 2021Updated 5 years ago
- SIMBL plugin to give windows some extra functionality on macOS☆10Aug 17, 2017Updated 8 years ago
- llms related stuff , including code, docs☆13Feb 25, 2025Updated last year
- GPUs on demand by Runpod - Special Offer Available • AdRun AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
- This is an interactive mock-up of the SpaceX Dragon 2 spacecraft's user interface. It contains 5 panels and multiple amusing features. A …☆11Jul 28, 2022Updated 3 years ago
- Source code for the ACL-IJCNLP 2021 paper entitled "T-DNA: Taming Pre-trained Language Models with N-gram Representations for Low-Resourc…☆19Jan 12, 2023Updated 3 years ago
- A Citation Manager and Zotero Integration for RemNote! Cite research all within your knowledge base!☆29Jan 22, 2026Updated 4 months ago
- I use various Data Science and machine learning techniques to analyze customer data using STP framework. I preprocessed the data, perform…☆11Apr 26, 2020Updated 6 years ago
- An efficient hierarchical Graph-based RAG☆41Nov 27, 2025Updated 6 months ago
- ☆18May 13, 2025Updated last year
- ☆34Sep 10, 2024Updated last year
- Skill-It! A Data-Driven Skills Framework for Understanding and Training Language Models☆48Oct 31, 2023Updated 2 years ago
- ☆14Mar 2, 2011Updated 15 years ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- LLM training in simple, raw C/CUDA☆15Dec 5, 2024Updated last year
- Python tools for VMD☆10Jun 3, 2018Updated 8 years ago
- ☆11Apr 23, 2023Updated 3 years ago
- Code for paper: KNN-BERT: Fine-Tuning Pre-Trained Models with KNN Classifier☆26Dec 5, 2021Updated 4 years ago
- Code Roberta version of RetroMAE: Pre-Training Retrieval-oriented Language Models Via Masked Auto-Encoder☆10Mar 16, 2023Updated 3 years ago
- Deluge plugin to stop torrents after seeding for a certain amount of time.☆12Jun 24, 2019Updated 6 years ago
- Implementation of PatchAIL in the ICLR 2023 paper <Visual Imitation with Patch Rewards>☆14Feb 15, 2023Updated 3 years ago