An approximate implementation of the OpenAI paper - An Empirical Model of Large-Batch Training for MNIST
☆11Nov 19, 2022Updated 3 years ago
Alternatives and similar repositories for An-Empirical-Model-of-Large-Batch-Training
Users that are interested in An-Empirical-Model-of-Large-Batch-Training are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Implementation of OpenAI paper with Simple Noise Scale on Fastai V2☆19Apr 16, 2021Updated 5 years ago
- Implementation of the paper "Opcodes as predictor for malware " by Daniel Bilar☆11Oct 17, 2020Updated 5 years ago
- Yali Zhang, Haifan Yin, Weidong Li, Emil Björnson, Mérouane Debbah, "Port-LLM: A Port Prediction Method for Fluid Antenna based on Large …☆16Sep 1, 2025Updated 9 months ago
- Thesis recurrence about Channel Estimation for One-Bit Multiuser Massive MIMO Using Conditional GAN☆10Oct 14, 2022Updated 3 years ago
- Research without Re-search: Maximal Update Parametrization Yields Accurate Loss Prediction across Scales☆32Jul 17, 2023Updated 2 years ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- cs142, web application☆11Jun 24, 2017Updated 8 years ago
- ☆14Jun 29, 2023Updated 2 years ago
- Code for paper 'Are We Falling in a Middle-Intelligence Trap? An Analysis and Mitigation of the Reversal Curse'☆14Aug 2, 2024Updated last year
- A social networking website in Django☆26Aug 17, 2022Updated 3 years ago
- Using FlexAttention to compute attention with different masking patterns☆47Sep 22, 2024Updated last year
- PrecoderNet: Hybrid Beamforming for Millimeter Wave Systems with Deep Reinforcement Learning☆25Dec 26, 2022Updated 3 years ago
- DeciMamba: Exploring the Length Extrapolation Potential of Mamba (ICLR 2025)☆32Apr 9, 2025Updated last year
- ☆10Oct 8, 2018Updated 7 years ago
- A re-implementation of the "Red Teaming Language Models with Language Models" paper by Perez et al., 2022☆34Oct 9, 2023Updated 2 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- ☆22Dec 5, 2022Updated 3 years ago
- Toy implementations of some popular ML optimizers using Python/JAX☆44Jun 20, 2021Updated 4 years ago
- A re-implementation of the "Extracting Training Data from Large Language Models" paper by Carlini et al., 2020☆39Jul 10, 2022Updated 3 years ago
- Experiments Notebook of "Understanding the Skill Gap in Recurrent Language Models: The Role of the Gather-and-Aggregate Mechanism"☆15Apr 30, 2025Updated last year
- Official Code Repository for DDAT: Diffusion Policies Enforcing Dynamically Admissible Robot Trajectories☆29Sep 9, 2025Updated 8 months ago
- Reasoning-based Evaluation and Ranking of Translations.☆20Jul 18, 2025Updated 10 months ago
- manipulating cointegrated pairs to achieve a market-neutral strategy that outperforms indices☆11Jan 12, 2021Updated 5 years ago
- The repository is for Reinforcement-Learning Uncertainty research, in which we investigate various uncertain factors in RL.☆23Jun 16, 2023Updated 2 years ago
- coloring terminal text with intensities (used for plotting probability, entropy with tokens)☆12Oct 11, 2024Updated last year
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- IIG-RL-Benchmark is a library for training and evaluating game theoretical or deep RL algorithms on OpenSpiel games.☆25Nov 18, 2025Updated 6 months ago
- A few models converted from caffe to CoreMLs format.☆15Jun 6, 2017Updated 8 years ago
- Scratchpad/Chain-of-Thought Prompts☆12Jun 6, 2022Updated 3 years ago
- Combining SOAP and MUON☆22Feb 11, 2025Updated last year
- Code to reproduce key results accompanying "SAEs (usually) Transfer Between Base and Chat Models"☆13Jul 18, 2024Updated last year
- This repository contains the code for Diversity Control (DiCo), a novel method to constrain behavioral diversity in multi-agent reinforce…☆31Dec 21, 2024Updated last year
- ☆17Dec 11, 2024Updated last year
- ☆21Mar 13, 2024Updated 2 years ago
- Longitudinal Evaluation of LLMs via Data Compression☆33May 29, 2024Updated 2 years ago
- GPUs on demand by Runpod - Special Offer Available • AdRun AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
- [NeurIPS 2022] "Losses Can Be Blessings: Routing Self-Supervised Speech Representations Towards Efficient Multilingual and Multitask Spee…☆17Sep 19, 2023Updated 2 years ago
- Mantella spell mod for Skyrim VR / AE / SE☆17Apr 21, 2026Updated last month
- ☆15Mar 2, 2025Updated last year
- Variational autoencoder for single cell RNA-seq datasets☆44Aug 15, 2017Updated 8 years ago
- 2021 Spring☆18Oct 12, 2024Updated last year
- ☆21Nov 26, 2022Updated 3 years ago
- A Statistical Arbitrage Strategy to trade Cryptocurrency Pairs☆13Nov 6, 2020Updated 5 years ago