Thytu / StockLLM
Elevating Chess Strategy with Fine-Tuned Large Language Model
☆13Updated last year
Alternatives and similar repositories for StockLLM
Users that are interested in StockLLM are comparing it to the libraries listed below
- Grokking on modular arithmetic in less than 150 epochs in MLX☆14Updated 9 months ago
- Visualising Losses in Deep Neural Networks☆16Updated last year
- A collection of optimisers for use with candle☆36Updated 2 months ago
- Repository containing the SPIN experiments on the DIBT 10k ranked prompts☆24Updated last year
- get token info from all dex so that you can get most healthy pool☆5Updated 5 months ago
- This repository has code for fine-tuning LLMs with GRPO specifically for Rust Programming using cargo as feedback☆100Updated 4 months ago
- ☆48Updated 10 months ago
- GoldFinch and other hybrid transformer components☆46Updated last year
- ☆11Updated last year
- DeMo: Decoupled Momentum Optimization☆189Updated 7 months ago
- Understanding how features learned by neural networks evolve throughout training☆36Updated 9 months ago
- Simple GRPO scripts and configurations.☆59Updated 5 months ago
- Fast, Modern, and Low Precision PyTorch Optimizers☆99Updated last week
- RL significantly improves the reasoning capability of Qwen2.5-1.5B-Instruct☆29Updated 5 months ago
- Implementation of OpenAI paper with Simple Noise Scale on Fastai V2☆19Updated 4 years ago
- ☆23Updated 7 months ago
- Official implementation for the paper "Can Large Reasoning Models Self-Train?"☆53Updated last month
- Tiled Flash Linear Attention library for fast and efficient mLSTM Kernels.☆64Updated 3 weeks ago
- ☆9Updated 2 years ago
- MergeBench: A Benchmark for Merging Domain-Specialized LLMs☆17Updated 2 months ago
- train entropix like a champ!☆20Updated 9 months ago
- An approximate implementation of the OpenAI paper - An Empirical Model of Large-Batch Training for MNIST☆11Updated 2 years ago
- ☆29Updated last week
- Implementation of 2-simplicial attention proposed by Clift et al. (2019) and the recent attempt to make practical in Fast and Simplex, Ro…☆40Updated last week
- ☆27Updated last year
- Tiny re-implementation of MDM in style of LLaDA and nano-gpt speedrun☆55Updated 4 months ago
- Triton Implementation of HyperAttention Algorithm☆48Updated last year
- 👷 Build compute kernels☆78Updated this week
- Modded vLLM to run pipeline parallelism over public networks☆37Updated 2 months ago
- An implementation of PPO in Pytorch☆93Updated last month