Thytu / StockLLM
Elevating Chess Strategy with Fine-Tuned Large Language Model
☆16 · Updated last year
Alternatives and similar repositories for StockLLM
Users who are interested in StockLLM are comparing it to the repositories listed below.
- Visualising Losses in Deep Neural Networks ☆16 · Updated last year
- Teaching transformers to play chess ☆142 · Updated 9 months ago
- ☆48 · Updated last year
- ☆28 · Updated last year
- Understanding how features learned by neural networks evolve throughout training ☆39 · Updated last year
- Exploring an idea where one forgets about efficiency and carries out attention across each edge of the nodes (tokens) ☆55 · Updated 7 months ago
- Simple GRPO scripts and configurations. ☆59 · Updated 9 months ago
- Simple repository for training small reasoning models ☆45 · Updated 9 months ago
- Synthetic data generation and benchmark implementation for "Episodic Memories Generation and Evaluation Benchmark for Large Language Mode… ☆59 · Updated last month
- A fast, local, and secure approach for training LLMs for coding tasks using GRPO with WebAssembly and interpreter feedback. ☆40 · Updated 7 months ago
- Deep learning library implemented from scratch in numpy. Mixtral, Mamba, LLaMA, GPT, ResNet, and other experiments. ☆53 · Updated last year
- Training framework with a goal to explore the frontier of sample efficiency of small language models ☆78 · Updated last week
- Read and write tensorboard data using Rust ☆23 · Updated last year
- ☆58 · Updated this week
- Grokking on modular arithmetic in less than 150 epochs in MLX ☆14 · Updated last year
- Same as llm.c but in Rust, as I want to get deeper and deeper into Rust programming ☆67 · Updated 10 months ago
- Utilities for Training Very Large Models ☆58 · Updated last year
- SMIT: A Simple Modality Integration Tool ☆15 · Updated last year
- A RL env with procedurally generated symbolic reasoning data ☆29 · Updated last month
- DiffuLab is designed to provide a simple and flexible way to train diffusion models while allowing full customization of its core compone… ☆40 · Updated last week
- ☆24 · Updated 11 months ago
- some mixture of experts architecture implementations ☆22 · Updated last year
- This repository has code for fine-tuning LLMs with GRPO specifically for Rust Programming using cargo as feedback ☆111 · Updated 8 months ago
- ☆65 · Updated 8 months ago
- ☆62 · Updated last year
- Official repository for "BLEUBERI: BLEU is a surprisingly effective reward for instruction following" ☆29 · Updated 5 months ago
- A collection of lightweight interpretability scripts to understand how LLMs think ☆66 · Updated this week
- ☆23 · Updated 2 months ago
- Experimental CUDA kernel framework unifying typed dimensions, NVRTC JIT specialization, and ML‑guided tuning. ☆43 · Updated this week
- Implementation of Direct Preference Optimization ☆17 · Updated 2 years ago