schletz / Dbi2Sem
DBI in the 4th semester of AIF/KIF, or in year III of HIF
☆7 · Updated 11 months ago
Related projects
Alternatives and complementary repositories for Dbi2Sem
- POS in the 5th semester of AIF/KIF, or in year III of HIF ☆34 · Updated 2 months ago
- Swarming algorithms like PSO, Ant Colony, Sakana, and more in PyTorch 😊 ☆111 · Updated last week
- Implementation of DreamerV3 in PyTorch ☆33 · Updated this week
- ☆50 · Updated 5 months ago
- Minimal but scalable implementation of large language models in JAX ☆26 · Updated 2 weeks ago
- Efficient World Models with Context-Aware Tokenization. ICML 2024 ☆84 · Updated 2 months ago
- Cost aware hyperparameter tuning algorithm ☆124 · Updated 4 months ago
- Efficient baselines for autocurricula in JAX. ☆173 · Updated 2 months ago
- MLX port for xjdr's entropix sampler (mimics jax implementation) ☆55 · Updated 2 weeks ago
- Muon optimizer for neural networks: >30% extra sample efficiency, <3% wallclock overhead ☆113 · Updated this week
- ☆65 · Updated 2 weeks ago
- Minimal (400 LOC) implementation Maximum (multi-node, FSDP) GPT training ☆113 · Updated 7 months ago
- Simple single-file baselines for Q-Learning in pure-GPU setting ☆103 · Updated 3 months ago
- Generative cellular automaton-like learning environments for RL. ☆19 · Updated last month
- Schedule free optimiser implemented in JAX using Optimistix ☆14 · Updated 5 months ago
- Intelligent Go-Explore: Standing on the Shoulders of Giant Foundation Models ☆45 · Updated 5 months ago
- Official Implementation of "Can Learned Optimization Make Reinforcement Learning Less Difficult" ☆12 · Updated 3 weeks ago
- ☆13 · Updated 4 months ago
- Alpha-Zero Connect Four NN trained via self play ☆13 · Updated last month
- Entropy Based Sampling and Parallel CoT Decoding ☆17 · Updated last month
- ☆27 · Updated 4 months ago
- A Large Recurrent Action Model: xLSTM enables Fast Inference for Robotics Tasks ☆21 · Updated 3 weeks ago
- Ultra low overhead NVIDIA GPU telemetry plugin for telegraf with memory temperature readings. ☆61 · Updated 4 months ago
- look how they massacred my boy ☆58 · Updated last month
- A toolkit for practical Human-AI cooperation research ☆13 · Updated 7 months ago
- ☆15 · Updated last year
- XLand-100B: A Large-Scale Multi-Task Dataset for In-Context Reinforcement Learning ☆53 · Updated 3 months ago
- The simplest, fastest repository for training/finetuning medium-sized GPTs. ☆84 · Updated this week
- papers.day ☆79 · Updated 11 months ago
- [ICML 2024] Official code release accompanying the paper "diff History for Neural Language Agents" (Piterbarg, Pinto, Fergus) ☆18 · Updated 3 months ago