saurabhaloneai / qwen3-expLinks
qwen3 experiments
☆27Updated 2 weeks ago
Alternatives and similar repositories for qwen3-exp
Users that are interested in qwen3-exp are comparing it to the libraries listed below
Sorting:
- ☆64Updated last month
- This repository contain the simple llama3 implementation in pure jax.☆67Updated 4 months ago
- A locally trained model of Stoney Nakoda has been developed and released. You can access the working model here or train your own instanc…☆10Updated 3 months ago
- So, I trained a Llama a 130M architecture I coded from ground up to build a small instruct model from scratch. Trained on FineWeb dataset…☆15Updated 3 months ago
- ☆186Updated 2 weeks ago
- rl from zero pretrain, can it be done? we'll see.☆65Updated 3 weeks ago
- ☆38Updated 11 months ago
- look how they massacred my boy☆63Updated 8 months ago
- ☆49Updated this week
- ☆19Updated 4 months ago
- Very minimal (and stateless) agent framework☆44Updated 6 months ago
- The original BabyAGI, updated with LiteLLM and no vector database reliance (csv instead)☆21Updated 9 months ago
- Training an LLM to use a calculator with multi-turn reinforcement learning, achieving a **62% absolute increase in evaluation accuracy**.☆42Updated 2 months ago
- An intelligent code optimization system leveraging AI analysis, automated refactoring, and test generation. Built with DSPy and Gradio, i…☆20Updated 5 months ago
- Simple GRPO scripts and configurations.☆59Updated 5 months ago
- Testing paligemma2 finetuning on reasoning dataset☆18Updated 6 months ago
- MLX port for xjdr's entropix sampler (mimics jax implementation)☆64Updated 8 months ago
- tiny_fnc_engine is a minimal python library that provides a flexible engine for calling functions extracted from a LLM.☆38Updated 10 months ago
- j1-micro (1.7B) & j1-nano (600M) are absurdly tiny but mighty reward models.☆91Updated last month
- NanoGPT-speedrunning for the poor T4 enjoyers☆68Updated 2 months ago
- The Automated LLM Speedrunning Benchmark measures how well LLM agents can reproduce previous innovations and discover new ones in languag…☆87Updated 2 weeks ago
- Optimizing Causal LMs through GRPO with weighted reward functions and automated hyperparameter tuning using Optuna☆54Updated 5 months ago
- Lego for GRPO☆28Updated last month
- Useful resources for LLM-based Diarization and Transcription.☆55Updated 9 months ago
- Quick Notebook Tutorials☆32Updated 5 months ago
- Really quick-and-dirty example of AI recursive learning☆26Updated 8 months ago
- Daily Research Bot helps you stay on top of new AI-related research and updates. Currently supports: `huggingface.co/papers` and `hype.re…☆46Updated 7 months ago
- ☆87Updated 6 months ago
- Train your own SOTA deductive reasoning model☆96Updated 4 months ago
- Official homepage for "Self-Harmonized Chain of Thought" (NAACL 2025)☆91Updated 5 months ago