not-lain / pxiaLinks
minimalistic AI library that resembles HF's transformers
☆14Updated 7 months ago
Alternatives and similar repositories for pxia
Users that are interested in pxia are comparing it to the libraries listed below
Sorting:
- ☆66Updated last year
- Entropy Based Sampling and Parallel CoT Decoding☆17Updated 10 months ago
- ☆134Updated 11 months ago
- MLX port for xjdr's entropix sampler (mimics jax implementation)☆63Updated 9 months ago
- look how they massacred my boy☆63Updated 9 months ago
- an open source reproduction of NVIDIA's nGPT (Normalized Transformer with Representation Learning on the Hypersphere)☆104Updated 5 months ago
- Train your own SOTA deductive reasoning model☆104Updated 5 months ago
- Official codebase for "Quantile Reward Policy Optimization: Alignment with Pointwise Regression and Exact Partition Functions" (Matrenok …☆24Updated last month
- Lego for GRPO☆28Updated 2 months ago
- Matrix (Multi-Agent daTa geneRation Infra and eXperimentation framework) is a versatile engine for multi-agent conversational data genera…☆82Updated this week
- ☆131Updated 4 months ago
- ☆48Updated 10 months ago
- The simplest, fastest repository for training/finetuning medium-sized xLSTMs.☆41Updated last year
- An easy-to-understand framework for LLM samplers that rewind and revise generated tokens☆146Updated 5 months ago
- Scaling is a distributed training library and installable dependency designed to scale up neural networks, with a dedicated module for tr…☆64Updated 9 months ago
- Lightweight toolkit package to train and fine-tune 1.58bit Language models☆83Updated 2 months ago
- entropix style sampling + GUI☆26Updated 9 months ago
- Google TPU optimizations for transformers models☆118Updated 6 months ago
- Training an LLM to use a calculator with multi-turn reinforcement learning, achieving a **62% absolute increase in evaluation accuracy**.☆45Updated 3 months ago
- Arxflix turns your boring Arxiv research paper into a captivating video.☆52Updated this week
- ☆118Updated 11 months ago
- Video+code lecture on building nanoGPT from scratch☆69Updated last year
- An introduction to LLM Sampling☆79Updated 8 months ago
- Simple examples using Argilla tools to build AI☆53Updated 8 months ago
- Tcurtsni: Reverse Instruction Chat, ever wonder what your LLM wants to ask you?☆22Updated last year
- Easy to use, High Performant Knowledge Distillation for LLMs☆89Updated 3 months ago
- ☆130Updated 4 months ago
- A blueprint for AI development, focusing on applied examples of RAG, information extraction, analysis and fine-tuning in the age of LLMs …☆58Updated 6 months ago
- Just a bunch of benchmark logs for different LLMs☆119Updated last year
- 1.58-bit LLaMa model☆82Updated last year