imelnyk / ArxivPapers
Code behind Arxiv Papers
☆515 Updated last year
Alternatives and similar repositories for ArxivPapers:
Users who are interested in ArxivPapers are comparing it to the repositories listed below:
- LLM Analytics ☆658 Updated 6 months ago
- Visualize the intermediate output of Mistral 7B ☆360 Updated 3 months ago
- A complete end-to-end pipeline for LLM interpretability with sparse autoencoders (SAEs) using Llama 3.2, written in pure PyTorch and full… ☆607 Updated last month
- a small code base for training large models ☆294 Updated last week
- A pure NumPy implementation of Mamba. ☆222 Updated 9 months ago
- Neum AI is a best-in-class framework to manage the creation and synchronization of vector embeddings at large scale. ☆854 Updated last year
- llama3.np is a pure NumPy implementation for Llama 3 model. ☆981 Updated last week
- Animating R1's thoughts. ☆380 Updated 2 months ago
- a curated list of data for reasoning ai ☆134 Updated 9 months ago
- MINT-1T: A one trillion token multimodal interleaved dataset. ☆810 Updated 9 months ago
- Finetune llama2-70b and codellama on MacBook Air without quantization ☆448 Updated last year
- Grandmaster-Level Chess Without Search ☆572 Updated 3 months ago
- Fine-tune LLM agents with online reinforcement learning ☆1,137 Updated last year
- gpt-2 from scratch in mlx ☆383 Updated 10 months ago
- Open weights language model from Google DeepMind, based on Griffin. ☆636 Updated 2 months ago
- data-to-paper: Backward-traceable AI-driven scientific research ☆633 Updated 3 weeks ago
- Implementation of the training framework proposed in Self-Rewarding Language Model, from MetaAI ☆1,378 Updated last year
- A JAX research toolkit for building, editing, and visualizing neural networks. ☆1,774 Updated last week
- A comprehensive deep dive into the world of tokens ☆222 Updated 10 months ago
- Bayesian Optimization as a Coverage Tool for Evaluating LLMs. Accurate evaluation (benchmarking) that's 10 times faster with just a few l… ☆283 Updated this week
- A modern model graph visualizer and debugger ☆1,175 Updated this week
- Things you can do with the token embeddings of an LLM ☆1,440 Updated last month
- Felafax is building AI infra for non-NVIDIA GPUs ☆559 Updated 3 months ago
- Website for hosting the Open Foundation Models Cheat Sheet. ☆267 Updated 2 weeks ago
- Absolute minimalistic implementation of a GPT-like transformer using only numpy (<650 lines). ☆251 Updated last year
- Run and explore Llama models locally with minimal dependencies on CPU ☆189 Updated 6 months ago
- Achieve the llama3 inference step-by-step, grasp the core concepts, master the process derivation, implement the code. ☆571 Updated 2 months ago
- MobileLLM Optimizing Sub-billion Parameter Language Models for On-Device Use Cases. In ICML 2024. ☆1,291 Updated 2 weeks ago
- Official codebase for the paper "Beyond A* Better Planning with Transformers via Search Dynamics Bootstrapping". ☆365 Updated 10 months ago
- See Through Your Models ☆389 Updated last month