imelnyk / ArxivPapers
Code behind Arxiv Papers
☆508Updated 11 months ago
Alternatives and similar repositories for ArxivPapers:
Users that are interested in ArxivPapers are comparing it to the libraries listed below
- A complete end-to-end pipeline for LLM interpretability with sparse autoencoders (SAEs) using Llama 3.2, written in pure PyTorch and full…☆603Updated 3 months ago
- LLM Analytics☆644Updated 4 months ago
- Visualize the intermediate output of Mistral 7B☆343Updated last month
- Fine-tune LLM agents with online reinforcement learning☆1,077Updated 11 months ago
- A pure NumPy implementation of Mamba.☆219Updated 8 months ago
- Finetune llama2-70b and codellama on MacBook Air without quantization☆448Updated 11 months ago
- Talk to any ArXiv paper using ChatGPT☆520Updated last year
- Textbook on reinforcement learning from human feedback☆474Updated this week
- MINT-1T: A one trillion token multimodal interleaved dataset.☆802Updated 7 months ago
- llama3.np is a pure NumPy implementation for Llama 3 model.☆973Updated 9 months ago
- A modern model graph visualizer and debugger☆1,132Updated this week
- Neum AI is a best-in-class framework to manage the creation and synchronization of vector embeddings at large scale.☆853Updated last year
- data-to-paper: Backward-traceable AI-driven scientific research☆596Updated 2 weeks ago
- A JAX research toolkit for building, editing, and visualizing neural networks.☆1,739Updated 2 months ago
- Open weights language model from Google DeepMind, based on Griffin.☆623Updated 2 weeks ago
- a small code base for training large models☆287Updated 2 months ago
- Ask GPT to run a command☆197Updated last week
- From scratch implementation of a sparse mixture of experts language model inspired by Andrej Karpathy's makemore :)☆672Updated 4 months ago
- Absolute minimalistic implementation of a GPT-like transformer using only numpy (<650 lines).☆250Updated last year
- Bayesian Optimization as a Coverage Tool for Evaluating LLMs. Accurate evaluation (benchmarking) that's 10 times faster with just a few l…☆279Updated 2 weeks ago
- Official codebase for the paper "Beyond A* Better Planning with Transformers via Search Dynamics Bootstrapping".☆362Updated 8 months ago
- Website for hosting the Open Foundation Models Cheat Sheet.☆262Updated 8 months ago
- Things you can do with the token embeddings of an LLM☆1,426Updated last month
- Felafax is building AI infra for non-NVIDIA GPUs☆555Updated last month
- ☆242Updated 11 months ago
- ☆111Updated last month
- a curated list of data for reasoning ai☆130Updated 7 months ago
- A library for making RepE control vectors☆555Updated 2 months ago