traceopt-ai / traceml
A simple package to automatically trace PyTorch training memory usage.
☆67Updated last week
Alternatives and similar repositories for traceml
Users who are interested in traceml are comparing it to the libraries listed below.
- A collection of lightweight interpretability scripts to understand how LLMs think☆71Updated this week
- Datamodels for Hugging Face tokenizers☆86Updated 3 weeks ago
- Multi-backend recommender systems with Keras 3☆151Updated last week
- Aana SDK is a powerful framework for building AI-enabled multimodal applications.☆55Updated 4 months ago
- Seamless interface for using PyTorch distributed with Jupyter notebooks☆57Updated 3 months ago
- Tensor-Slayer: Manipulate weights and tensors of LLMs to achieve performance upgrades and introduce a novel inferenceless mechanistic in…☆27Updated 6 months ago
- ☆45Updated 2 months ago
- Machine Learning Serving focused on GenAI with simplicity as the top priority.☆59Updated 2 months ago
- ☆36Updated 7 months ago
- Visualize multi-model embedding spaces. The first goal is to quickly get a lay of the land of any embedding space. Then be able to scroll…☆27Updated last year
- Optimizing Causal LMs through GRPO with weighted reward functions and automated hyperparameter tuning using Optuna☆59Updated 2 months ago
- ☆181Updated 5 months ago
- Efficient non-uniform quantization with GPTQ for GGUF☆57Updated 3 months ago
- Fast, High-Fidelity LLM Decoding with Regex Constraints☆21Updated last year
- ☆89Updated 5 months ago
- AnyModal is a Flexible Multimodal Language Model Framework for PyTorch☆103Updated last year
- Framework for building and maintaining self-updating prompts for LLMs☆65Updated last year
- ML/DL Math and Method notes☆65Updated 2 years ago
- j1-micro (1.7B) & j1-nano (600M) are absurdly tiny but mighty reward models.☆99Updated 5 months ago
- ☆213Updated last week
- The official evaluation suite and dynamic data release for MixEval.☆11Updated last year
- Tools to make language models a bit easier to use☆61Updated this week
- LLM training in simple, raw C/CUDA☆15Updated last year
- Large multi-modal models (L3M) pre-training.☆223Updated 3 months ago
- Train LLM on Hugging Face infra☆67Updated last month
- Pivotal Token Search☆135Updated this week
- Train an adapter for any embedding model in under a minute☆129Updated 8 months ago
- Testing and evaluating the capabilities of Vision-Language models (PaliGemma) in performing computer vision tasks such as object detectio…☆85Updated last year
- ScalarLM - a unified training and inference stack☆93Updated last month
- Creating Generative AI Apps which work☆17Updated 8 months ago