furiousteabag / vram-calculator
Transformer GPU VRAM estimator
☆68 · Mar 26, 2024 · Updated last year
Alternatives and similar repositories for vram-calculator
Users that are interested in vram-calculator are comparing it to the libraries listed below
- URL downloader supporting checkpointing and continuous checksumming. ☆19 · Nov 29, 2023 · Updated 2 years ago
- An Educational Framework Based on PyTorch for Deep Learning Education and Exploration ☆10 · Dec 24, 2023 · Updated 2 years ago
- See https://github.com/cuda-mode/triton-index/ instead! ☆11 · May 8, 2024 · Updated last year
- ☆17 · Dec 22, 2025 · Updated last month
- Template project for building a Fixie Sidekick. ☆13 · Nov 27, 2023 · Updated 2 years ago
- Explorations into the proposed SDFT, Self-Distillation Enables Continual Learning, from Shenfeld et al. of MIT ☆29 · Feb 6, 2026 · Updated last week
- ☆28 · Sep 13, 2024 · Updated last year
- This sample shows how to take text documents as a input via BlobTrigger, does Text Summarization & Sentiment Score processing using the A… ☆23 · Oct 14, 2024 · Updated last year
- Notes on Direct Preference Optimization ☆24 · Apr 14, 2024 · Updated last year
- Utilities for PyTorch distributed ☆25 · Feb 27, 2025 · Updated 11 months ago
- LLM plugin for models hosted on Replicate ☆65 · Apr 18, 2024 · Updated last year
- ☆26 · Mar 28, 2025 · Updated 10 months ago
- ☆21 · Mar 3, 2025 · Updated 11 months ago
- Efficiently computing & storing token n-grams from large corpora ☆26 · Oct 6, 2024 · Updated last year
- ☆23 · Feb 22, 2017 · Updated 8 years ago
- A collection of Ollama model deployments on Google Cloud Run ☆28 · Jun 27, 2024 · Updated last year
- Facebook AI Research Sequence-to-Sequence Toolkit written in Python. ☆25 · Aug 31, 2025 · Updated 5 months ago
- GNU APL native interop for Clojure ☆29 · Mar 18, 2022 · Updated 3 years ago
- Sentencepiece based BPE tokenizer for English and Japanese language text. ☆28 · Apr 4, 2024 · Updated last year
- LMQL implementation of tree of thoughts ☆36 · Jan 31, 2024 · Updated 2 years ago
- An unofficial implementation of the Infini-gram model proposed by Liu et al. (2024) ☆33 · Jun 19, 2024 · Updated last year
- This repository offers a Python framework for a retrieval-augmented generation (RAG) pipeline using text and images from MHTML documents,… ☆34 · Nov 17, 2025 · Updated 2 months ago
- Test-Time Memory Framework: Control Hallucinations in Foundation Models ☆11 · Nov 4, 2025 · Updated 3 months ago
- This solution converts speech to text and then processes and summarizes the text based on the prompt scenario. ☆39 · Oct 8, 2024 · Updated last year
- Simple and efficient pytorch-native transformer training and inference (batched) ☆79 · Apr 2, 2024 · Updated last year
- Sparsey, trademark Neurithmic Systems, is unsupervised learning algorithm inspired from the computations of cortical macro-columns and mi… ☆12 · Feb 27, 2023 · Updated 2 years ago
- Consists of the largest (10K) human annotated code-switched semantic parsing dataset & 170K generated utterance using the CST5 augmentati… ☆41 · Feb 1, 2023 · Updated 3 years ago
- A simple UI to check the availability of domains with Namecheap's API ☆32 · Mar 2, 2018 · Updated 7 years ago
- A python algorithm to change the pitch of the voice in real time ☆13 · Dec 13, 2020 · Updated 5 years ago
- [ICML 2024] Official Repository for the paper "Transformers Get Stable: An End-to-End Signal Propagation Theory for Language Models" ☆10 · Jul 19, 2024 · Updated last year
- A Reward-Modulated Hebbian Learning Rule for Recurrent Neural Networks ☆35 · Jul 26, 2021 · Updated 4 years ago
- Official code for LEWIS, from: "LEWIS: Levenshtein Editing for Unsupervised Text Style Transfer", ACL-IJCNLP 2021 Findings by Machel Rei… ☆31 · Oct 24, 2022 · Updated 3 years ago
- [ACL 2023] Counterspeeches up my sleeve! Intent Distribution Learning and Persistent Fusion for Intent-Conditioned Counterspeech Generati… ☆10 · Sep 23, 2023 · Updated 2 years ago
- This project contains the original white paper for Language Construct Modeling (LCM) v1.13, authored by Vincent Shing Hin Chong. It intro… ☆15 · Jul 23, 2025 · Updated 6 months ago
- ☆16 · Nov 18, 2021 · Updated 4 years ago
- Python platform for working with LLMs ☆40 · Jan 23, 2024 · Updated 2 years ago
- ☆16 · Jul 23, 2023 · Updated 2 years ago
- A set of visualization engines. ☆14 · Updated this week
- Feature extraction from sound signals along with complete CNN model and evaluations using tensorflow, keras and, librosa for MFCC generat… ☆10 · Jan 1, 2022 · Updated 4 years ago