minimal diffusion transformer in pytorch.
☆17Oct 6, 2024Updated last year
Alternatives and similar repositories for mini_DiT
Users that are interested in mini_DiT are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- An experiment with movie scenes and contrastive learning☆11Feb 1, 2025Updated last year
- Notes of ADRL course taught at IISC as part of MTech AI curriculum☆13Nov 30, 2024Updated last year
- Forward and backward reaching inverse kinematics☆17Jun 1, 2024Updated last year
- YOLOv10: Real-Time End-to-End Object Detection☆12May 24, 2024Updated last year
- ☆63Mar 4, 2022Updated 4 years ago
- DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- alternative way to calculating self attention☆18May 25, 2024Updated last year
- Retrieve the source code for any model made available on replicate.com!☆36Jan 22, 2024Updated 2 years ago
- Train a SmolLM-style llm on fineweb-edu in JAX/Flax with an assortment of optimizers.☆19Jul 24, 2025Updated 8 months ago
- minimalistic desktop setup☆13Sep 9, 2024Updated last year
- a Minimal, clean code for the Byte Pair Encoding (BPE) algorithm commonly used in LLM tokenization in pure C.☆24Jul 6, 2024Updated last year
- ☆11Apr 20, 2020Updated 5 years ago
- Inference code for LLaMA models☆21Apr 3, 2025Updated 11 months ago
- dinov2 features aligned with CLIP☆21Jul 9, 2024Updated last year
- ☆16Jul 8, 2024Updated last year
- DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- [ACL 2024] Do Large Language Models Latently Perform Multi-Hop Reasoning?☆90Mar 18, 2025Updated last year
- Some useful websites for programmers.☆15Sep 24, 2024Updated last year
- You like pytorch? You like micrograd? You love tinygrad! ❤️☆18Feb 14, 2025Updated last year
- Generate markdown notes from your exam syllabus, provide syllabus in txt format & get markdown notes generated☆18May 24, 2024Updated last year
- Fast approximate joins on string columns for polars dataframes.☆16Dec 24, 2025Updated 3 months ago
- Deep Learning-based Forecasting of Building Energy Consumption☆17Sep 15, 2025Updated 6 months ago
- PTX-Tutorial Written Purely By AIs (Deep Research of Openai and Claude 3.7)☆66Mar 24, 2025Updated last year
- ☆15Jul 9, 2024Updated last year
- ☆19Sep 9, 2024Updated last year
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- ☆20Nov 18, 2024Updated last year
- ☆26Updated this week
- image retrieval/tagging with CLIP☆13Jul 13, 2024Updated last year
- Synthetic Alphabet Dataset☆19Mar 27, 2025Updated 11 months ago
- MLX port for xjdr's entropix sampler (mimics jax implementation)☆62Nov 4, 2024Updated last year
- Sentencepiece based BPE tokenizer for English and Japanese language text.☆28Apr 4, 2024Updated last year
- Sparse Transformer with limited attention span in PyTorch☆15Apr 4, 2021Updated 4 years ago
- Code to enable layer-level steering in LLMs using sparse auto encoders☆31Sep 18, 2025Updated 6 months ago
- A central place to organize and publish all of my hobbyist electronics knowledge and projects.☆40Aug 17, 2017Updated 8 years ago
- NordVPN Threat Protection Pro™ • AdTake your cybersecurity to the next level. Block phishing, malware, trackers, and ads. Lightweight app that works with all browsers.
- ☆29Sep 30, 2025Updated 5 months ago
- A single repo with all scripts and utils to train / fine-tune the Mamba model with or without FIM☆61Apr 8, 2024Updated last year
- Sparse Autoencoders (SAE) vs CLIP fine-tuning fun.☆18Dec 19, 2024Updated last year
- A digital archive of analog bits and bobs.☆21Feb 4, 2024Updated 2 years ago
- One click away from a locally downloaded, fine-tuned model, hosted on hugging face, with inference built in. In two hours.☆24Nov 9, 2025Updated 4 months ago
- ☆28Oct 7, 2025Updated 5 months ago
- A project between Anyscale and deepsense.ai implementing a cross-modal search application for e-commerce☆13Jun 5, 2024Updated last year