LLaDA implementation
☆19Jul 24, 2025Updated 8 months ago
Alternatives and similar repositories for LLaDA_Arithmetic
Users that are interested in LLaDA_Arithmetic are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Official Jax Implementation of MD4 Masked Diffusion Models☆154Feb 27, 2025Updated last year
- A research project exploring fine-tuning BERT-style models for text generation☆39Nov 30, 2025Updated 3 months ago
- a simple federate-learning framework write by python☆11Sep 18, 2024Updated last year
- Tiny re-implementation of MDM in style of LLaDA and nano-gpt speedrun☆58Mar 10, 2025Updated last year
- Source code for SWIFT, an efficient reward model.☆19Jan 13, 2026Updated 2 months ago
- End-to-end encrypted cloud storage - Proton Drive • AdSpecial offer: 40% Off Yearly / 80% Off First Month. Protect your most important files, photos, and documents from prying eyes.
- Unofficial implementation of the Ask-LLM paper 'How to Train Data-Efficient LLMs', arXiv:2402.09668.☆12Jun 19, 2024Updated last year
- ☆29Dec 19, 2025Updated 3 months ago
- [ICLR 2026] Official Implementation of ProxyThinker: Test-Time Guidance through Small Visual Reasoners.☆20Sep 24, 2025Updated 6 months ago
- Code for experiments on transformers using Markovian data.☆22Nov 22, 2024Updated last year
- Mixture-of-Basis-Experts for Compressing MoE-based LLMs☆31Dec 24, 2025Updated 3 months ago
- 魔镜魔镜,无所不知的魔镜[-_-](并不是)☆13Jun 10, 2021Updated 4 years ago
- A repository for the EMNLP 2021 paper "Is Information Density Uniform in Task-Oriented Dialogues?" and for the CoNLL 2021 paper "Analysin…☆10Jun 17, 2024Updated last year
- [CVPR 2025] Decision SpikeFormer: Spike-Driven Transformer for Decision Making☆18Aug 8, 2025Updated 7 months ago
- Materials for implementing and reproducing results in the NIPS paper.☆23Nov 6, 2014Updated 11 years ago
- NordVPN Special Discount Offer • AdSave on top-rated NordVPN 1 or 2-year plans with secure browsing, privacy protection, and support for for all major platforms.
- Self Reproduction Code of Paper "Reducing Transformer Key-Value Cache Size with Cross-Layer Attention (MIT CSAIL)☆17May 24, 2024Updated last year
- ☆15Nov 5, 2020Updated 5 years ago
- Generative Modeling via Drifting in MLX☆42Feb 6, 2026Updated last month
- The PyTorch implementation for "DEAL: Disentangle and Localize Concept-level Explanations for VLMs" (ECCV 2024 Strong Double Blind)☆20Mar 9, 2026Updated 2 weeks ago
- Official Implementation of Geo2Vec oral presented @ [AAAI '2026]☆32Nov 22, 2025Updated 4 months ago
- Official code for "ZigZag: Universal Sampling-free Uncertainty Estimation Through Two-Step Inference" (TMLR 2024)☆17Nov 7, 2024Updated last year
- A PyTorch native platform for training generative AI models☆16Nov 18, 2025Updated 4 months ago
- Official code for "Enabling Uncertainty Estimation in Iterative Neural Networks" (ICML 2024)☆19Jul 8, 2024Updated last year
- ☆12Feb 19, 2024Updated 2 years ago
- Simple, predictable pricing with DigitalOcean hosting • AdAlways know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
- Python toolkit for document information extraction using LMDX☆13Oct 15, 2023Updated 2 years ago
- Towards a million-node RISC-V cluster.☆14Mar 6, 2025Updated last year
- a simple web of data visualization☆11Feb 18, 2023Updated 3 years ago
- [CCS-LAMPS'24] LLM IP Protection Against Model Merging☆16Oct 14, 2024Updated last year
- This repository contains a curated list of resources related to World Models for Autonomous Driving (WMAD), based on the survey.☆29Oct 10, 2025Updated 5 months ago
- A custom Color Picker widget for PyQt5/PyQt6 applications.☆17May 22, 2021Updated 4 years ago
- A C/C++ header file that converts Intel SSE intrinsics to MIPS/MIPS64 MSA intrinsics.☆10Nov 16, 2021Updated 4 years ago
- ☆19Sep 11, 2024Updated last year
- 信创群友语录☆13Nov 5, 2022Updated 3 years ago
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- ☆19Mar 12, 2025Updated last year
- ☆41Sep 30, 2025Updated 5 months ago
- ☆14Sep 14, 2021Updated 4 years ago
- Official implementation repository for the paper Towards General Conceptual Model Editing via Adversarial Representation Engineering.☆19Dec 6, 2024Updated last year
- 个人轨迹树 / My Previous Blog☆16Mar 14, 2020Updated 6 years ago
- The open-source materials for paper "Sparsing Law: Towards Large Language Models with Greater Activation Sparsity".☆30Nov 12, 2024Updated last year
- Generate sentences from a probabilistic context-free grammar.☆17Nov 8, 2024Updated last year