fal-ai / lavender-dataLinks
Load & manage evolving datasets efficiently
☆23Updated 5 months ago
Alternatives and similar repositories for lavender-data
Users that are interested in lavender-data are comparing it to the libraries listed below
Sorting:
- ☆34Updated last year
- Focused on fast experimentation and simplicity☆80Updated last year
- Lightweight package that tracks and summarizes code changes using LLMs (Large Language Models)☆34Updated 11 months ago
- Because it's there.☆16Updated last year
- ☆50Updated 3 months ago
- Simple implementation of muP, based on Spectral Condition for Feature Learning. The implementation is SGD only, dont use it for Adam☆85Updated last year
- ☆24Updated last year
- Simple high-throughput inference library☆155Updated 8 months ago
- ☆12Updated last year
- ☆18Updated last year
- ☆19Updated 2 months ago
- ☆23Updated last year
- https://hf.co/hexgrad/Kokoro-82M☆14Updated 3 weeks ago
- implementation of https://arxiv.org/pdf/2312.09299☆21Updated last year
- research impl of Native Sparse Attention (2502.11089)☆63Updated 11 months ago
- Train a SmolLM-style llm on fineweb-edu in JAX/Flax with an assortment of optimizers.☆18Updated 6 months ago
- [WIP] Transformer to embed Danbooru labelsets☆13Updated last year
- an open source reproduction of NVIDIA's nGPT (Normalized Transformer with Representation Learning on the Hypersphere)☆110Updated 11 months ago
- ☆47Updated 2 years ago
- This library supports evaluating disparities in generated image quality, diversity, and consistency between geographic regions.☆20Updated last year
- ☆32Updated last year
- High-throughput tensor loading for PyTorch☆221Updated 2 weeks ago
- Tiny re-implementation of MDM in style of LLaDA and nano-gpt speedrun☆56Updated 11 months ago
- ☆307Updated this week
- Experimental GPU language with meta-programming☆25Updated last year
- ☆53Updated 2 years ago
- A minimal implementation of Drifting Models for 2D toy data. Unlike diffusion/flow models that iterate at inference, drifting models evo…☆39Updated this week
- Minimal (400 LOC) implementation Maximum (multi-node, FSDP) GPT training☆132Updated last year
- DeMo: Decoupled Momentum Optimization☆198Updated last year
- A synthetic story narration dataset to study small audio LMs.☆31Updated 2 years ago