Using modal.com to process FineWeb-edu data
☆20 · Apr 5, 2025 · Updated 11 months ago
Alternatives and similar repositories for latent-data-modal
Users that are interested in latent-data-modal are comparing it to the libraries listed below
- Training code for Sparse Autoencoders on Embedding models ☆39 · Feb 27, 2025 · Updated last year
- utilities for loading and running text embeddings with onnx ☆45 · Aug 16, 2025 · Updated 6 months ago
- Sparse autoencoders for Contra text embedding models ☆25 · Apr 24, 2024 · Updated last year
- Build your own custom knowledge base from various sources such as YouTube video transcripts, tweets, articles, videos and audio. Uses G… ☆13 · Dec 15, 2023 · Updated 2 years ago
- Apps that run on modal.com ☆13 · Sep 14, 2025 · Updated 5 months ago
- A framework for evaluating the effectiveness of chain-of-thought reasoning in language models. ☆19 · Feb 6, 2025 · Updated last year
- Because it's there. ☆16 · Sep 22, 2024 · Updated last year
- various experiments for scaling inference time compute with small reasoning models ☆17 · Jan 16, 2025 · Updated last year
- A single static file as vector database, using the cloud-native flatgeobuf file format and http range requests ☆17 · Oct 28, 2025 · Updated 4 months ago
- A high-performance batching router that optimises max throughput for text inference workloads ☆16 · Sep 6, 2023 · Updated 2 years ago
- PyTorch implementation for MRL ☆22 · Feb 22, 2024 · Updated 2 years ago
- FormFill is a CLI tool that uses LLMs to automatically fill out PDF forms. ☆29 · Nov 22, 2024 · Updated last year
- OpenAI GPT-3/3.5/4 API client written in Go ☆20 · Apr 13, 2023 · Updated 2 years ago
- ☆21 · Oct 14, 2024 · Updated last year
- A collection of LLM services you can self host via docker or modal labs to support your applications development ☆201 · Apr 29, 2024 · Updated last year
- 5X faster, 60% less memory QLoRA finetuning ☆21 · May 28, 2024 · Updated last year
- A collection of tools for your LLMs that run on Modal ☆23 · Feb 28, 2025 · Updated last year
- Deterministic text generation and embeddings with zero configuration ☆42 · Feb 21, 2026 · Updated 2 weeks ago
- Real-time latent exploration of diffusion models ☆29 · Apr 21, 2024 · Updated last year
- Deploy a FastHTML app in just a few lines of simple python code on Modal's serverless infra. ☆26 · Aug 19, 2024 · Updated last year
- Server bots for Poe ☆18 · Nov 17, 2025 · Updated 3 months ago
- ☆12 · Jan 17, 2026 · Updated last month
- Engineering the state of RNN language models (Mamba, RWKV, etc.) ☆32 · May 25, 2024 · Updated last year
- Parameter-Efficient Sparsity Crafting From Dense to Mixture-of-Experts for Instruction Tuning on General Tasks ☆31 · May 22, 2024 · Updated last year
- A library to use `modal` as a backend for `joblib`. ☆32 · Jan 15, 2025 · Updated last year
- Fine-tune ModernBERT with custom tokenizers, curriculum learning, and next-gen optimizers. ☆74 · Jan 16, 2026 · Updated last month
- The Batched API provides a flexible and efficient way to process multiple requests in a batch, with a primary focus on dynamic batching o… ☆159 · Jul 14, 2025 · Updated 7 months ago
- fine-tuning tutorial ☆18 · Feb 20, 2026 · Updated 2 weeks ago
- 🚀 A simple, modern, full-stack toolkit for Python 🐍 ☆38 · Oct 18, 2024 · Updated last year
- Repository for go shared libraries (for now). ☆11 · Dec 1, 2025 · Updated 3 months ago
- A Claude Code plugin that solves the same problems as community frameworks (GSD, BMAD, Ralph, Agent OS), but using the tool's native arc… ☆28 · Mar 1, 2026 · Updated last week
- Official implementation of the ICML 2024 paper RoSA (Robust Adaptation) ☆44 · Feb 13, 2024 · Updated 2 years ago
- this is a dataset converter that takes YOLO bbox data and makes polygons using SAM-HQ! ☆39 · Jan 24, 2024 · Updated 2 years ago
- LightGBM for handling label-imbalanced data with focal and weighted loss functions in binary and multiclass classification ☆21 · Jan 29, 2026 · Updated last month
- An AI-powered web application leveraging Next.js 14 and TensorFlow.js for real-time object detection. Utilizing Tensorflow model for accu… ☆12 · Dec 3, 2024 · Updated last year
- ApertureDB Python Client ☆12 · Jan 14, 2026 · Updated last month
- Text preprocessing package for use in NLP tasks https://pypi.org/project/textcl/ ☆11 · Aug 9, 2024 · Updated last year
- PyTorch Implementation of Context-Aware Sequential Model for Multi-Behaviour Recommendation https://arxiv.org/abs/2312.09684 ☆10 · May 31, 2024 · Updated last year
- QUIC pluggable crypto to use the protocol as plaintext (for use when cryptography is already handled at another layer, e.g. Wireguard) ☆10 · Aug 27, 2025 · Updated 6 months ago