enjalot / latent-data-modalView external linksLinks
Using modal.com to process FineWeb-edu data
☆20Apr 5, 2025Updated 10 months ago
Alternatives and similar repositories for latent-data-modal
Users that are interested in latent-data-modal are comparing it to the libraries listed below
Sorting:
- utilities for loading and running text embeddings with onnx☆45Aug 16, 2025Updated 5 months ago
- Sparse autoencoders for Contra text embedding models☆25Apr 24, 2024Updated last year
- Apps that run on modal.com☆13Sep 14, 2025Updated 5 months ago
- Build your own custom knowledge base from various sources such as youtube videos transcripts, tweets, articles, videos and audios. Uses G…☆13Dec 15, 2023Updated 2 years ago
- A framework for evaluating the effectiveness of chain-of-thought reasoning in language models.☆19Feb 6, 2025Updated last year
- Because it's there.☆16Sep 22, 2024Updated last year
- ☆68May 26, 2024Updated last year
- A high performance batching router optimises max throughput for text inference workload☆16Sep 6, 2023Updated 2 years ago
- A single static file as vector database, using the cloud-native flatgeobuf file format and http range requests☆17Oct 28, 2025Updated 3 months ago
- ☆21Oct 14, 2024Updated last year
- OpenAI GPT-3/3.5/4 API client written in Go☆20Apr 13, 2023Updated 2 years ago
- FormFill is a CLI tool that uses LLMs to automatically fill out PDF forms.☆29Nov 22, 2024Updated last year
- A collection of LLM services you can self host via docker or modal labs to support your applications development☆198Apr 29, 2024Updated last year
- 5X faster 60% less memory QLoRA finetuning☆21May 28, 2024Updated last year
- A collection of tools for your LLMs that run on Modal☆23Feb 28, 2025Updated 11 months ago
- ☆25May 15, 2024Updated last year
- Server bots for Poe☆18Nov 17, 2025Updated 2 months ago
- Deploy a FastHTML app in just a few lines of simple python code on Modal's serverless infra.☆26Aug 19, 2024Updated last year
- ☆12Jan 17, 2026Updated 3 weeks ago
- Entity resolution, also known as Data Matching or Record linkage is the task of finding a data set that refer to the same or similar real…☆32Apr 8, 2025Updated 10 months ago
- 🔌 Want one client library for all your embeddings? 💙 Choose Catsu! 🐱☆58Jan 16, 2026Updated last month
- ☆24Jan 30, 2025Updated last year
- Engineering the state of RNN language models (Mamba, RWKV, etc.)☆32May 25, 2024Updated last year
- ☆29Apr 29, 2024Updated last year
- Parameter-Efficient Sparsity Crafting From Dense to Mixture-of-Experts for Instruction Tuning on General Tasks☆31May 22, 2024Updated last year
- Fine-tune ModernBERT with custom tokenizers, curriculum learning, and next-gen optimizers.☆74Jan 16, 2026Updated last month
- Repository for the Q-Filters method (https://arxiv.org/pdf/2503.02812)☆35Mar 7, 2025Updated 11 months ago
- The Batched API provides a flexible and efficient way to process multiple requests in a batch, with a primary focus on dynamic batching o…☆156Jul 14, 2025Updated 7 months ago
- Repository for go shared libraries (for now).☆11Dec 1, 2025Updated 2 months ago
- fine-tuning tutorial☆17Dec 13, 2025Updated 2 months ago
- DOS Program Development☆12Nov 9, 2022Updated 3 years ago
- Official implementation of the ICML 2024 paper RoSA (Robust Adaptation)☆44Feb 13, 2024Updated 2 years ago
- this is a dataset converter that takes YOLO bbox data and makes polygons using SAM-HQ!☆39Jan 24, 2024Updated 2 years ago
- ☆13Nov 5, 2024Updated last year
- Text preprocessing package for use in NLP tasks https://pypi.org/project/textcl/☆11Aug 9, 2024Updated last year
- MLX-based QA pair generator and LLM finetuning tool in Streamlit☆42Oct 18, 2025Updated 3 months ago
- ☆11Oct 2, 2025Updated 4 months ago
- MLX Implementation of Recursive Reasoning with Tiny Networks☆78Oct 11, 2025Updated 4 months ago
- A minimal tool to generate and validate datasets.☆26Feb 8, 2026Updated last week