☆62Jan 20, 2026Updated 2 months ago
Alternatives and similar repositories for dolma3
Users that are interested in dolma3 are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Data mapping framework for rust stuff☆49Updated this week
- decontamination☆27Mar 4, 2026Updated 3 weeks ago
- Tooling for exact and MinHash deduplication of large-scale text datasets☆76Mar 16, 2026Updated last week
- The evaluation code for A Safety Report on GPT-5.2, Gemini 3 Pro, Qwen3-VL, Grok 4.1 Fast, Nano Banana Pro, and Seedream 4.5☆53Jan 18, 2026Updated 2 months ago
- Test for Graph Unlearning Benchmark☆19Jul 12, 2025Updated 8 months ago
- Bare Metal GPUs on DigitalOcean Gradient AI • AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- ☆44Dec 16, 2025Updated 3 months ago
- ☆33Apr 22, 2025Updated 11 months ago
- ☆20Jun 4, 2025Updated 9 months ago
- ☁️ KUMO: Generative Evaluation of Complex Reasoning in Large Language Models☆19Jun 4, 2025Updated 9 months ago
- ☆22May 30, 2023Updated 2 years ago
- ☆20Jun 9, 2025Updated 9 months ago
- Official PyTorch implementation of "CBNet: A Plug-and-Play Network for Segmentation-Based Scene Text Detection"☆22Mar 30, 2024Updated last year
- ☆109Jul 15, 2025Updated 8 months ago
- Single-pass Adaptive Image Tokenization for Minimum Program Search | What's the Kolmogorov Complexity of an Image?☆42Jul 26, 2025Updated 8 months ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting with the flexibility to host WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Cloudways by DigitalOcean.
- OpenFaaS function for Caire, the content aware image resize library. (https://github.com/esimov/caire)☆14May 2, 2021Updated 4 years ago
- FROM $f(x)$ AND $g(x)$ TO $f(g(x))$: LLMs Learn New Skills in RL by Composing Old Ones☆64Jan 26, 2026Updated 2 months ago
- ☆23Nov 26, 2024Updated last year
- Fast LLM Training CodeBase With dynamic strategy choosing [Deepspeed+Megatron+FlashAttention+CudaFusionKernel+Compiler];☆40Jan 4, 2024Updated 2 years ago
- FamilyTool benchmark☆13Sep 10, 2025Updated 6 months ago
- Directional diffusion models☆43Oct 31, 2024Updated last year
- Official code for "MAmmoTH2: Scaling Instructions from the Web" [NeurIPS 2024]☆149Oct 27, 2024Updated last year
- ☆26Mar 4, 2025Updated last year
- ☆14Jan 11, 2024Updated 2 years ago
- Bare Metal GPUs on DigitalOcean Gradient AI • AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- ☆68Mar 19, 2026Updated last week
- Using PyTorch autograd to compute Hessian of Perplexity for Large Language Models☆29Apr 17, 2025Updated 11 months ago
- a large-scale graph database created as a combination of multiple taxonomy backbones extracted from 5 existing knowledge graphs, namely: …☆14Jan 23, 2024Updated 2 years ago
- ☆37Oct 29, 2024Updated last year
- Benchmarks for Low Latency (Streaming) solutions including Apache Storm, Apache Spark, Apache Flink, Kafka Stream API and Hazelcast Jet☆10Apr 3, 2024Updated last year
- A comprehensive paper list of Table-based Question Answering.☆37Sep 1, 2023Updated 2 years ago
- Official Implementation of "Learning to Refuse: Towards Mitigating Privacy Risks in LLMs"☆10Dec 13, 2024Updated last year
- AWS Sample for extracting sensor data and detecting scenes from autonomous driving data collected in ROS bag files.☆12Sep 27, 2021Updated 4 years ago
- 小模型LLM的搭建,学习LLM的建模、训练过程 基于DeepSeek-MOE架构的小模型,用于个人学习,从0开始,解释每一条语句☆14Mar 28, 2025Updated 11 months ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting with the flexibility to host WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Cloudways by DigitalOcean.
- [SIGIR 2023] Schema-aware Reference as Prompt Improves Data-Efficient Knowledge Graph Construction☆42Apr 5, 2023Updated 2 years ago
- Our paper is titled "NUS-IDS at FinCausal 2021: Dependency Tree in Graph Neural Networks for better Cause-Effect Span Detection".☆13Feb 11, 2022Updated 4 years ago
- This is code for the EMNLP 2022 Paper "UniRPG: Unified Discrete Reasoning over Table and Text as Program Generation".☆10Apr 30, 2023Updated 2 years ago
- Here we provide and collect many functions to generate math problem and step by step solutions for LLM training☆18Jun 21, 2023Updated 2 years ago
- This workshop will familiarize you with some of the key steps towards building an autonomous driving data lake and extracting images from…☆10Jul 12, 2022Updated 3 years ago
- [Preprint] Efficient Generative Model Training via Embedded Representation Warmup☆36Oct 15, 2025Updated 5 months ago
- My Arduboy Mini Games☆15Mar 15, 2019Updated 7 years ago