☆48Jan 20, 2026Updated last month
Alternatives and similar repositories for dolma3
Users that are interested in dolma3 are comparing it to the libraries listed below
- decontamination☆26Dec 3, 2025Updated 3 months ago
- Data mapping framework for rust stuff☆47Updated this week
- Fast LLM Training CodeBase With dynamic strategy choosing [Deepspeed+Megatron+FlashAttention+CudaFusionKernel+Compiler];☆40Jan 4, 2024Updated 2 years ago
- Simplifies data migration between Apache Ignite clusters by relying on Apache Avro as an intermediate storage format☆13Jun 27, 2023Updated 2 years ago
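- A data development platform contributed by APEX and built on big-data platform capabilities. It helps enterprises link data at minimal cost, build and consolidate data warehouse models, lower the barrier to data applications, and accumulate data value.☆12Oct 31, 2024Updated last year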
- [SIGIR 2023] Schema-aware Reference as Prompt Improves Data-Efficient Knowledge Graph Construction☆42Apr 5, 2023Updated 2 years ago
- Official code for "MAmmoTH2: Scaling Instructions from the Web" [NeurIPS 2024]☆149Oct 27, 2024Updated last year
- Official Implementation of "Learning to Refuse: Towards Mitigating Privacy Risks in LLMs"☆10Dec 13, 2024Updated last year
- ☆17Aug 5, 2025Updated 7 months ago
- Analysis on stop reasons☆10Jun 17, 2024Updated last year
- Azure Machine Learning - MLOps Python SDKv2☆10Jul 24, 2023Updated 2 years ago
- A fairy-tale creation service for children, My AI Fairy-Tale☆11Apr 7, 2023Updated 2 years ago
- Collaborative Discourse Manager☆11Nov 6, 2016Updated 9 years ago
- KuaiSearch PERKS☆12Nov 16, 2021Updated 4 years ago
- CWTS OpenAlex ETL data pipeline.☆16Oct 29, 2025Updated 4 months ago
- Simplified DOM Trees for Transferable Attribute Extraction from the Web☆40Sep 27, 2024Updated last year
- ☆12Updated this week
- [ICML 2024] Code for the paper "MoE-RBench: Towards Building Reliable Language Models with Sparse Mixture-of-Experts"☆10Jul 1, 2024Updated last year
- ☆11Jan 13, 2013Updated 13 years ago
- This is the code for reproducing the TABBIE baseline in our paper: "Retrieval-Based Transformer for Table Augmentation"☆12Sep 17, 2025Updated 5 months ago
- ☆15May 11, 2025Updated 9 months ago
- ☆11Feb 19, 2026Updated 2 weeks ago
- A smart distributed crawler that infers navigation models of structured websites, used to cluster pages based on their structure and extr…☆10Aug 17, 2025Updated 6 months ago
- Html article content extractor in Golang.☆12Oct 31, 2022Updated 3 years ago
- Attempt to understand Percy Liang's Dependency-based Compositional Semantics by implementing it in Python☆10Mar 10, 2013Updated 12 years ago
- A truth inference tool in crowdsourcing☆13May 19, 2020Updated 5 years ago
- this is a work about UpliftRec☆10Dec 10, 2024Updated last year
- Tools for controlling full disk encryption☆14Jan 30, 2026Updated last month
- A sample project that demonstrates a concrete use case for Python's multithreading.☆11Oct 6, 2020Updated 5 years ago
- Our paper is titled "NUS-IDS at FinCausal 2021: Dependency Tree in Graph Neural Networks for better Cause-Effect Span Detection".☆13Feb 11, 2022Updated 4 years ago
- This repository contains a series of 4 jupyter notebooks demonstrating how AWS AI Services like Amazon Rekognition, Amazon Transcribe and…☆13Nov 26, 2021Updated 4 years ago
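- Building a small LLM to learn the LLM modeling and training process. A small model based on the DeepSeek-MOE architecture, for personal study, built from scratch with every statement explained☆14Mar 28, 2025Updated 11 months ago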
- Using OpenVINO to speed up inference of PaddleOCR-VL model☆25Updated this week
- ☆31Sep 19, 2025Updated 5 months ago
- FamilyTool benchmark☆12Sep 10, 2025Updated 5 months ago
- Batch-monitors specified QQ message windows and sends new messages to an email inbox☆11Apr 13, 2023Updated 2 years ago
- Widgets JSON for OpenBB Terminal Pro☆15Aug 30, 2024Updated last year
- Soochow University graduate thesis template - Soochow University Thesis TeX Template☆17Feb 27, 2026Updated last week
- Code for running forward and backward versions of GPT2☆10Nov 20, 2021Updated 4 years ago