thu-pacman/Kaiyuan-Spark

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/thu-pacman/Kaiyuan-Spark)

thu-pacman / Kaiyuan-Spark

A scalable data preprocessing framework built on PySpark for LLM training

☆24

Alternatives and similar repositories for Kaiyuan-Spark

Users that are interested in Kaiyuan-Spark are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

meituan-longcat / MineExplorer
View on GitHub
Reproduction code for paper "MineExplorer: Evaluating Open-World Exploration of MLLM Agents in Minecraft"
☆18Jun 12, 2026Updated last month
AstrBotDevs / astr-plugin-reviewer
View on GitHub
Use AI to automate preliminary review plugin
☆20Apr 2, 2026Updated 3 months ago
sascommunities / sas-viya-workbench-examples
View on GitHub
SAS and Python code examples for use with SAS Viya Workbench.
☆11Jun 26, 2024Updated 2 years ago
yeyupiaoling / PaddlePaddleCourse
View on GitHub
《PaddlePaddle从入门到实战》源码
☆25Mar 5, 2021Updated 5 years ago
markduan / a-philosophy-of-software-design-skills
View on GitHub
A skill based on A Philosophy of Software Design
☆16Feb 5, 2026Updated 5 months ago
Simple, predictable pricing with DigitalOcean hosting • Ad
Always know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
IDR / idr-notebooks
View on GitHub
Jupyter Notebooks for the Image Data Resource
☆19Jun 25, 2026Updated last month
1521620063 / VomiShield
View on GitHub
VomiShield | 轻量桌面视觉锚点工具，基于 Tauri 2 实现透明屏幕Overlay，提供准星、辅助线等多种视觉参考，缓解3D游戏晕动症与眩晕不适；不注入游戏进程，无反作弊风险，支持参数自定义与全局快捷键。
☆21Jul 3, 2026Updated 3 weeks ago
advent259141 / astrbot_plugin_mc_manager
View on GitHub
让你的bot成为mc服务器op，在聊天软件上与bot对话轻松管理服务器
☆15Jan 18, 2026Updated 6 months ago
tom-urkin / Round-Robin
View on GitHub
This repository contains a SystemVerilog implementation of a parametrized Round Robin arbiter with three instantiation options
☆13Jan 28, 2024Updated 2 years ago
Yan98 / EGN
View on GitHub
Exemplar Guided Deep Neural Network for Spatial Transcriptomics Analysis of Gene Expression Prediction
☆20Oct 20, 2023Updated 2 years ago
Zhaoyilunnn / qdao
View on GitHub
Full State Quantum Circuit Simulation Beyond Memory Limit
☆16Aug 5, 2024Updated last year
Toby-Shi-cloud / SysY-Compiler-2023
View on GitHub
BUAA Compiler Course Project 2023 by Toby Shi.
☆13Aug 20, 2024Updated last year
VitoVan / v2ex-universe
View on GitHub
Yet Another Way to Explore
☆11May 21, 2023Updated 3 years ago
BUAA-CI-LAB / GNN-Feature-Decomposition
View on GitHub
Using Feature Decomposition method to accelerate GNN inference
☆13Sep 27, 2021Updated 4 years ago
End-to-end encrypted email - Proton Mail • Ad
Special offer: 40% Off Yearly / 80% Off First Month. All Proton services are open source and independently audited for security.
carpenter-singh-lab / 2023_Cimini_NatureProtocols
View on GitHub
Optimizing the Cell Painting assay for image-based profiling
☆21Aug 11, 2025Updated 11 months ago
Tanggling / BOOMCore
View on GitHub
This is a project created and completed by team BOOM(Beihang OO masters).This is a superscalar processor with a 13-stage out-of-order dua…
☆18Sep 29, 2024Updated last year
aws-samples / multiagent-collab-scenario-benchmark
View on GitHub
Benchmarking data and script used for LLM multi-agent collaboration systems from AWS Bedrock Agents Science team.
☆18Dec 10, 2024Updated last year
eric-haibin-lin / verl-data
View on GitHub
☆14May 12, 2025Updated last year
zhangtianhong-1998 / Cuda_learn
View on GitHub
这是一个从零学习CUDA课程
☆13Nov 3, 2024Updated last year
TuringEnterprises / SWE-Bench-plus-plus
View on GitHub
SWE-Bench-plus-plus
☆25Feb 5, 2026Updated 5 months ago
aldebran97 / three_body
View on GitHub
三体运动模拟和可视化(three-body motion simulation and visualization)
☆11Sep 20, 2021Updated 4 years ago
ModelTC / awesome-lm-system
View on GitHub
Summary of system papers/frameworks/codes/tools on training or serving large model
☆57Dec 17, 2023Updated 2 years ago
canbozaci / Cache
View on GitHub
L1 Data, L1 Instruction and L2 Unified Cache Design
☆16May 26, 2026Updated 2 months ago
Managed Kubernetes at scale on DigitalOcean • Ad
DigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
pulp-platform / fpu_div_sqrt_mvp
View on GitHub
[UNRELEASED] FP div/sqrt unit for transprecision
☆27Jul 16, 2026Updated last week
wangguojim / LargeScale
View on GitHub
☆19May 11, 2024Updated 2 years ago
hrushikeshrv / docxlatex
View on GitHub
A python library for extracting equations, text, and images from .docx files
☆21Dec 7, 2025Updated 7 months ago
stdrc / hakimi
View on GitHub
☆15Feb 5, 2026Updated 5 months ago
LunarMeal / astrbot_plugin_memes
View on GitHub
一个可以让机器人随机发表情包的插件
☆14May 9, 2026Updated 2 months ago
flagos-ai / DeepSeek-V4-FlagOS
View on GitHub
☆16Jul 18, 2026Updated last week
Essential-AI / reflection
View on GitHub
☆51Apr 11, 2025Updated last year
SWE-rebench / SWE-bench-fork
View on GitHub
Fork to run instances from SWE-rebench
☆31Jun 3, 2026Updated last month
fsoft72 / claude-desktop-to-appimage
View on GitHub
A script to create AppImage from the Windows version of Claude Desktop
☆18Jun 16, 2025Updated last year
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
qhjqhj00 / MetaAgent
View on GitHub
MetaAgent: Toward Self-Evolving Agent via Tool Meta-Learning
☆47Sep 3, 2025Updated 10 months ago
xldrx / tictac
View on GitHub
☆22Jun 5, 2019Updated 7 years ago
MC-Linker / MC-Linker
View on GitHub
Link your Minecraft server with Discord!
☆12Jul 9, 2026Updated 2 weeks ago
SigmaQuan / Awesome-Chinese-Corpus-Datasets-and-Models
View on GitHub
Awesome Chinese Corpus Datasets and Models.
☆19Oct 28, 2019Updated 6 years ago
HK-SHAO / HK-SHAO.github.io
View on GitHub
shao fun's website.
☆18Mar 12, 2026Updated 4 months ago
v4fx / Genshin-beta-stuff
View on GitHub
Download link and other stuffs for genshin impact beta
☆15Sep 11, 2022Updated 3 years ago
advent259141 / astrbot_plugin_InitiativeDialogue
View on GitHub
☆12May 5, 2025Updated last year