Zyphra/Zyda_processing

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/Zyphra/Zyda_processing)

Zyphra / Zyda_processing

☆44

Alternatives and similar repositories for Zyda_processing

Users that are interested in Zyda_processing are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

sekstini / basedxl
View on GitHub
☆18Mar 18, 2024Updated 2 years ago
ManifoldRG / NEKO_Archive
View on GitHub
The NEKO Project is an open source effort to build a model of equivalent scale and capability as that reported in DeepMind’s 2022 Paper, …
☆10Sep 2, 2023Updated 2 years ago
furlat / OpenBugger
View on GitHub
Code to create bugged python scripts for OpenAssistant Training, maintained by https://twitter.com/Cyndesama
☆24Jul 23, 2023Updated 2 years ago
er537 / whisper_interpretability
View on GitHub
A repo to do interpretability of pre-trained acoustic models
☆15Oct 15, 2023Updated 2 years ago
lixilinx / Fully-Trainable-SSM
View on GitHub
A fully trainable state space model (SSM)
☆16Mar 18, 2025Updated last year
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
JinjieNi / MixEval
View on GitHub
The official evaluation suite and dynamic data release for MixEval.
☆254Nov 10, 2024Updated last year
KhoomeiK / complexity-scaling
View on GitHub
gzip Predicts Data-dependent Scaling Laws
☆35May 28, 2024Updated 2 years ago
renll / SparseLT
View on GitHub
[EMNLP 2022] Language Model Pre-Training with Sparse Latent Typing
☆14Feb 10, 2023Updated 3 years ago
mohmdelsayed / HesScale
View on GitHub
Scalable Computation of Hessian Diagonals
☆14Jun 2, 2024Updated 2 years ago
mlfoundations / scaling
View on GitHub
Language models scale reliably with over-training and on downstream tasks
☆102Apr 2, 2024Updated 2 years ago
aflah02 / TokenSmith
View on GitHub
A comprehensive toolkit for streamlining data editing, search, and inspection for large-scale language model training and interpretabilit…
☆21Oct 30, 2025Updated 8 months ago
sgl-project / whl
View on GitHub
SGLang Kernel Wheel Index
☆24Updated this week
epfml / schedules-and-scaling
View on GitHub
Code for NeurIPS 2024 Spotlight: "Scaling Laws and Compute-Optimal Training Beyond Fixed Training Durations"
☆93Oct 30, 2024Updated last year
goodevening13 / aquakv
View on GitHub
☆21Apr 27, 2026Updated 2 months ago
Managed Database hosting by DigitalOcean • Ad
PostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
thecharlieblake / lovely-llama
View on GitHub
An implementation of the Llama architecture, to instruct and delight
☆21May 31, 2025Updated last year
SynthLabsAI / big-math
View on GitHub
A Large-Scale, High-Quality Math Dataset for Reinforcement Learning in Language Models
☆74Feb 25, 2025Updated last year
kaloureyes3 / v4-clients
View on GitHub
☆10Apr 5, 2024Updated 2 years ago
drarijitdas / Natural-GaLore
View on GitHub
An extention to the GaLore paper, to perform Natural Gradient Descent in low rank subspace
☆19Oct 21, 2024Updated last year
Unicorn-Commander / Center-Deep
View on GitHub
Privacy-focused metasearch engine with one-click setup, beautiful admin panel, and AI integration. Fork of SearXNG.
☆20Sep 7, 2025Updated 10 months ago
haileyschoelkopf / triton-index
View on GitHub
See https://github.com/cuda-mode/triton-index/ instead!
☆11May 8, 2024Updated 2 years ago
GallagherCommaJack / modulax
View on GitHub
☆18Aug 24, 2024Updated last year
irregular-rhomboid / EAI-Math-Reading-Group
View on GitHub
Resources from the EleutherAI Math Reading Group
☆55Feb 28, 2025Updated last year
jzhang38 / EasyContext
View on GitHub
Memory optimization and training recipes to extrapolate language models' context length to 1 million tokens, with minimal hardware.
☆759Sep 27, 2024Updated last year
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
uynaes / RankingAwareCLIP
View on GitHub
[ICLR'25] Official repository of paper: Ranking-aware adapter for text-driven image ordering with CLIP
☆16Apr 17, 2025Updated last year
cohaereo / minecrab
View on GitHub
Minecraft client written in Rust and WGPU
☆11Apr 16, 2023Updated 3 years ago
LLM360 / TxT360
View on GitHub
☆25Dec 18, 2024Updated last year
formll / resolving-scaling-law-discrepancies
View on GitHub
☆19Nov 4, 2025Updated 8 months ago
Farseer-Scaling-Law / Farseer
View on GitHub
☆21Jun 12, 2025Updated last year
oscar-project / ungoliant
View on GitHub
The pipeline for the OSCAR corpus
☆178Nov 9, 2025Updated 8 months ago
imoneoi / multipack
View on GitHub
Multipack distributed sampler for fast padding-free training of LLMs
☆207Aug 10, 2024Updated last year
nikhilchandak / answer-matching
View on GitHub
Code for 'Answer Matching Outperforms Multiple Choice for Language Model Evaluation' paper
☆18Jul 4, 2025Updated last year
gallen881 / Physics_Master
View on GitHub
Physics Master is a model fine-tuned from llama3-8B-Instruct. It can answer your physics question!
☆16Aug 24, 2024Updated last year
AI Agents on DigitalOcean Gradient AI Platform • Ad
Build production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
kyo-takano / chinchilla
View on GitHub
A toolkit for scaling law research ⚖
☆68Jan 27, 2025Updated last year
Birch-san / booru-embed
View on GitHub
[WIP] Transformer to embed Danbooru labelsets
☆13Mar 31, 2024Updated 2 years ago
yxuansu / Contrastive_Search_Is_What_You_Need
View on GitHub
[TMLR'23] Contrastive Search Is What You Need For Neural Text Generation
☆122Mar 5, 2023Updated 3 years ago
Acly / dlimgedit
View on GitHub
A C++ library for image painting and editing workflows which make use of deep learning.
☆11Sep 3, 2025Updated 10 months ago
Zhaoyi-Li21 / creme
View on GitHub
[ACL 2024] "Understanding and Patching Compositional Reasoning in LLMs"
☆14Aug 28, 2024Updated last year
ASE-REEF / REEF-data
View on GitHub
☆16Aug 16, 2023Updated 2 years ago
fbarez / neuroplasticity
View on GitHub
☆14Mar 31, 2024Updated 2 years ago