sambanova/generative_data_prep

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/sambanova/generative_data_prep)

sambanova / generative_data_prep

☆67

Alternatives and similar repositories for generative_data_prep

Users that are interested in generative_data_prep are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

sambanova / tutorials
View on GitHub
☆13Apr 30, 2024Updated 2 years ago
sambanova / toolbench
View on GitHub
ToolBench, an evaluation suite for LLM tool manipulation capabilities.
☆180Feb 28, 2024Updated 2 years ago
UniversalDependencies / UD_Thai-PUD
View on GitHub
Parallel Universal Dependencies.
☆15May 6, 2026Updated 2 months ago
KoreaMGLEE / Concept-based-curriculum-masking
View on GitHub
Efficient Pre-training of Masked Language Model via Concept-based Curriculum Masking
☆13Feb 5, 2023Updated 3 years ago
betweentwomidnights / gary4live
View on GitHub
musicgen, melodyflow, stable-audio-open-small inside ableton.
☆18Jul 6, 2025Updated last year
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
kyegomez / autogpt-tot
View on GitHub
Simple Autogpt with tree of thoughts
☆14May 25, 2023Updated 3 years ago
sneakers-the-rat / dissertation
View on GitHub
my dissertation!
☆12Sep 6, 2022Updated 3 years ago
aviaefrat / lmentry
View on GitHub
☆15Nov 22, 2023Updated 2 years ago
allenai / easy-to-hard-generalization
View on GitHub
Code for the arXiv preprint "The Unreasonable Effectiveness of Easy Training Data"
☆48Jan 17, 2024Updated 2 years ago
The-Swarm-Corporation / swarms-core
View on GitHub
Multi-threading, Concurrency, Asynchrony, and various Execution Methods implemented in a Rust backend for bleeding edge performance.
☆20Nov 11, 2024Updated last year
furiosa-ai / EfficientRollout
View on GitHub
EfficientRollout: System-Aware Self-Speculative Decoding for RL Rollouts
☆16Jun 24, 2026Updated 3 weeks ago
kyegomez / Tiktokx
View on GitHub
Tiktok is an advanced multimedia recommender system that fuses the generative modality-aware collaborative self-augmentation and contrast…
☆14Aug 18, 2023Updated 2 years ago
andysingal / Audio-LLM
View on GitHub
The purpose of this repository is to discuss on Audio transformers
☆14Apr 16, 2026Updated 3 months ago
Xiefeng69 / Awesome-Entity-Alignment
View on GitHub
Awesome Entity Alignment is a collection of EA techniques, including papers, codes, and datasets.
☆11Oct 27, 2022Updated 3 years ago
GPU virtual machines on DigitalOcean Gradient AI • Ad
Get to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
nlp-uoregon / ullme
View on GitHub
☆20Apr 8, 2025Updated last year
causalNLP / amr_llm
View on GitHub
This repo explores how AMR to address tasks difficult for LLMs
☆13Jan 15, 2024Updated 2 years ago
seanchatmangpt / dspygen
View on GitHub
A Ruby on Rails style framework for the DSPy (Demonstrate, Search, Predict) project for Language Models like GPT, BERT, and LLama.
☆135Jun 16, 2026Updated last month
sevren / DirectInput-Game-Controller-Python
View on GitHub
Direct X game controller server/client written in Python
☆10Jul 10, 2018Updated 8 years ago
VikParuchuri / textbook_quality
View on GitHub
Generate textbook-quality synthetic LLM pretraining data
☆508Oct 19, 2023Updated 2 years ago
Agora-Lab-AI / The-Distiller
View on GitHub
Generate High Quality textual or multi-modal datasets with Agents
☆18Jun 7, 2023Updated 3 years ago
Alignment-Lab-AI / datagen
View on GitHub
a pipeline for using api calls to agnostically convert unstructured data into structured training data
☆32Sep 22, 2024Updated last year
UKPLab / emnlp2024-code-prompting
View on GitHub
Code Prompting Elicits Conditional Reasoning Abilities in Text+Code LLMs. EMNLP 2024
☆27Nov 13, 2024Updated last year
cohere-samples / cohere-slack-starter-app
View on GitHub
Co:here-powered Slack App Starter Project
☆13Apr 1, 2022Updated 4 years ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
kyegomez / Exa
View on GitHub
Unleash the full potential of exascale LLMs on consumer-class GPUs, proven by extensive benchmarks, with no long-term adjustments and min…
☆27Nov 11, 2024Updated last year
msclar / symbolictom
View on GitHub
☆23Nov 8, 2023Updated 2 years ago
quinte22 / bumblebee
View on GitHub
bumble bee transformer
☆14Apr 19, 2021Updated 5 years ago
kyegomez / Finetuning-Suite
View on GitHub
Finetune any model on HF in less than 30 seconds
☆57Jul 5, 2026Updated 2 weeks ago
kyegomez / ProfitPilot
View on GitHub
ProfitPilot closes deals for you effortlessly 24/7, just provide a list of customer and ProfitPilot will reach out on your behalf and clo…
☆21Sep 7, 2023Updated 2 years ago
sudoish / openai_cli
View on GitHub
An openAI CLI built in rust
☆10Dec 28, 2022Updated 3 years ago
jmanhype / mcp-flux-studio
View on GitHub
A Model Context Protocol server for Flux image generation, providing tools for image generation, manipulation, and control
☆25Mar 25, 2026Updated 3 months ago
AdiCohen501 / ExNet-BF-PF
View on GitHub
☆15Jul 23, 2024Updated last year
Josiah-tan / plover-vim-tutor
View on GitHub
tutor + help for learning plover-vim
☆17Jul 7, 2026Updated 2 weeks ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
ExpressAI / reStructured-Pretraining
View on GitHub
reStructured Pre-training
☆99Dec 22, 2022Updated 3 years ago
davispolito / Phase-Vocoder
View on GitHub
☆13Apr 10, 2020Updated 6 years ago
ostrovsky / TouchOSC_Sequencer_FH-2
View on GitHub
A TouchOSC template for controlling the sequencers in Expert Sleepers FH-2 eurorack module
☆11Jan 5, 2024Updated 2 years ago
latynt / ans
View on GitHub
Arabic News Stance Corpus
☆11Feb 5, 2021Updated 5 years ago
Fermain / -mollify
View on GitHub
☆10Feb 29, 2024Updated 2 years ago
kyegomez / dev-swarm
View on GitHub
A swarm of LLM agents that will help you test, document, and productionize your code!
☆19Jul 13, 2026Updated last week
yohasebe / monadic-chat-cli
View on GitHub
Highly configurable CLI app for OpenAI's chat/text completion API
☆11Nov 8, 2024Updated last year