Score LLM pretraining data with classifiers
☆55Nov 2, 2023Updated 2 years ago
Alternatives and similar repositories for classified
Users that are interested in classified are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Generate textbook-quality synthetic LLM pretraining data☆508Oct 19, 2023Updated 2 years ago
- Convert all of libgen to high quality markdown☆253Dec 13, 2023Updated 2 years ago
- Your fruity companion for transformers☆14May 25, 2022Updated 4 years ago
- ☆14Jul 25, 2023Updated 2 years ago
- Targeted Data Generation with Large Language Models☆19Jun 25, 2024Updated 2 years ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- ☆21Aug 27, 2023Updated 2 years ago
- ☆24May 19, 2024Updated 2 years ago
- Official implementation of 'A Large-Scale Exploration of mu-Transfer'☆32Jun 5, 2025Updated last year
- The simplest, fastest repository for training/finetuning medium-sized GPTs.☆19Feb 27, 2023Updated 3 years ago
- ☆18Mar 20, 2024Updated 2 years ago
- Implementation of 'Vocos: Closing the gap between time-domain and Fourier-based neural vocoders for high-quality audio synthesis', in MLX☆24Oct 30, 2024Updated last year
- A new way to generate large quantities of high quality synthetic data (on par with GPT-4), with better controllability, at a fraction of …☆23Oct 1, 2024Updated last year
- ☆45Oct 13, 2023Updated 2 years ago
- An algorithm that intelligently executes a crypto order over time via Coinbase☆13Oct 26, 2021Updated 4 years ago
- Serverless GPU API endpoints on Runpod - Get Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- An Educational Framework Based on PyTorch for Deep Learning Education and Exploration☆11Dec 24, 2023Updated 2 years ago
- GraphRag vs Embeddings☆16Jul 14, 2024Updated last year
- @jmorganca's ollama.ai demo app on Fly.io☆17Dec 5, 2024Updated last year
- Lightweight open-source perplexity☆60May 6, 2024Updated 2 years ago
- ☆185Oct 13, 2023Updated 2 years ago
- A Citation Manager and Zotero Integration for RemNote! Cite research all within your knowledge base!☆29Jan 22, 2026Updated 5 months ago
- prediction market indexer with semantic search☆37Jan 27, 2026Updated 5 months ago
- ICLR 2025☆30May 21, 2025Updated last year
- A simple AI agent controlling a simulation of a smart home☆13Jun 13, 2024Updated 2 years ago
- GPUs on demand by Runpod - Special Offer Available • AdRun AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
- [ICML 2025] Programming Every Example: Lifting Pre-training Data Quality Like Experts at Scale☆270Jul 8, 2025Updated 11 months ago
- Simple FieldCache based query introspection Solr Search Component - solves the 'red sofa' problem☆11Jan 27, 2025Updated last year
- Simple meeting diarization and speaker id assistant for meetings.☆12Feb 10, 2025Updated last year
- ☆30Jul 22, 2024Updated last year
- Causal Inference for Time Series Data (with CausalML Demo)☆14Jun 11, 2023Updated 3 years ago
- minimalist vector ad☆11Feb 11, 2024Updated 2 years ago
- opennlp-solr-examples☆10Jul 1, 2022Updated 3 years ago
- This is the oficial repository for "Safer-Instruct: Aligning Language Models with Automated Preference Data"☆17Feb 22, 2024Updated 2 years ago
- The Solr Package Directory and Sanctuary☆13May 28, 2026Updated last month
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- Fast LLM Training CodeBase With dynamic strategy choosing [Deepspeed+Megatron+FlashAttention+CudaFusionKernel+Compiler];☆41Jan 4, 2024Updated 2 years ago
- Tools to make language models a bit easier to use☆67Updated this week
- ☆13Apr 5, 2026Updated 2 months ago
- Fantastic Dungeons - 7DRL 2016☆10Mar 12, 2016Updated 10 years ago
- Verbosity control for AI agents☆66May 23, 2024Updated 2 years ago
- A glowfic to epub converter.☆14Apr 11, 2026Updated 2 months ago
- ☆10Jan 10, 2025Updated last year