VikParuchuri / classifiedView external linksLinks
Score LLM pretraining data with classifiers
☆55Nov 2, 2023Updated 2 years ago
Alternatives and similar repositories for classified
Users that are interested in classified are comparing it to the libraries listed below
Sorting:
- Generate textbook-quality synthetic LLM pretraining data☆509Oct 19, 2023Updated 2 years ago
- Convert all of libgen to high quality markdown☆254Dec 13, 2023Updated 2 years ago
- ☆14Jul 25, 2023Updated 2 years ago
- ☆22Aug 27, 2023Updated 2 years ago
- ☆24May 19, 2024Updated last year
- A new way to generate large quantities of high quality synthetic data (on par with GPT-4), with better controllability, at a fraction of …☆23Oct 1, 2024Updated last year
- [ICLR 2024 Oral] Improving Convergence and Generalization Using Parameter Symmetries☆31May 29, 2024Updated last year
- tiny_fnc_engine is a minimal python library that provides a flexible engine for calling functions extracted from a LLM.☆38Sep 11, 2024Updated last year
- Fast LLM Training CodeBase With dynamic strategy choosing [Deepspeed+Megatron+FlashAttention+CudaFusionKernel+Compiler];☆40Jan 4, 2024Updated 2 years ago
- This repository is a react component to edit images at mobile phone by Konvajs.☆11Feb 2, 2023Updated 3 years ago
- ☆10Jan 22, 2021Updated 5 years ago
- ☆10Jan 21, 2026Updated 3 weeks ago
- A part of the course Mobile Application Development☆13Nov 30, 2021Updated 4 years ago
- GestureX is an OpenCV-based hand motion sensing system for intuitive, efficient user control.This project aims to investigate the potenti…☆16Jun 29, 2024Updated last year
- 哔哩哔哩-API收集整理【不断更新中....】☆10Apr 25, 2025Updated 9 months ago
- Soil Sample and Soil Profile Datasets: an Open Compilation☆21Jun 18, 2025Updated 7 months ago
- Panorama_498全景图像数据集☆14Apr 8, 2022Updated 3 years ago
- DiagnoSys is a comprehensive web application that provides advanced detection and analysis for various health conditions. This project le…☆14May 6, 2024Updated last year
- An Educational Framework Based on PyTorch for Deep Learning Education and Exploration☆10Dec 24, 2023Updated 2 years ago
- Get started with Automated Machine Learning (AutoML) and Machine Learning Operations (MLOps) in Azure Machine Learning☆13Nov 21, 2023Updated 2 years ago
- ☆13Aug 10, 2023Updated 2 years ago
- ☆13Nov 9, 2025Updated 3 months ago
- Agent to integrate Webdriver.io with ReportPortal.☆11Updated this week
- VenomPred 2.0 API☆11Feb 4, 2026Updated last week
- MLOps Pipeline for Amazon Forecast written in AWS CDK☆11Apr 10, 2025Updated 10 months ago
- ☆84Jan 15, 2024Updated 2 years ago
- AI Chat app written in GPUI and GPUI Component☆21Dec 10, 2025Updated 2 months ago
- ☆10Nov 16, 2023Updated 2 years ago
- This repository shows how to implement a basic model for multimodal entailment.☆10Aug 17, 2021Updated 4 years ago
- A simple hello world Python application.☆11Jun 14, 2023Updated 2 years ago
- HRA ASCT+B Reporter☆10Nov 20, 2025Updated 2 months ago
- Handlebars helper, alternative to built-in partials. Similar to handlebars-helper-partial, but this helper will allow wildcard (glob) pat…☆16Nov 10, 2014Updated 11 years ago
- Sample AutoML notebooks evolving towards MLOps☆11Feb 15, 2022Updated 4 years ago
- AI-WordCards is an innovative project that leverages the power of GPT, StableDiffusion, and DALL-E3 to create educational and engaging wo…☆10May 16, 2024Updated last year
- Real-time multi-language unit test generation tool via LSP☆31Updated this week
- [ICML 2025] Programming Every Example: Lifting Pre-training Data Quality Like Experts at Scale☆266Jul 8, 2025Updated 7 months ago
- Just large language models. Hackable, with as little abstraction as possible. Done for my own purposes, feel free to rip.☆44Sep 6, 2023Updated 2 years ago
- ☆11Jul 30, 2024Updated last year
- ☆14Jan 31, 2023Updated 3 years ago