Distill thinking dataset more compactly and accurately!
☆37Jun 6, 2025Updated 9 months ago
Alternatives and similar repositories for small-datasets
Users that are interested in small-datasets are comparing it to the libraries listed below
Sorting:
- Wonderful Matrices to Build Small Language Models☆44Feb 15, 2025Updated last year
- ☆16Jun 10, 2025Updated 9 months ago
- Understanding the correlation between different LLM benchmarks☆29Jan 11, 2024Updated 2 years ago
- ☆56Nov 6, 2024Updated last year
- 🐜🔧 A minimalistic tool to fine-tune your LLMs☆18Aug 17, 2023Updated 2 years ago
- Open source LLM arena created by the French Government☆65Updated this week
- Using multiple LLMs for ensemble Forecasting☆16Jan 17, 2024Updated 2 years ago
- A framework for few-shot evaluation of autoregressive language models.☆16Aug 23, 2023Updated 2 years ago
- Simple Model Similarities Analysis☆21Feb 3, 2024Updated 2 years ago
- GreenLambert macOS IDA plugin to deobfuscate strings☆14Oct 4, 2021Updated 4 years ago
- [ICLR'24 spotlight] Tool-Augmented Reward Modeling☆54Jun 6, 2025Updated 9 months ago
- 第十九届“挑战杯”揭榜挂帅专项赛华为赛道打榜第一&国家特等奖-拔萝卜的工程队作品仓库 19th Challenge Cup National Grand Prize☆34Jun 20, 2025Updated 9 months ago
- Build use cases with VideoDB☆31Feb 12, 2026Updated last month
- Layout Analysis Dataset with Segmonto (LADaS)☆24Jul 12, 2025Updated 8 months ago
- Automatic evals for LLMs☆583Feb 24, 2026Updated 3 weeks ago
- Document intricacies of using WinDBG to aid Rust project development☆17Nov 19, 2024Updated last year
- 🎓Automatically Update CV Papers Daily using Github Actions (Update Every 12th hours)☆12Updated this week
- ☆32Jan 1, 2024Updated 2 years ago
- ☆14Mar 21, 2025Updated last year
- Simple snippet database☆13Nov 19, 2024Updated last year
- self-adaptive in-context learning☆45May 5, 2023Updated 2 years ago
- A comprehensive overview of Data Distillation and Condensation (DDC). DDC is a data-centric task where a representative (i.e., small but …☆13Dec 1, 2022Updated 3 years ago
- A simple macOS debugger detection trick☆19Apr 7, 2025Updated 11 months ago
- A proof-of-concept to demonstrate randomized execution paths and their impact on call stack signatures — ideal for EDR testing, behavior-…☆24Jan 17, 2026Updated 2 months ago
- [ICDCS 2023] Evaluation and Optimization of Gradient Compression for Distributed Deep Learning☆10Apr 28, 2023Updated 2 years ago
- 2020 AI研习社 金融用户评论分类☆14May 17, 2020Updated 5 years ago
- ☆28Aug 27, 2025Updated 6 months ago
- LobotoMl is a set of scripts and tools to assess production deployments of ML services☆10May 16, 2022Updated 3 years ago
- ☆13Aug 11, 2018Updated 7 years ago
- 基于Pytorch热门深度学习框架 从零开发NLP聊天机器人☆14Sep 13, 2020Updated 5 years ago
- Implementation of VQ-VAE with a GPT-style sampler in the JAX and Haiku ecosystem.☆12Nov 23, 2023Updated 2 years ago
- examples and guides to using Nomic Atlas☆37Apr 18, 2025Updated 11 months ago
- Repository of the RANLP 2023 paper "Exploring the Landscape of Natural Language Processing Research".☆13Oct 20, 2024Updated last year
- Supervised instruction finetuning for LLM with HF trainer and Deepspeed☆36Jul 6, 2023Updated 2 years ago
- [Ongoing Project] Codebase for network quantization study.☆12May 20, 2020Updated 5 years ago
- Backend services for an AI-powered, privacy-first team collaboration platform. Manages secure data, AI processing, and real-time communic…☆18Oct 16, 2025Updated 5 months ago
- ☆13May 9, 2024Updated last year
- ☆11Nov 13, 2024Updated last year
- Binaries for mathematicians☆10Mar 19, 2025Updated last year