LLM360 / crystalcoder-data-prep
Data preparation code for CrystalCoder 7B LLM
☆42 · Updated 6 months ago
Related projects
Alternatives and complementary repositories for crystalcoder-data-prep
- Pre-training code for CrystalCoder 7B LLM ☆53 · Updated 6 months ago
- Data preparation code for Amber 7B LLM ☆82 · Updated 6 months ago
- Open Implementations of LLM Analyses ☆94 · Updated last month
- Resources for our paper: "EvoAgent: Towards Automatic Multi-Agent Generation via Evolutionary Algorithms" ☆75 · Updated last month
- ☆22 · Updated 2 months ago
- ☆35 · Updated last year
- Official implementation for 'Extending LLMs’ Context Window with 100 Samples' ☆74 · Updated 10 months ago
- Code for the paper "Harnessing Webpage UIs for Text-Rich Visual Understanding" ☆38 · Updated last month
- ☆40 · Updated 2 weeks ago
- Implementation of "LM-Infinite: Simple On-the-Fly Length Generalization for Large Language Models" ☆42 · Updated last week
- Small and Efficient Mathematical Reasoning LLMs ☆71 · Updated 9 months ago
- ☆27 · Updated 5 months ago
- ☆46 · Updated last week
- ☆41 · Updated 2 months ago
- ☆53 · Updated 5 months ago
- A public implementation of the ReLoRA pretraining method, built on Lightning-AI's PyTorch Lightning suite. ☆33 · Updated 8 months ago
- A repository for research on medium-sized language models. ☆74 · Updated 5 months ago
- Simple examples using Argilla tools to build AI ☆40 · Updated this week
- NeurIPS 2023 - Cappy: Outperforming and Boosting Large Multi-Task LMs with a Small Scorer ☆37 · Updated 7 months ago
- FuseAI Project ☆76 · Updated 3 months ago
- ☆37 · Updated 3 weeks ago
- ☆59 · Updated last month
- Fast approximate inference on a single GPU with sparsity aware offloading ☆38 · Updated 10 months ago
- Repository containing the SPIN experiments on the DIBT 10k ranked prompts ☆23 · Updated 8 months ago
- Script for processing OpenAI's PRM800K process supervision dataset into an Alpaca-style instruction-response format ☆27 · Updated last year
- ☆22 · Updated 3 months ago
- This is the official repository for Inheritune. ☆105 · Updated last month
- Using multiple LLMs for ensemble forecasting ☆16 · Updated 10 months ago
- My implementation of "Algorithm of Thoughts: Enhancing Exploration of Ideas in Large Language Models" ☆92 · Updated last year