Glavin001 / Data2AITextbook
Automatically convert unstructured data into a high-quality 'textbook' format, optimized for fine-tuning Large Language Models (LLMs)
☆26 · Updated last year
Alternatives and similar repositories for Data2AITextbook:
Users who are interested in Data2AITextbook are comparing it to the libraries listed below.
- Data preparation code for CrystalCoder 7B LLM ☆44 · Updated 10 months ago
- ☆48 · Updated 4 months ago
- ☆20 · Updated last year
- ☆40 · Updated last month
- NeurIPS 2023 - Cappy: Outperforming and Boosting Large Multi-Task LMs with a Small Scorer ☆41 · Updated last year
- Official repo for the paper PHUDGE: Phi-3 as Scalable Judge. Evaluate your LLMs with or without custom rubric, reference answer, absolute… ☆49 · Updated 8 months ago
- Optimizing Causal LMs through GRPO with weighted reward functions and automated hyperparameter tuning using Optuna ☆39 · Updated 2 months ago
- Implementation of "SelfCite: Self-Supervised Alignment for Context Attribution in Large Language Models" ☆27 · Updated last month
- Official code for ACL 2023 (short, findings) paper "Recursion of Thought: A Divide and Conquer Approach to Multi-Context Reasoning with L… ☆43 · Updated last year
- SCREWS: A Modular Framework for Reasoning with Revisions ☆27 · Updated last year
- One Line To Build Zero-Data Classifiers in Minutes ☆36 · Updated 6 months ago
- GPT-4 Level Conversational QA Trained In a Few Hours ☆59 · Updated 7 months ago
- Universal text classifier for generative models ☆22 · Updated 8 months ago
- ☆48 · Updated last year
- ☆32 · Updated 9 months ago
- entropix style sampling + GUI ☆25 · Updated 5 months ago
- Exploration using DSPy to optimize modules to maximize performance on the OpenToM dataset ☆16 · Updated last year
- Fast approximate inference on a single GPU with sparsity aware offloading ☆38 · Updated last year
- Using open source LLMs to build synthetic datasets for direct preference optimization ☆59 · Updated last year
- Lightweight demos for finetuning LLMs. Powered by 🤗 transformers and open-source datasets. ☆73 · Updated 5 months ago
- Simple GRPO scripts and configurations. ☆58 · Updated last month
- LLMs as Collaboratively Edited Knowledge Bases ☆45 · Updated last year
- Writing Blog Posts with Generative Feedback Loops! ☆47 · Updated last year
- A public implementation of the ReLoRA pretraining method, built on Lightning-AI's Pytorch Lightning suite. ☆33 · Updated last year
- Code and data for "StructLM: Towards Building Generalist Models for Structured Knowledge Grounding" (COLM 2024) ☆76 · Updated 5 months ago
- ☆45 · Updated 6 months ago
- ☆49 · Updated 10 months ago