cloneofsimo / auto_llm_codebase_analysis
☆26Updated 8 months ago
Related projects ⓘ
Alternatives and complementary repositories for auto_llm_codebase_analysis
- The Benefits of a Concise Chain of Thought on Problem Solving in Large Language Models☆20Updated 9 months ago
- Modified Beam Search with periodical restart☆12Updated 2 months ago
- A public implementation of the ReLoRA pretraining method, built on Lightning-AI's Pytorch Lightning suite.☆33Updated 8 months ago
- ☆41Updated 2 weeks ago
- ☆11Updated last month
- ☆31Updated 10 months ago
- Collection of autoregressive model implementation☆67Updated this week
- Using multiple LLMs for ensemble Forecasting☆16Updated 10 months ago
- ☆27Updated 3 months ago
- [WIP] Transformer to embed Danbooru labelsets☆13Updated 7 months ago
- QLoRA for Masked Language Modeling☆20Updated last year
- This library supports evaluating disparities in generated image quality, diversity, and consistency between geographic regions.☆20Updated 5 months ago
- BH hackathon☆14Updated 7 months ago
- Demonstration that finetuning RoPE model on larger sequences than the pre-trained model adapts the model context limit☆63Updated last year
- Repository containing the SPIN experiments on the DIBT 10k ranked prompts☆23Updated 8 months ago
- Code for the examples presented in the talk "Training a Llama in your backyard: fine-tuning very large models on consumer hardware" given…☆14Updated last year
- LLM reads a paper and produce a working prototype☆36Updated last week
- Anchored Preference Optimization and Contrastive Revisions: Addressing Underspecification in Alignment☆46Updated 2 months ago
- ☆24Updated 5 months ago
- Zeta implementation of a reusable and plug in and play feedforward from the paper "Exponentially Faster Language Modeling"☆15Updated last week
- ☆37Updated last year
- Visual RAG using less than 300 lines of code.☆23Updated 8 months ago
- implementation of https://arxiv.org/pdf/2312.09299☆19Updated 4 months ago
- ☆15Updated 8 months ago
- Using modal.com to process FineWeb-edu data☆19Updated 2 months ago
- QLoRA with Enhanced Multi GPU Support☆36Updated last year
- NeurIPS 2023 - Cappy: Outperforming and Boosting Large Multi-Task LMs with a Small Scorer☆37Updated 7 months ago
- Training hybrid models for dummies.☆15Updated 3 weeks ago
- ☆24Updated last year
- ☆36Updated 3 months ago