stunningpixels / lou-eval
Track the progress of LLM context utilisation
☆53Updated 2 months ago
Related projects: ⓘ
- Just a bunch of benchmark logs for different LLMs☆112Updated last month
- ☆48Updated 11 months ago
- Doing simple retrieval from LLM models at various context lengths to measure accuracy☆93Updated 5 months ago
- Mixing Language Models with Self-Verification and Meta-Verification☆96Updated 10 months ago
- A DSPy-based implementation of the tree of thoughts method (Yao et al., 2023) for generating persuasive arguments☆49Updated 3 weeks ago
- Testing speed and accuracy of RAG with, and without Cross Encoder Reranker.☆45Updated 8 months ago
- Evaluating LLMs with CommonGen-Lite☆83Updated 5 months ago
- ☆68Updated 2 months ago
- ☆57Updated last year
- ☆37Updated 9 months ago
- Zeus LLM Trainer is a rewrite of Stanford Alpaca aiming to be the trainer for all Large Language Models☆67Updated last year
- A framework for evaluating function calls made by LLMs☆34Updated last month
- auto fine tune of models with synthetic data☆71Updated 7 months ago
- Using multiple LLMs for ensemble Forecasting☆17Updated 8 months ago
- Writing Blog Posts with Generative Feedback Loops!☆41Updated 6 months ago
- ☆22Updated last year
- an implementation of Self-Extend, to expand the context window via grouped attention☆117Updated 8 months ago
- Functional Benchmarks and the Reasoning Gap☆74Updated last month
- Camel-Coder: Collaborative task completion with multiple agents. Role-based prompts, intervention mechanism, and thoughtful suggestions☆33Updated last year
- ☆83Updated last year
- Comprehensive analysis of difference in performance of QLora, Lora, and Full Finetunes.☆81Updated last year
- Official homepage for "Self-Harmonized Chain of Thought"☆45Updated this week
- Automating enterprise workflows with multimodal agents☆83Updated last month
- Collection of Tree of Thoughts prompting techniques I've found useful to start with, then stylize, then iterate☆69Updated 11 months ago
- Leveraging DSPy for AI-driven task understanding and solution generation, the Self-Discover Framework automates problem-solving through r…☆53Updated 2 months ago
- Scrape and export data from the Open LLM Leaderboard.☆38Updated 2 weeks ago
- ☆85Updated 7 months ago
- ☆75Updated 7 months ago
- Collection of recipes aiding Gen AI model development☆78Updated 2 weeks ago
- Comparing retrieval abilities from GPT4-Turbo and a RAG system on a toy example for various context lengths☆35Updated 9 months ago