marzenakrp / nochaView external linksLinks
☆54Oct 24, 2024Updated last year
Alternatives and similar repositories for nocha
Users that are interested in nocha are comparing it to the libraries listed below
Sorting:
- Modified Beam Search with periodical restart☆12Sep 12, 2024Updated last year
- The HELMET Benchmark☆199Dec 4, 2025Updated 2 months ago
- Exploring limitations of LLM-as-a-judge☆20Aug 17, 2024Updated last year
- JacQues is a Dash-based interactive web application that facilitates real-time chat and document management.☆22Jan 5, 2026Updated last month
- Yet another frontend for LLM, written using .NET and WinUI 3☆10Sep 14, 2025Updated 5 months ago
- REBUS: A Robust Evaluation Benchmark of Understanding Symbols☆13Aug 13, 2024Updated last year
- AI_Powered_Dev_Search_Engine☆12Mar 10, 2024Updated last year
- ☆33Dec 17, 2025Updated last month
- ☆29Dec 2, 2024Updated last year
- Homepage for ProLong (Princeton long-context language models) and paper "How to Train Long-Context Language Models (Effectively)"☆246Sep 12, 2025Updated 5 months ago
- ☆59Sep 24, 2024Updated last year
- Repository for DEMETR: Diagnosing Evaluation Metrics for Translation☆17Nov 29, 2022Updated 3 years ago
- ☆12Feb 21, 2021Updated 4 years ago
- ☆17Apr 7, 2025Updated 10 months ago
- ☆109Aug 21, 2025Updated 5 months ago
- ☆15Jul 1, 2020Updated 5 years ago
- Generate python documentation using LLMs☆71Jun 28, 2024Updated last year
- [ICLR 2025] Monet: Mixture of Monosemantic Experts for Transformers☆75Jun 23, 2025Updated 7 months ago
- Code & data for EMNLP 2020 paper "MOCHA: A Dataset for Training and Evaluating Reading Comprehension Metrics".☆16May 3, 2022Updated 3 years ago
- Running Microsoft's BitNet via Electron, React & Astro☆53Sep 26, 2025Updated 4 months ago
- ☆20Jul 2, 2024Updated last year
- Local Ollama with Qdrant RAG: Embed, index, and enhance models for retrieval-augmented generation. Get started with easy setup for powerf…☆25Mar 27, 2024Updated last year
- ☆24Dec 12, 2025Updated 2 months ago
- NumGLUE: A Suite of Fundamental yet Challenging Mathematical Reasoning Tasks☆20May 10, 2022Updated 3 years ago
- Code for the paper "Function-Space Learning Rates"☆25Jun 3, 2025Updated 8 months ago
- Information Processing Evaluation for Large Language Models☆41Jan 25, 2026Updated 2 weeks ago
- ☆96Nov 6, 2024Updated last year
- Suri: Multi-constraint instruction following for long-form text generation (EMNLP’24)☆27Oct 3, 2025Updated 4 months ago
- ☆56Aug 10, 2024Updated last year
- ☆50May 27, 2024Updated last year
- This repo contains the source code for RULER: What’s the Real Context Size of Your Long-Context Language Models?☆1,450Nov 13, 2025Updated 3 months ago
- [EMNLP 2024 Findings] ProSA: Assessing and Understanding the Prompt Sensitivity of LLMs☆29May 22, 2025Updated 8 months ago
- ☆26Nov 21, 2022Updated 3 years ago
- Framework-Agnostic RL Environments for LLM Fine-Tuning☆42Updated this week
- Locally hosted AI Agent Python Tool To Generate Novel Research Hypothesis + Titles + Abstracts☆30Apr 30, 2025Updated 9 months ago
- Evaluating LLMs with fewer examples☆169Apr 12, 2024Updated last year
- GPU Power and Performance Manager☆66Oct 13, 2024Updated last year
- treemind interprets tree models☆41Jul 23, 2025Updated 6 months ago
- Benchmark evaluating LLMs on their ability to create and resist disinformation. Includes comprehensive testing across major models (Claud…☆31Mar 20, 2025Updated 10 months ago