☆54Oct 24, 2024Updated last year
Alternatives and similar repositories for nocha
Users that are interested in nocha are comparing it to the libraries listed below
Sorting:
- Modified Beam Search with periodical restart☆12Sep 12, 2024Updated last year
- The HELMET Benchmark☆203Feb 26, 2026Updated last week
- Exploring limitations of LLM-as-a-judge☆20Aug 17, 2024Updated last year
- JacQues is a Dash-based interactive web application that facilitates real-time chat and document management.☆22Jan 5, 2026Updated 2 months ago
- Yet another frontend for LLM, written using .NET and WinUI 3☆10Sep 14, 2025Updated 5 months ago
- AI_Powered_Dev_Search_Engine☆12Mar 10, 2024Updated last year
- REBUS: A Robust Evaluation Benchmark of Understanding Symbols☆13Aug 13, 2024Updated last year
- ☆33Dec 17, 2025Updated 2 months ago
- ☆29Dec 2, 2024Updated last year
- Homepage for ProLong (Princeton long-context language models) and paper "How to Train Long-Context Language Models (Effectively)"☆247Sep 12, 2025Updated 5 months ago
- ☆60Sep 24, 2024Updated last year
- Official demo repository for our ACL 2019 long paper "Generating Question-Answer Hierarchies".☆20Feb 13, 2026Updated 3 weeks ago
- ☆17Apr 7, 2025Updated 11 months ago
- Repository for DEMETR: Diagnosing Evaluation Metrics for Translation☆17Nov 29, 2022Updated 3 years ago
- ☆110Aug 21, 2025Updated 6 months ago
- Generate python documentation using LLMs☆71Jun 28, 2024Updated last year
- [ICLR 2025] Monet: Mixture of Monosemantic Experts for Transformers☆75Jun 23, 2025Updated 8 months ago
- Code & data for EMNLP 2020 paper "MOCHA: A Dataset for Training and Evaluating Reading Comprehension Metrics".☆16May 3, 2022Updated 3 years ago
- BABILong is a benchmark for LLM evaluation using the needle-in-a-haystack approach.☆241Sep 2, 2025Updated 6 months ago
- Running Microsoft's BitNet via Electron, React & Astro☆53Sep 26, 2025Updated 5 months ago
- Local Ollama with Qdrant RAG: Embed, index, and enhance models for retrieval-augmented generation. Get started with easy setup for powerf…☆25Mar 27, 2024Updated last year
- GPT* - Training faster small transformers using ALiBi, Parallel Residual Connections and more!☆21Oct 29, 2022Updated 3 years ago
- NumGLUE: A Suite of Fundamental yet Challenging Mathematical Reasoning Tasks☆20May 10, 2022Updated 3 years ago
- ☆25Dec 12, 2025Updated 2 months ago
- ☆96Nov 6, 2024Updated last year
- ☆55Mar 27, 2023Updated 2 years ago
- This project is a reverse-engineered version of Figma's tone changer. It uses Groq's Llama-3-8b for high-speed inference and to adjust th…☆90Jul 26, 2024Updated last year
- ☆22Sep 19, 2023Updated 2 years ago
- Suri: Multi-constraint instruction following for long-form text generation (EMNLP’24)☆27Oct 3, 2025Updated 5 months ago
- Information Processing Evaluation for Large Language Models☆49Jan 25, 2026Updated last month
- Network for procedural editing of text with LLMs☆23Dec 6, 2025Updated 3 months ago
- ☆49May 27, 2024Updated last year
- This repo contains the source code for RULER: What’s the Real Context Size of Your Long-Context Language Models?☆1,468Nov 13, 2025Updated 3 months ago
- Framework-Agnostic RL Environments for LLM Fine-Tuning☆44Feb 28, 2026Updated last week
- ☆26Nov 21, 2022Updated 3 years ago
- [EMNLP 2024 Findings] ProSA: Assessing and Understanding the Prompt Sensitivity of LLMs☆29May 22, 2025Updated 9 months ago
- Locally hosted AI Agent Python Tool To Generate Novel Research Hypothesis + Titles + Abstracts☆30Apr 30, 2025Updated 10 months ago
- Evaluating LLMs with fewer examples☆169Apr 12, 2024Updated last year
- ☆31Jun 12, 2024Updated last year