☆54Oct 24, 2024Updated last year
Alternatives and similar repositories for nocha
Users that are interested in nocha are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆12Sep 1, 2021Updated 4 years ago
- ☆35Dec 17, 2025Updated 4 months ago
- ☆29Dec 2, 2024Updated last year
- Modified Beam Search with periodical restart☆12Sep 12, 2024Updated last year
- The HELMET Benchmark☆214Apr 17, 2026Updated 2 weeks ago
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- Homepage for ProLong (Princeton long-context language models) and paper "How to Train Long-Context Language Models (Effectively)"☆250Sep 12, 2025Updated 7 months ago
- ☆22Sep 19, 2023Updated 2 years ago
- Repository for DEMETR: Diagnosing Evaluation Metrics for Translation☆17Nov 29, 2022Updated 3 years ago
- Yet another frontend for LLM, written using .NET and WinUI 3☆11Sep 14, 2025Updated 7 months ago
- ☆25Dec 12, 2025Updated 4 months ago
- GPT* - Training faster small transformers using ALiBi, Parallel Residual Connections and more!☆20Oct 29, 2022Updated 3 years ago
- Official demo repository for our ACL 2019 long paper "Generating Question-Answer Hierarchies".☆20Feb 13, 2026Updated 2 months ago
- Exploring limitations of LLM-as-a-judge☆20Aug 17, 2024Updated last year
- [EMNLP2025] Remedy: Learning Machine Translation Evaluation from Human Preferences with Reward Modeling☆15Nov 20, 2025Updated 5 months ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- [ICLR 2025] Monet: Mixture of Monosemantic Experts for Transformers☆78Jun 23, 2025Updated 10 months ago
- an auto-sleeping and -waking framework around llama.cpp☆12Feb 8, 2025Updated last year
- DiffusER: Discrete Diffusion via Edit-based Reconstruction (Reid, Hellendoorn & Neubig, 2022)☆55Aug 3, 2025Updated 9 months ago
- ☆15Jul 1, 2020Updated 5 years ago
- JacQues is a Dash-based interactive web application that facilitates real-time chat and document management.☆23Jan 5, 2026Updated 4 months ago
- AI_Powered_Dev_Search_Engine☆12Mar 10, 2024Updated 2 years ago
- ☆12Jan 30, 2023Updated 3 years ago
- BABILong is a benchmark for LLM evaluation using the needle-in-a-haystack approach.☆248Sep 2, 2025Updated 8 months ago
- This repo contains the source code for RULER: What’s the Real Context Size of Your Long-Context Language Models?☆1,528Nov 13, 2025Updated 5 months ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- [EMNLP 2023] Question Answering as Programming for Solving Time-Sensitive Questions☆12Dec 18, 2023Updated 2 years ago
- ☆17Apr 7, 2025Updated last year
- Evaluating LLMs with fewer examples☆175Apr 12, 2024Updated 2 years ago
- Generate python documentation using LLMs☆69Jun 28, 2024Updated last year
- Low-probability Tokens Sustain Exploration in Reinforcement Learning with Verifiable Reward☆33Oct 5, 2025Updated 7 months ago
- Implementation of "Von Mises-Fisher Loss for Training Sequence to Sequence Models with Continuous Outputs"☆77Jun 12, 2021Updated 4 years ago
- ☆111Aug 21, 2025Updated 8 months ago
- Official repository for "PostMark: A Robust Blackbox Watermark for Large Language Models"☆27Aug 30, 2024Updated last year
- Code & data for EMNLP 2020 paper "MOCHA: A Dataset for Training and Evaluating Reading Comprehension Metrics".☆16May 3, 2022Updated 4 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- Running Microsoft's BitNet via Electron, React & Astro☆62Sep 26, 2025Updated 7 months ago
- An original implementation of the paper "CREPE: Open-Domain Question Answering with False Presuppositions"☆16Nov 5, 2024Updated last year
- ☆13Jun 4, 2024Updated last year
- [ACL'24] WebCiteS: Attributed Query-Focused Summarization on Chinese Web Search Results with Citations☆13Sep 11, 2024Updated last year
- ☆24Apr 2, 2024Updated 2 years ago
- A package to generate summaries of long-form text and evaluate the coherence of these summaries. Official package for our ICLR 2024 paper…☆131Oct 1, 2024Updated last year
- ☆98Nov 6, 2024Updated last year