MEXMA: Token-level objectives improve sentence representations
☆43Jan 6, 2025Updated last year
Alternatives and similar repositories for mexma
Users that are interested in mexma are comparing it to the libraries listed below
Sorting:
- Make running benchmark simple yet maintainable, again. Now only supports Korean-based cross-encoder.☆29Dec 2, 2025Updated 3 months ago
- ☆14Jul 7, 2024Updated last year
- Fast search index for SPLADE sparse retrieval models implemented in Python using Numpy and Numba☆35Oct 16, 2025Updated 4 months ago
- Look Back to Reason Forward: Revisitable Memory for Long-Context LLM Agents☆23Feb 21, 2026Updated last week
- KoCommonGEN v2: A Benchmark for Navigating Korean Commonsense Reasoning Challenges in Large Language Models☆25Aug 24, 2024Updated last year
- PreRanker: reranking tools before tool-use☆21Apr 9, 2025Updated 10 months ago
- ☆16Mar 3, 2024Updated 2 years ago
- Korean Sentence Embedding Model Performance Benchmark for RAG☆50Jan 27, 2025Updated last year
- The training codes of Jasper-Token-Compression-600M☆19Nov 19, 2025Updated 3 months ago
- Code for the paper "Multi-Field Adaptive Retrieval," a research project on a semi-structured document retrieval☆16Feb 13, 2026Updated 2 weeks ago
- A framework for benchmarking embedding models in hybrid search scenarios (BM25 + vector search) using Weaviate.☆38Feb 12, 2026Updated 3 weeks ago
- ☆110Jan 4, 2026Updated 2 months ago
- ☆43Apr 22, 2025Updated 10 months ago
- Efficient Finetuning for OpenAI GPT-OSS☆23Oct 2, 2025Updated 5 months ago
- ☆18Apr 18, 2025Updated 10 months ago
- Training hybrid models for dummies.☆29Nov 1, 2025Updated 4 months ago
- Memory layers use a trainable key-value lookup mechanism to add extra parameters to a model without increasing FLOPs. Conceptually, spars…☆372Dec 12, 2024Updated last year
- [EMNLP 2025] Code for paper "Table-R1: Inference-Time Scaling for Table Reasoning"☆29Jun 3, 2025Updated 9 months ago
- 🚀 LLM-I: Transform LLMs into natural interleaved multimodal creators! ✨ Tool-use framework supporting image search, generation, code ex…☆41Oct 20, 2025Updated 4 months ago
- The source code for running LLMs on the AAAR-1.0 benchmark.☆18Apr 5, 2025Updated 11 months ago
- ☆19May 16, 2024Updated last year
- This repository contains code and datasets for our paper on the effects of document multiplicity while the context size is fixed in Retri…☆18Mar 13, 2025Updated 11 months ago
- Inference code for the paper "Spirit-LM Interleaved Spoken and Written Language Model".☆930Oct 28, 2024Updated last year
- 논문/발표에서 쓰기 좋은 학술 영어 문장들을 정리해보자.☆16Aug 31, 2020Updated 5 years ago
- Training tiny models to prove hard theorems☆41Feb 15, 2026Updated 2 weeks ago
- ☆19Oct 2, 2023Updated 2 years ago
- Code for paper: Long cOntext aliGnment via efficient preference Optimization☆24Oct 10, 2025Updated 4 months ago
- Korean-MTEB☆74Jan 25, 2026Updated last month
- Evaluate gpt-4o on CLIcK (Korean NLP Dataset)☆20May 18, 2024Updated last year
- KURE: 고려대학교에서 개발한, 한국어 검색에 특화된 임베딩 모델☆205Feb 26, 2026Updated last week
- Official implementation of Vector-ICL: In-context Learning with Continuous Vector Representations (ICLR 2025)☆21Jun 2, 2025Updated 9 months ago
- AskUp Search ChatGPT Plugin☆20May 27, 2023Updated 2 years ago
- Visualize expert firing frequencies across sentences in the Mixtral MoE model☆18Dec 22, 2023Updated 2 years ago
- official repository for ListT5☆48Nov 27, 2025Updated 3 months ago
- Official Implementation of FastKV: Decoupling of Context Reduction and KV Cache Compression for Prefill-Decoding Acceleration☆29Nov 22, 2025Updated 3 months ago
- ☆57Dec 27, 2025Updated 2 months ago
- A framework to study AI models in Reasoning, Alignment, and use of Memory (RAM).☆344Dec 16, 2025Updated 2 months ago
- ☆56Nov 6, 2024Updated last year
- Jina VDR is a multilingual, multi-domain benchmark for visual document retrieval☆38Aug 4, 2025Updated 7 months ago