☆14Jul 7, 2024Updated last year
Alternatives and similar repositories for Contrastive-Accumulation
Users that are interested in Contrastive-Accumulation are comparing it to the libraries listed below
Sorting:
- The training codes of Jasper-Token-Compression-600M☆19Nov 19, 2025Updated 4 months ago
- Make running benchmark simple yet maintainable, again. Now only supports Korean-based cross-encoder.☆29Dec 2, 2025Updated 3 months ago
- DSBA code study☆30Nov 7, 2023Updated 2 years ago
- PyTorch distributed training comparison☆15Apr 11, 2021Updated 4 years ago
- 🎹 Instruct.KR 2025 Summer Meetup: 오픈소스 LLM, vLLM으로 Production까지 🎹☆23Aug 2, 2025Updated 7 months ago
- MEXMA: Token-level objectives improve sentence representations☆43Jan 6, 2025Updated last year
- Code for the paper "Multi-Field Adaptive Retrieval," a research project on a semi-structured document retrieval☆18Feb 13, 2026Updated last month
- Application of Retrieval-Augmented Reasoning on a domain-specific body of knowledge☆34Feb 27, 2026Updated 3 weeks ago
- A framework for benchmarking embedding models in hybrid search scenarios (BM25 + vector search) using Weaviate.☆38Mar 14, 2026Updated last week
- "CS224n 2021 winter" study - KoreaUniv. DSBA Lab☆15Apr 18, 2022Updated 3 years ago
- Evaluate state-of-the-art sparse embedding models on the LIMIT dataset (`limit-small` and `limit`) from google's paper `On the Theoretica…☆16Sep 4, 2025Updated 6 months ago
- RAPID: Training-free Retrieval-based Log Anomaly Detection with PLM considering Token-level information☆18Jul 11, 2024Updated last year
- Review papers of NLP, mainly LLM.☆31Apr 8, 2024Updated last year
- AutoRAG example about benchmarking Korean embeddings.☆43Oct 2, 2024Updated last year
- Time-series-LLM☆23Oct 31, 2023Updated 2 years ago
- ☆58Jan 26, 2025Updated last year
- Korean-MTEB☆75Mar 12, 2026Updated last week
- Multi^2OIE: Multilingual Open Information Extraction Based on Multi-Head Attention with BERT (Findings of ACL: EMNLP 2020)☆56Aug 14, 2022Updated 3 years ago
- huggingface transformers tutorial, code, resources☆26Apr 7, 2024Updated last year
- Active Learning Helps Pretrained Models Learn the Intended Task (https://arxiv.org/abs/2204.08491) by Alex Tamkin, Dat Nguyen, Salil Desh…☆11Nov 22, 2022Updated 3 years ago
- Korean Sentence Embedding Model Performance Benchmark for RAG☆50Jan 27, 2025Updated last year
- KoCommonGEN v2: A Benchmark for Navigating Korean Commonsense Reasoning Challenges in Large Language Models☆25Aug 24, 2024Updated last year
- ☆19May 16, 2024Updated last year
- Efficiently computing & storing token n-grams from large corpora☆27Oct 6, 2024Updated last year
- [AAAI 24] GradTree: Gradient-Based Axis-Aligned Decision Trees☆15Aug 28, 2024Updated last year
- Training code for Sparse Autoencoders on Embedding models☆39Feb 27, 2025Updated last year
- ☆21Nov 30, 2022Updated 3 years ago
- ☆34Feb 27, 2024Updated 2 years ago
- This repository contains the official code for the paper: "Prompt Injection: Parameterization of Fixed Inputs"☆32Sep 13, 2024Updated last year
- ☆22Nov 23, 2023Updated 2 years ago
- ☆22Dec 1, 2022Updated 3 years ago
- ☆23Mar 19, 2024Updated 2 years ago
- ☆25Feb 27, 2023Updated 3 years ago
- Simple replication of DPR (Dense Passage Retrieval)☆54Nov 10, 2023Updated 2 years ago
- Lightning template for easy prototyping⚡️☆13Oct 31, 2022Updated 3 years ago
- 대학생을 위한 IT 스펙 저장소 PRE:FOLIO 클라이언트☆10Jul 19, 2023Updated 2 years ago
- Fast search index for SPLADE sparse retrieval models implemented in Python using Numpy and Numba☆37Oct 16, 2025Updated 5 months ago
- ☆25Nov 24, 2023Updated 2 years ago
- pytorch reimplementation for Detecting Adversarial Examples from Sensitivity Inconsistency of Spatial-Transform Domain☆11Oct 30, 2022Updated 3 years ago