gauss5930 / iDUS
An unofficial implementation of the SOLAR-10.7B model and of the newly proposed interlocked-DUS (iDUS), with implementation and experiment details.
☆14Mar 20, 2024Updated last year
Alternatives and similar repositories for iDUS
Users that are interested in iDUS are comparing it to the libraries listed below
- BERT score for text generation☆12Jan 15, 2025Updated last year
- KoCommonGEN v2: A Benchmark for Navigating Korean Commonsense Reasoning Challenges in Large Language Models☆25Aug 24, 2024Updated last year
- A Flutter plugin for integrating Liquid AI's LEAP SDK, enabling on-device deployment of small language models in Flutter applications.☆22Sep 3, 2025Updated 5 months ago
- Understanding the correlation between different LLM benchmarks☆29Jan 11, 2024Updated 2 years ago
- ☆12Dec 20, 2024Updated last year
- Korean model evaluation using a self-built Korean evaluation dataset☆31May 31, 2024Updated last year
- Repo hosting code and materials related to speeding up LLM inference using token merging.☆37Oct 9, 2025Updated 4 months ago
- ☆16Apr 11, 2022Updated 3 years ago
- Sakura-SOLAR-DPO: Merge, SFT, and DPO☆116Dec 30, 2023Updated 2 years ago
- Simple Model Similarities Analysis☆21Feb 3, 2024Updated 2 years ago
- ☆31Oct 15, 2021Updated 4 years ago
- A single repo with all scripts and utils to train / fine-tune the Mamba model with or without FIM☆61Apr 8, 2024Updated last year
- DSBA code study☆30Nov 7, 2023Updated 2 years ago
- Evaluating language model responses using a Reward Model☆29Feb 23, 2024Updated last year
- The most modern LLM evaluation toolkit☆70Nov 9, 2025Updated 3 months ago
- [Findings of NAACL2022] A Dog Is Passing Over The Jet? A Text-Generation Dataset for Korean Commonsense Reasoning and Evaluation☆28Dec 9, 2022Updated 3 years ago
- Korean datasets on huggingface☆35Oct 10, 2024Updated last year
- Official code for "Interpretable part-whole hierarchies and conceptual-semantic relationships in neural networks" (CVPR2022)☆34Jun 13, 2022Updated 3 years ago
- [ACL 2025 Main] Official Repository for "Evaluating Language Models as Synthetic Data Generators"☆40Dec 13, 2024Updated last year
- Panorama_498 panoramic image dataset☆14Apr 8, 2022Updated 3 years ago
- GestureX is an OpenCV-based hand motion sensing system for intuitive, efficient user control. This project aims to investigate the potenti…☆16Jun 29, 2024Updated last year
- A part of the course Mobile Application Development☆13Nov 30, 2021Updated 4 years ago
- Bilibili API collection and documentation [continuously updated....]☆10Apr 25, 2025Updated 9 months ago
- Kor-IR: Korean Information Retrieval Benchmark☆87Jul 3, 2024Updated last year
- [KOREAN] Code for generating synthetic text images as described in "Synthetic Data for Text Localisation in Natural Images", Ankush Gupta…☆32Nov 25, 2019Updated 6 years ago
- HyFormer: Hybrid Transformer and CNN For Pixel-level Multispectral Image Classification☆15Feb 15, 2023Updated 2 years ago
- A library for training crosscoders☆15May 28, 2025Updated 8 months ago
- This is the official GDSC repo with all of the source code presented in the video tutorials☆14Jun 27, 2023Updated 2 years ago
- Official code for AL-PINNS: Augmented Lagrangian relaxation method for Physics-Informed Neural Networks☆12Jul 29, 2023Updated 2 years ago
- MLOps Pipeline for Amazon Forecast written in AWS CDK☆11Apr 10, 2025Updated 10 months ago
- TransientViT: A novel CNN - Vision Transformer hybrid real/bogus transient classifier for the Kilodegree Automatic Transient Survey☆10Nov 7, 2024Updated last year
- Implementation of a simple linear regression algorithm in MAMBA☆10Feb 12, 2020Updated 6 years ago
- This is code for the EMNLP 2022 Paper "UniRPG: Unified Discrete Reasoning over Table and Text as Program Generation".☆10Apr 30, 2023Updated 2 years ago
- WindSR Dataset contains more than 22,000 pairs of HR/LR wind speed images, which are processed using the NASA's GEOS-5 Nature Run dataset…☆11Jan 18, 2024Updated 2 years ago
- Example Next.js app that will consume our Serverless Node.js API from https://github.com/codingforentrepreneurs/serverless-nodejs-api☆11Jan 27, 2024Updated 2 years ago
- Implementation based on lightrag and nano-graphrag to connect with psql☆15Oct 28, 2024Updated last year
- Predict the number of deaths due to covid19 in the next two weeks☆11Oct 2, 2022Updated 3 years ago
- Implementation of SelfExtend from the paper "LLM Maybe LongLM: Self-Extend LLM Context Window Without Tuning" from Pytorch and Zeta☆13Nov 11, 2024Updated last year
- Implementation of a Hierarchical Mamba as described in the paper: "Hierarchical State Space Models for Continuous Sequence-to-Sequence Mo…☆15Nov 11, 2024Updated last year