Repository for "I am a Strange Dataset: Metalinguistic Tests for Language Models"
☆45Jan 11, 2024Updated 2 years ago
Alternatives and similar repositories for i-am-a-strange-dataset
Users who are interested in i-am-a-strange-dataset are comparing it to the repositories listed below
- Visualizing 230 years of US Census data☆12Feb 23, 2020Updated 6 years ago
- Simple and scalable tools for data-driven pretraining data selection.☆29Jun 9, 2025Updated 9 months ago
- A Jax/Flax implementation for Korean LLM inference on TPU.☆12Jun 12, 2023Updated 2 years ago
- 커버리스트 - an AI service for generating book covers☆13Sep 11, 2022Updated 3 years ago
- Copyright-free Artificial Lyrics Dataset (ISMIR 2024 LBD)☆12Sep 1, 2024Updated last year
- ☆16Mar 22, 2025Updated 11 months ago
- SKT'22 AI Fellowship, development of deep learning-based colorization technology for black-and-white images☆13Jun 7, 2023Updated 2 years ago
- Serving large language model with transformers☆13Oct 18, 2022Updated 3 years ago
- a Jax/Flax inference code of StarCoder☆12Jun 12, 2023Updated 2 years ago
- Detecting topic clusters in arXiv ML papers.☆14Oct 10, 2020Updated 5 years ago
- 🥈12th place solution on G2Net Detecting Continuous Gravitational Waves🥈☆14Jan 4, 2023Updated 3 years ago
- Analysis of Korean society's response to COVID-19 in Korea and of data-driven sociocultural issues☆22May 16, 2022Updated 3 years ago
- ☆15Nov 29, 2021Updated 4 years ago
- https://footprints.baulab.info☆18Oct 4, 2024Updated last year
- ☆16Jul 20, 2023Updated 2 years ago
- Code repository for <Deep Learning Illustrated> (Sigma Press, 2021)☆16Dec 7, 2022Updated 3 years ago
- Data archive of Blue House (Cheong Wa Dae) national petitions☆15Aug 29, 2020Updated 5 years ago
- Convenient Text-to-Text Training for Transformers☆19Dec 10, 2021Updated 4 years ago
- [COLM 2024] Early Weight Averaging meets High Learning Rates for LLM Pre-training☆19Oct 12, 2024Updated last year
- Source code of paper “A Novel Three-Stage Learning Framework for Low-Resource Knowledge-Grounded Dialogue Generation”☆16Nov 25, 2021Updated 4 years ago
- Minimum Description Length probing for neural network representations☆20Jan 28, 2025Updated last year
- 🥇 1st place solution for LG-AI-Challenge 2022.☆13Jun 6, 2023Updated 2 years ago
- Code for ACL 2023 Paper: ACLM: A Selective-Denoising based Generative Data Augmentation Approach for Low-Resource Complex NER☆21Jul 19, 2023Updated 2 years ago
- 🎖️ 5th place solution in the Google American Sign Language Fingerspelling Recognition Competition🎖️☆16Sep 19, 2023Updated 2 years ago
- Exploring limitations of LLM-as-a-judge☆20Aug 17, 2024Updated last year
- Code repo for "Model-Generated Pretraining Signals Improves Zero-Shot Generalization of Text-to-Text Transformers" (ACL 2023)☆22Nov 1, 2023Updated 2 years ago
- ☆23Jan 27, 2025Updated last year
- Submission to the inverse scaling prize☆23Jul 23, 2023Updated 2 years ago
- Example code for 《Machine Learning Algorithms for Financial Professionals》☆18Dec 23, 2020Updated 5 years ago
- Evaluate interpretability methods on localizing and disentangling concepts in LLMs.☆57Oct 30, 2025Updated 4 months ago
- Triton Implementation of HyperAttention Algorithm☆48Dec 11, 2023Updated 2 years ago
- Jax/Flax implementation of DeiT and DeiT-III (ViT)☆19Dec 21, 2024Updated last year
- M2D2: A Massively Multi-domain Language Modeling Dataset (EMNLP 2022) by Machel Reid, Victor Zhong, Suchin Gururangan, Luke Zettlemoyer☆54Nov 21, 2022Updated 3 years ago
- Code and data for the paper "Disentangling Uncertainty in Machine Translation Evaluation", accepted at EMNLP 2022.☆23Jun 23, 2023Updated 2 years ago
- Course for Interpreting ML Models☆52Feb 16, 2023Updated 3 years ago
- Simple GRPO scripts and configurations.☆59Feb 6, 2025Updated last year
- A hotel recommender system using SageMaker☆20Jun 13, 2020Updated 5 years ago
- Minimum Bayes Risk Decoding for Hugging Face Transformers☆60Jun 3, 2024Updated last year
- Code for "The Unreasonable Effectiveness of Linear Prediction as a Perceptual Metric"☆23Jan 26, 2024Updated 2 years ago