TristanThrush / i-am-a-strange-datasetView external linksLinks
Repository for "I am a Strange Dataset: Metalinguistic Tests for Language Models"
☆45Jan 11, 2024Updated 2 years ago
Alternatives and similar repositories for i-am-a-strange-dataset
Users that are interested in i-am-a-strange-dataset are comparing it to the libraries listed below
Sorting:
- Visualizing 230 years of US Census data☆12Feb 23, 2020Updated 5 years ago
- Simple and scalable tools for data-driven pretraining data selection.☆29Jun 9, 2025Updated 8 months ago
- 커버리스트 - 북 커버 생성 AI 서비스☆13Sep 11, 2022Updated 3 years ago
- ☆16Mar 22, 2025Updated 10 months ago
- TPU에서 한국어용 LLM 추론을 위한 Jax/Flax 구현체입니다.☆12Jun 12, 2023Updated 2 years ago
- Copyright-free Artificial Lyrics Dataset (ISMIR 2024 LBD)☆12Sep 1, 2024Updated last year
- a Jax/Flax inference code of StarCoder☆12Jun 12, 2023Updated 2 years ago
- 광운대학교 컴퓨터 비전 AI 경진대회 1등 솔루션입니다.☆15Oct 5, 2022Updated 3 years ago
- ☆33Dec 9, 2022Updated 3 years ago
- ☆15Nov 29, 2021Updated 4 years ago
- https://footprints.baulab.info☆17Oct 4, 2024Updated last year
- 한국의 COVID-19에 대한 한국 사회의 대응 및 데이터 기반 사회문화적 이슈의 분석☆22May 16, 2022Updated 3 years ago
- ☆15Jul 20, 2023Updated 2 years ago
- <딥러닝 일러스트레이티드>(시그마프레스, 2021)의 코드 저장소☆16Dec 7, 2022Updated 3 years ago
- 🥇 LG-AI-Challenge 2022 1위 솔루션 입니다.☆13Jun 6, 2023Updated 2 years ago
- [COLM 2024] Early Weight Averaging meets High Learning Rates for LLM Pre-training☆19Oct 12, 2024Updated last year
- 청와대 국민청원 데이터 아카이브☆15Aug 29, 2020Updated 5 years ago
- 🎖️ 5th place solution in the Google American Sign Language Fingerspelling Recognition Competition🎖️☆16Sep 19, 2023Updated 2 years ago
- Minimum Description Length probing for neural network representations☆20Jan 28, 2025Updated last year
- Convenient Text-to-Text Training for Transformers☆19Dec 10, 2021Updated 4 years ago
- Source code of paper “A Novel Three-Stage Learning Framework for Low-Resource Knowledge-Grounded Dialogue Generation”☆16Nov 25, 2021Updated 4 years ago
- Implementation of Hopfield Neural Network in Python based on Hebbian Learning Algorithm☆13Aug 10, 2019Updated 6 years ago
- ☆23Jan 27, 2025Updated last year
- Code repo for "Model-Generated Pretraining Signals Improves Zero-Shot Generalization of Text-to-Text Transformers" (ACL 2023)☆22Nov 1, 2023Updated 2 years ago
- Exploring limitations of LLM-as-a-judge☆20Aug 17, 2024Updated last year
- ☆22Nov 8, 2023Updated 2 years ago
- 《금융 전문가를 위한 머신러닝 알고리즘》 예제 코드☆18Dec 23, 2020Updated 5 years ago
- Evaluate interpretability methods on localizing and disentangling concepts in LLMs.☆57Oct 30, 2025Updated 3 months ago
- Triton Implementation of HyperAttention Algorithm☆48Dec 11, 2023Updated 2 years ago
- Jax/Flax implementation of DeiT and DeiT-III (ViT)☆19Dec 21, 2024Updated last year
- M2D2: A Massively Multi-domain Language Modeling Dataset (EMNLP 2022) by Machel Reid, Victor Zhong, Suchin Gururangan, Luke Zettlemoyer☆54Nov 21, 2022Updated 3 years ago
- Code and data for the paper "Disentangling Uncertainty in Machine Translation Evaluation", accepted at EMNLP 2022.☆23Jun 23, 2023Updated 2 years ago
- Simple GRPO scripts and configurations.☆59Feb 6, 2025Updated last year
- A hotel recommender system using SageMaker☆20Jun 13, 2020Updated 5 years ago
- Implementation of "SMaLL-100: Introducing Shallow Multilingual Machine Translation Model for Low-Resource Languages" paper, accepted to E…☆28Feb 8, 2023Updated 3 years ago
- Minimum Bayes Risk Decoding for Hugging Face Transformers☆60Jun 3, 2024Updated last year
- Code for paper "Do Language Models Have Beliefs? Methods for Detecting, Updating, and Visualizing Model Beliefs"☆28May 2, 2022Updated 3 years ago
- Efficiently computing & storing token n-grams from large corpora☆26Oct 6, 2024Updated last year
- Code for paper: “What Data Benefits My Classifier?” Enhancing Model Performance and Interpretability through Influence-Based Data Selecti…☆24May 17, 2024Updated last year