sfeucht / footprintsView external linksLinks
https://footprints.baulab.info
☆17Oct 4, 2024Updated last year
Alternatives and similar repositories for footprints
Users that are interested in footprints are comparing it to the libraries listed below
Sorting:
- PathPiece tokenizer☆13Nov 10, 2024Updated last year
- Official implementation of "Data Mixture Inference: What do BPE tokenizers reveal about their training data?"☆18May 15, 2025Updated 8 months ago
- Implementation code for ACL2024:Advancing Parameter Efficiency in Fine-tuning via Representation Editing☆15Apr 20, 2024Updated last year
- Implementation of Gradient Information Optimization (GIO) for effective and scalable training data selection☆14Jun 22, 2023Updated 2 years ago
- a version of baby agi using dspy and typed predictors☆16Mar 9, 2024Updated last year
- Mixtral finetuning☆19Feb 2, 2024Updated 2 years ago
- 👩💻 Code for the ACL paper "Detecting Edit Failures in LLMs: An Improved Specificity Benchmark"☆20Jan 19, 2024Updated 2 years ago
- ☆25May 7, 2025Updated 9 months ago
- ☆29Oct 24, 2025Updated 3 months ago
- Plug-and-play Search Interfaces with Pyserini and Hugging Face☆32Aug 5, 2023Updated 2 years ago
- ☆12Oct 7, 2020Updated 5 years ago
- The NDIF server, which performs deep inference and serves nnsight requests remotely☆41Updated this week
- Code for Zero-Shot Tokenizer Transfer☆142Jan 14, 2025Updated last year
- Code for paper: Aligning Large Language Models with Representation Editing: A Control Perspective☆35Jan 31, 2025Updated last year
- Mathematical foundations of data analysis, Winter semester 22-23☆13Jan 31, 2023Updated 3 years ago
- User-friendly viewer for Parquet files☆10Jan 10, 2026Updated last month
- ☆49Apr 4, 2025Updated 10 months ago
- Efficient implementation of a parallel bucket-sort with OpenMP☆10Mar 19, 2017Updated 8 years ago
- a blog starter project☆11Oct 29, 2018Updated 7 years ago
- ☆14Apr 29, 2025Updated 9 months ago
- ☆82Jan 31, 2026Updated 2 weeks ago
- A framework for evaluating Machine Translation models.☆12May 26, 2025Updated 8 months ago
- ☆15Oct 24, 2023Updated 2 years ago
- ☆14Apr 14, 2025Updated 10 months ago
- Punch Out Model Synthesis - a program for constraint based tiling generation☆18Feb 1, 2026Updated last week
- Remote Audio Data iOS SDK☆11Aug 19, 2020Updated 5 years ago
- Simple repository for training small reasoning models☆49Feb 6, 2025Updated last year
- Conversion of audio files to text using whisper from OpenAI with a simple tkinter GUI☆10Apr 13, 2023Updated 2 years ago
- Repository for "I am a Strange Dataset: Metalinguistic Tests for Language Models"☆45Jan 11, 2024Updated 2 years ago
- [NeurIPS 2024 D&B Track] DACO: Towards Application-Driven and Comprehensive Data Analysis via Code Generation☆12Mar 5, 2025Updated 11 months ago
- My personal site, using Wowchemy☆12Updated this week
- ☆11Feb 25, 2025Updated 11 months ago
- Sample code to show how to create an in-memory RAG☆10Mar 10, 2024Updated last year
- Seamless Voice Interactions with LLMs☆12Oct 28, 2023Updated 2 years ago
- ☆11Jul 19, 2018Updated 7 years ago
- ☆10Jun 12, 2023Updated 2 years ago
- Benchmark of common hash functions☆10Sep 15, 2019Updated 6 years ago
- Official codebase for our paper "Do Language Models Use Their Depth Efficiently?"☆29Jun 25, 2025Updated 7 months ago
- Surgically de-slop LLMs☆14Jun 1, 2025Updated 8 months ago