swiss-ai / apertus-tech-report
Tech Report of the Apertus LLM Suite
☆127Updated 4 months ago
Alternatives and similar repositories for apertus-tech-report
Users who are interested in apertus-tech-report are comparing it to the repositories listed below
- ☆151Updated 4 months ago
- ☆252Updated last week
- Alice in Wonderland code base for experiments and raw experiments data☆131Updated 4 months ago
- Tracking instruction-tuned LLM openness. Paper: Liesenfeld, Andreas, Alianda Lopez, and Mark Dingemanse. 2023. “Opening up ChatGPT: Track…☆119Updated 10 months ago
- A toolkit for describing model features and intervening on those features to steer behavior.☆226Updated last month
- Massive Multimodal Open RAG & Extraction. A scalable multimodal pipeline for processing, indexing, and querying multimodal documents Eve…☆183Updated this week
- Pivotal Token Search☆142Updated last month
- open source interpretability platform 🧠☆632Updated this week
- Open source interpretability artefacts for R1.☆167Updated 8 months ago
- Code for collecting, processing, and preparing datasets for the Common Pile☆247Updated 4 months ago
- Evaluating LLMs with fewer examples☆169Updated last year
- Train, tune, and infer Bamba model☆138Updated 7 months ago
- code for training & evaluating Contextual Document Embedding models☆202Updated 8 months ago
- A framework for standardizing evaluations of large foundation models, beyond single-score reporting and rankings.☆173Updated 2 weeks ago
- ☆218Updated 2 months ago
- Pretraining data reconstruction scripts for Apertus☆112Updated 2 months ago
- Curated collection of community environments☆204Updated last week
- Simple & Scalable Pretraining for Neural Architecture Research☆306Updated last month
- lossily compress representation vectors using product quantization☆59Updated 2 months ago
- Tooling for exact and MinHash deduplication of large-scale text datasets☆52Updated this week
- [ICML 2024] Binoculars: Zero-Shot Detection of LLM-Generated Text☆336Updated last year
- DiffuCoder: Understanding and Improving Masked Diffusion Models for Code Generation☆785Updated 6 months ago
- Functional Benchmarks and the Reasoning Gap☆89Updated last year
- ☆90Updated last month
- Python library to use Pleias-RAG models☆67Updated 8 months ago
- SWE-Bench Pro: Can AI Agents Solve Long-Horizon Software Engineering Tasks?☆246Updated last week
- chrome extension for renaming tabs showing paper-pdfs from common providers☆97Updated last year
- The Granite Guardian models are designed to detect risks in prompts and responses.☆127Updated 3 months ago
- Open-source release accompanying Gao et al. 2025☆490Updated last month
- The public specifications for the C2PA☆161Updated this week