swiss-ai / apertus-tech-reportLinks
Tech Report of the Apertus LLM Suite
☆127Updated 3 months ago
Alternatives and similar repositories for apertus-tech-report
Users that are interested in apertus-tech-report are comparing it to the libraries listed below
Sorting:
- Massive Multimodal Open RAG & Extraction A scalable multimodal pipeline for processing, indexing, and querying multimodal documents Eve…☆174Updated 2 weeks ago
- Code for collecting, processing, and preparing datasets for the Common Pile☆248Updated 3 months ago
- ☆175Updated 4 months ago
- ☆144Updated 3 months ago
- ☆233Updated 3 weeks ago
- Alice in Wonderland code base for experiments and raw experiments data☆131Updated 3 months ago
- code for training & evaluating Contextual Document Embedding models☆201Updated 7 months ago
- Pretraining data reconstruction scripts for Apertus☆110Updated last month
- A framework for standardizing evaluations of large foundation models, beyond single-score reporting and rankings.☆173Updated 3 weeks ago
- Your buddy in the (L)LM space.☆64Updated last year
- We track and analyze the activity and performance of autonomous code agents in the wild☆47Updated 2 weeks ago
- A toolkit for describing model features and intervening on those features to steer behavior.☆223Updated last week
- The Granite Guardian models are designed to detect risks in prompts and responses.☆123Updated 2 months ago
- lossily compress representation vectors using product quantization☆59Updated last month
- A preprint version of our recent research on the capability of frontier AI systems to do self-replication☆58Updated last year
- ☆212Updated last month
- Code for the paper "Fishing for Magikarp"☆176Updated 7 months ago
- Open-source release accompanying Gao et al. 2025☆450Updated last week
- Public repository containing METR's DVC pipeline for eval data analysis☆164Updated 8 months ago
- SWE-Bench Pro: Can AI Agents Solve Long-Horizon Software Engineering Tasks?☆233Updated last month
- Pivotal Token Search☆135Updated this week
- open source interpretability platform 🧠☆562Updated this week
- ☆481Updated 5 months ago
- Train, tune, and infer Bamba model☆137Updated 6 months ago
- DiffuCoder: Understanding and Improving Masked Diffusion Models for Code Generation☆778Updated 5 months ago
- Open source interpretability artefacts for R1.☆165Updated 8 months ago
- Tracking instruction-tuned LLM openness. Paper: Liesenfeld, Andreas, Alianda Lopez, and Mark Dingemanse. 2023. “Opening up ChatGPT: Track…☆119Updated 9 months ago
- An AI benchmark for creative, human-like problem solving using Sudoku variants☆146Updated last week
- ☆87Updated 2 weeks ago
- Simple UI for debugging correlations of text embeddings☆305Updated 6 months ago