msaroufim / mynotes
β17Updated 2 weeks ago
Related projects β
Alternatives and complementary repositories for mynotes
- Experiments on GPT-3's ability to fit numerical models in-context.β14Updated 2 years ago
- NLP Examples using the π€ librariesβ42Updated 3 years ago
- Helper scripts I use to run many experiments in the morning to check at nightβ19Updated 3 years ago
- A sample pattern for running CI tests on Modalβ13Updated last month
- Embedding Recycling for Language modelsβ38Updated last year
- Basic guidance on how to contribute to Papers with Codeβ20Updated 2 years ago
- Named Entity Recognition with an decoder-only (autoregressive) LLM using HuggingFaceβ28Updated this week
- Minimum Description Length probing for neural network representationsβ16Updated last week
- Codes, scripts, and notebooks on various aspects of transformer models.β27Updated last year
- Streamlit demo app to demonstrate the features of transformers interpret with multiple models.β25Updated 3 years ago
- See https://github.com/cuda-mode/triton-index/ instead!β11Updated 6 months ago
- PyTorch implementation for MRLβ18Updated 8 months ago
- Training and Inference Notebooks for the RedPajama (OpenLlama) modelsβ18Updated last year
- β19Updated 4 years ago
- β18Updated 6 months ago
- Repository for my master thesis on automated string handlingβ16Updated 3 years ago
- β14Updated last year
- A framework for implementing equivariant DLβ10Updated 3 years ago
- An unofficial Python client library for Lambda Lab's Cloud Computing Platformβ13Updated last year
- This repository hosts the code to port NumPy model weights of BiT-ResNets to TensorFlow SavedModel format.β14Updated 2 years ago
- β27Updated last year
- ML/DL Math and Method notesβ57Updated 11 months ago
- Engineering the state of RNN language models (Mamba, RWKV, etc.)β32Updated 5 months ago
- BERT Probe: A python package for probing attention based robustness to character and word based adversarial evaluation. Also, with recipeβ¦β18Updated 2 years ago
- Code associated to papers on superposition (in ML interpretability)β24Updated 2 years ago
- Collection of python scripts to demonstrate asynchronous programming in pythonβ11Updated 2 years ago
- QAmeleon introduces synthetic multilingual QA data using PaLM, a 540B large language model. This dataset was generated by prompt tuning Pβ¦β34Updated last year
- A Chainlit App Used to Showcase: Async, Caching, Additional Chainlit Methods, and more!β11Updated last month
- Using short models to classify long textsβ20Updated last year