Exploring neural networks, machine learning algorithms, and deep learning architectures.
An introduction to Bayesian concepts in deep learning that links marginal likelihood, curvature, and effective dimensionality to the geometry of flat minima and generalization.
Read more →

Image by Google DeepMind on Unsplash
In this article, I explore a fun and probably useless application of language models: encrypting messages in natural language. This article includes a demo with an encrypted chat.
Read more →

In this article I present a Bayesian derivation of the Adam optimizer. I think is a more convincing explanation of it's success compared to more traditional derivations which use a bound on the regret.
Read more →
