Balaram Neupane
Notes
NLP
Positional Encoding: one of the most clever engineering decisions in the Attention Is All You Need paper
June 3, 2026
Building intuition for attention by understanding the limitations of RNNs
May 6, 2026
Problems with pure RNNs and GRUs, and how LSTM tries to solve them
May 6, 2026
The predecessors of Transformers: RNNs, how they work, and why attention was the missing block
May 5, 2026
Word representation using the continuous bag of words model
May 5, 2026
Stepping through the training of a neural network while building a sentiment classification model
May 5, 2026
Understanding the gradient flow in a computation graph by building a simple feedforward network
May 5, 2026