Annotated transformer

This Harvard blog post is really nice. It filled in the detail implementation for the “Attention is all you need” paper.

Leave a Reply

Your email address will not be published. Required fields are marked *