Go to the source code of this file.
Classes | |
| class | working_gpt.Head |
| one head of self-attention More... | |
| class | working_gpt.MultiHeadAttention |
| multiple heads of self-attention in parallel More... | |
| class | working_gpt.FeedFoward |
| a simple linear layer followed by a non-linearity More... | |
| class | working_gpt.Block |
| Transformer block: communication followed by computation. More... | |
| class | working_gpt.GPTLanguageModel |
Namespaces | |
| namespace | working_gpt |
Functions | |
| working_gpt.get_batch (split) | |
| working_gpt.estimate_loss () | |