Go to the source code of this file.
Classes | |
class | working_gpt.Head |
one head of self-attention More... | |
class | working_gpt.MultiHeadAttention |
multiple heads of self-attention in parallel More... | |
class | working_gpt.FeedFoward |
a simple linear layer followed by a non-linearity More... | |
class | working_gpt.Block |
Transformer block: communication followed by computation. More... | |
class | working_gpt.GPTLanguageModel |
Namespaces | |
namespace | working_gpt |
Functions | |
working_gpt.get_batch (split) | |
working_gpt.estimate_loss () | |