GPTLayer
GPT Layer.
Inherits from tensorflow.keras.layers.Layer.
Arguments
__init__ args:
depth(int): depth of the model (corresponds to embedding size).heads(int): number of attention heads.ff_nodes(int): size of Dense ReLU layer in Pointwise FF block.rate(float): dropout probability. Defaults to 0.1 as in original paper.
call args:
input_tensor(tf.tensor): input tensor (usually from PositionalEmbedding layer).
Returns
- (tf.tensor): Layer’s output.