View on GitHub

maximal

A TensorFlow-compatible Python library that provides models and layers to implement custom Transformer neural networks. Built on TensorFlow 2.

ImageEmbedding()

Takes a batch of images of shape [batch_size, height, width, channels], breaks them into image patches, and combines their linear projections with positional ambeddings. This Embedding layer is supposed to be the input of Vision Transformer architectures.

Inherits from tensorflow.keras.layers.Layer.

Arguments

__init__ arguments:

call arguments:

Returns

Used in tutorial