On this page
tf.raw_ops.QuantizedMatMul
Perform a quantized matrix multiplication of a
by the matrix b
.
tf.raw_ops.QuantizedMatMul(
a,
b,
min_a,
max_a,
min_b,
max_b,
Toutput=tf.dtypes.qint32,
transpose_a=False,
transpose_b=False,
Tactivation=tf.dtypes.quint8,
name=None
)
The inputs must be two-dimensional matrices and the inner dimension of a
(after being transposed if transpose_a
is non-zero) must match the outer dimension of b
(after being transposed if transposed_b
is non-zero).
Args | |
---|---|
a |
A Tensor . Must be one of the following types: qint8 , quint8 , qint32 , qint16 , quint16 . Must be a two-dimensional tensor. |
b |
A Tensor . Must be one of the following types: qint8 , quint8 , qint32 , qint16 , quint16 . Must be a two-dimensional tensor. |
min_a |
A Tensor of type float32 . The float value that the lowest quantized a value represents. |
max_a |
A Tensor of type float32 . The float value that the highest quantized a value represents. |
min_b |
A Tensor of type float32 . The float value that the lowest quantized b value represents. |
max_b |
A Tensor of type float32 . The float value that the highest quantized b value represents. |
Toutput |
An optional tf.DType from: tf.qint8, tf.quint8, tf.qint32, tf.qint16, tf.quint16 . Defaults to tf.qint32 . |
transpose_a |
An optional bool . Defaults to False . If true, a is transposed before multiplication. |
transpose_b |
An optional bool . Defaults to False . If true, b is transposed before multiplication. |
Tactivation |
An optional tf.DType from: tf.qint8, tf.quint8, tf.qint32, tf.qint16, tf.quint16 . Defaults to tf.quint8 . The type of output produced by activation function following this operation. |
name |
A name for the operation (optional). |
Returns | |
---|---|
A tuple of Tensor objects (out, min_out, max_out). |
|
out |
A Tensor of type Toutput . |
min_out |
A Tensor of type float32 . |
max_out |
A Tensor of type float32 . |
© 2022 The TensorFlow Authors. All rights reserved.
Licensed under the Creative Commons Attribution License 4.0.
Code samples licensed under the Apache 2.0 License.
https://www.tensorflow.org/versions/r2.9/api_docs/python/tf/raw_ops/QuantizedMatMul