tf.raw_ops.QuantizedMatMulWithBiasAndDequantize
tf.raw_ops.QuantizedMatMulWithBiasAndDequantize(
    a,
    b,
    bias,
    min_a,
    max_a,
    min_b,
    max_b,
    min_freezed_output,
    max_freezed_output,
    Toutput,
    transpose_a=False,
    transpose_b=False,
    input_quant_mode='MIN_FIRST',
    name=None
)
| Args | |
| --- | --- |
| a | A Tensor. Must be one of the following types: qint8, quint8, qint32, qint16, quint16. |
| b | A Tensor. Must be one of the following types: qint8, quint8, qint32, qint16, quint16. |
| bias | A Tensor. Must be one of the following types: float32, qint32. |
| min_a | A Tensor of type float32. |
| max_a | A Tensor of type float32. |
| min_b | A Tensor of type float32. |
| max_b | A Tensor of type float32. |
| min_freezed_output | A Tensor of type float32. |
| max_freezed_output | A Tensor of type float32. |
| Toutput | A tf.DType from: tf.float32. |
| transpose_a | An optional bool. Defaults to False. |
| transpose_b | An optional bool. Defaults to False. |
| input_quant_mode | An optional string from: "MIN_FIRST", "SCALED". Defaults to "MIN_FIRST". |
| name | A name for the operation (optional). |
| Returns |
| --- |
| A Tensor of type Toutput. |
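
This page lists the arguments but does not describe the computation. As the op name suggests, it combines a quantized matrix multiply, a bias add, and a dequantization to float32. The NumPy sketch below is only a conceptual float reference for that combination, not the kernel's actual arithmetic. It assumes `a` holds unsigned 8-bit codes and `b` signed 8-bit codes, that each code maps linearly onto its `[min, max]` range, and that `bias` is already float32. The helper names `dequantize_affine` and `reference_matmul_bias_dequantize` are hypothetical, and `min_freezed_output`, `max_freezed_output`, and `input_quant_mode` are deliberately not modeled.

```python
import numpy as np


def dequantize_affine(q, range_min, range_max, qmin, qmax):
    """Map integer codes in [qmin, qmax] linearly onto [range_min, range_max].

    Assumption: a plain affine mapping; the real op's rounding may differ.
    """
    scale = (range_max - range_min) / float(qmax - qmin)
    return range_min + (q.astype(np.float32) - qmin) * scale


def reference_matmul_bias_dequantize(a, b, bias, min_a, max_a, min_b, max_b,
                                     transpose_a=False, transpose_b=False):
    """Float reference: dequantize both operands, multiply, then add the bias."""
    a_f = dequantize_affine(a, min_a, max_a, 0, 255)     # a assumed to be uint8 codes
    b_f = dequantize_affine(b, min_b, max_b, -128, 127)  # b assumed to be int8 codes
    if transpose_a:
        a_f = a_f.T
    if transpose_b:
        b_f = b_f.T
    # bias is broadcast across the rows of the product.
    return (a_f @ b_f + bias).astype(np.float32)


# Illustrative inputs (made up for this sketch).
a = np.array([[0, 255], [51, 102]], dtype=np.uint8)      # codes over [0.0, 1.0]
b = np.array([[-128, 127], [127, -128]], dtype=np.int8)  # codes over [-1.0, 1.0]
bias = np.array([0.5, -0.5], dtype=np.float32)

out = reference_matmul_bias_dequantize(a, b, bias, 0.0, 1.0, -1.0, 1.0)
print(out)
```

A float reference like this can be handy for sanity-checking shapes and approximate magnitudes; the result of the actual op may differ slightly because it accumulates in integers before converting to `Toutput`.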