tf.raw_ops.UnicodeEncode
Encode a tensor of ints into unicode strings.
    tf.raw_ops.UnicodeEncode(
        input_values,
        input_splits,
        output_encoding,
        errors='replace',
        replacement_char=65533,
        name=None
    )
Returns a vector of strings, where output[i] is constructed by encoding the Unicode codepoints in input_values[input_splits[i]:input_splits[i+1]] using output_encoding.
Example:
input_values = [72, 101, 108, 108, 111, 87, 111, 114, 108, 100]
input_splits = [0, 5, 10]
output_encoding = 'UTF-8'
output = ['Hello', 'World']
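The slicing rule in the example above can be modelled in a few lines of plain Python. This is a minimal sketch of the semantics only, not the op's implementation: `unicode_encode_reference` is a hypothetical helper name, it returns a list of `bytes` rather than a string tensor, and it does no error handling.

```python
def unicode_encode_reference(input_values, input_splits, output_encoding="UTF-8"):
    """Hypothetical pure-Python model of the slicing rule:
    output[i] encodes input_values[input_splits[i]:input_splits[i+1]]."""
    outputs = []
    # Consecutive pairs of split offsets delimit one output string each,
    # so n + 1 splits produce n strings.
    for start, end in zip(input_splits[:-1], input_splits[1:]):
        text = "".join(chr(cp) for cp in input_values[start:end])
        outputs.append(text.encode(output_encoding))
    return outputs


input_values = [72, 101, 108, 108, 111, 87, 111, 114, 108, 100]
input_splits = [0, 5, 10]
result = unicode_encode_reference(input_values, input_splits, "UTF-8")
print(result)  # [b'Hello', b'World']

# Per the signature on this page, the corresponding op call would be:
#   tf.raw_ops.UnicodeEncode(input_values=input_values,
#                            input_splits=input_splits,
#                            output_encoding="UTF-8")
```

Note that `input_splits` holds one more entry than the number of output strings, because each string is bounded by a pair of adjacent offsets.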
Args | Description |
---|---|
input_values | A Tensor of type int32. A 1D tensor containing the Unicode codepoints that should be encoded. |
input_splits | A Tensor. Must be one of the following types: int32, int64. A 1D tensor specifying how the Unicode codepoints should be split into strings. In particular, output[i] is constructed by encoding the codepoints in the slice input_values[input_splits[i]:input_splits[i+1]]. |
output_encoding | A string from: "UTF-8", "UTF-16-BE", "UTF-32-BE". Unicode encoding of the output strings. |
errors | An optional string from: "ignore", "replace", "strict". Defaults to "replace". Error handling policy when invalid formatting is found in the input. A value of 'strict' causes the operation to produce an InvalidArgument error on any invalid input formatting. A value of 'replace' (the default) causes the operation to replace any invalid formatting in the input with the replacement_char codepoint. A value of 'ignore' causes the operation to skip any invalid formatting in the input and produce no corresponding output character. |
replacement_char | An optional int. Defaults to 65533. The replacement character codepoint to be used in place of any invalid formatting in the input when errors='replace'. Any valid Unicode codepoint may be used. The default value is the standard Unicode replacement character, U+FFFD (decimal 65533). |
name | A name for the operation (optional). |
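The three `errors` policies described in the table can be illustrated with a small pure-Python model. This is a sketch under stated assumptions, not the op's actual code: `apply_error_policy` and `is_valid_codepoint` are hypothetical names, `ValueError` stands in for the op's InvalidArgument error, and treating surrogates and out-of-range values as "invalid" is an assumption that this page does not spell out.

```python
REPLACEMENT_CHAR = 0xFFFD  # decimal 65533, the documented default


def is_valid_codepoint(cp):
    # Assumption: a codepoint is valid if it lies in the Unicode range
    # and is not a UTF-16 surrogate. The op's exact rule may differ.
    return 0 <= cp <= 0x10FFFF and not (0xD800 <= cp <= 0xDFFF)


def apply_error_policy(codepoints, errors="replace",
                       replacement_char=REPLACEMENT_CHAR):
    """Hypothetical model of the 'strict' / 'replace' / 'ignore' policies."""
    if errors not in ("ignore", "replace", "strict"):
        raise ValueError(f"unknown errors policy: {errors!r}")
    cleaned = []
    for cp in codepoints:
        if is_valid_codepoint(cp):
            cleaned.append(cp)
        elif errors == "strict":
            # Stand-in for the op's InvalidArgument error.
            raise ValueError(f"invalid codepoint: {cp}")
        elif errors == "replace":
            cleaned.append(replacement_char)
        # errors == "ignore": skip the codepoint, emitting nothing.
    return cleaned


bad_input = [72, 0x110000, 105]  # 0x110000 is beyond the Unicode range
print(apply_error_policy(bad_input, "replace"))  # [72, 65533, 105]
print(apply_error_policy(bad_input, "ignore"))   # [72, 105]
```

With `errors="strict"`, the same input raises instead of returning a list.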
Returns |
---|
A Tensor of type string. |
Last updated 2024-04-26 UTC.