Custom text pre-processing saved in Tensorflow model

Original post 2022-07-15 10:51:42 4 1 python/ tensorflow/ keras/ nlp

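How can I write custom text pre-processing that can be saved as part of the model?

Suppose I want two features:

1. Auto-correct the string input with some function. Words may change after this operation.
2. Perform query expansion on the string input, so that the resulting text/tokens may contain a few extra words (for which weights will be trained).

Something like this:

1. flight to London (misspelled in the input) -> flight to London
2. flight to London -> flight to London loc_city
   -> this token needs to be in the vocabulary up front, which can be done

After step 1 and/or 2, can the result be fed to a TextVectorization / Embedding layer?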

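There is the standardize callback, but I don't see an obvious way to do this with the existing tf.strings operations.

Ideally, there would be a callback function/layer that accepts a string (or tokens) and maps it to another string (or string tokens).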
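For context, the standardize argument of TextVectorization accepts a callable built from tf.strings ops. The following is only a minimal sketch of step 1 under that approach: the CORRECTIONS table and the autocorrect_standardize name are hypothetical, and a fixed lookup table is a stand-in for a real auto-corrector, not an implementation of one.

```python
import tensorflow as tf

# Hypothetical typo table (assumption); a real auto-corrector would need more.
CORRECTIONS = {"lonon": "london", "pariss": "paris"}


def autocorrect_standardize(text):
    """Lowercase the input, then rewrite whole-word typos with tf.strings ops."""
    text = tf.strings.lower(text)
    for wrong, right in CORRECTIONS.items():
        # \b restricts the match to whole words (RE2 regex syntax).
        text = tf.strings.regex_replace(text, r"\b" + wrong + r"\b", right)
    return text


vectorizer = tf.keras.layers.TextVectorization(
    standardize=autocorrect_standardize,  # replaces the default standardization
    vocabulary=["flight", "to", "london"],
)

ids = vectorizer(tf.constant(["Flight to Lonon"]))
vocab = vectorizer.get_vocabulary()
# Map the integer ids back to tokens to inspect the result.
print([vocab[i] for i in ids.numpy()[0]])
```

Because the callable uses only TensorFlow string ops, it runs inside the layer rather than in separate Python pre-processing. Depending on the save format, a custom callable may need to be registered as a Keras serializable object before the model can be reloaded, so it is worth checking the saving path you use.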

1 Answer

You can get the first character of each string like this:

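import tensorflow as tf

class StringLayer(tf.keras.layers.Layer):
  """Custom layer that maps each input string to its first character."""

  def __init__(self):
    super(StringLayer, self).__init__()

  def call(self, inputs):
    # Split each string into single bytes, drop the size-1 axis, pad the
    # ragged result into a dense tensor, and keep the first byte of each row.
    return tf.squeeze(tf.strings.bytes_split(inputs), axis=1).to_tensor()[:, 0]

# Batch of two strings with shape (2, 1)
s = tf.constant([['next_string'], ['some_string']])
layer = StringLayer()
print(layer(s))
# tf.Tensor([b'n' b's'], shape=(2,), dtype=string)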
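The same string-in, string-out layer pattern can be applied to the query expansion step from the question. The sketch below is an illustration under assumptions, not a definitive solution: QueryExpansionLayer, its cities list, and the matching rule are all hypothetical, and the only addition taken from the question is the loc_city token.

```python
import tensorflow as tf


class QueryExpansionLayer(tf.keras.layers.Layer):
    """Hypothetical layer: appends a tag token when a known city is mentioned."""

    def __init__(self, cities, tag="loc_city", **kwargs):
        super().__init__(**kwargs)
        self.cities = tuple(cities)
        self.tag = tag
        # Full-match pattern: any text containing one of the cities as a whole word.
        self._pattern = r".*\b(" + "|".join(self.cities) + r")\b.*"

    def call(self, inputs):
        has_city = tf.strings.regex_full_match(inputs, self._pattern)
        # Append " <tag>" where a city was found, and nothing otherwise.
        suffix = tf.where(has_city, " " + self.tag, "")
        return tf.strings.join([inputs, suffix])

    def get_config(self):
        # Expose constructor arguments so Keras can rebuild the layer on load.
        config = super().get_config()
        config.update({"cities": list(self.cities), "tag": self.tag})
        return config


expand = QueryExpansionLayer(cities=["london", "paris"])
vectorizer = tf.keras.layers.TextVectorization(
    standardize=None,  # keep the underscore in loc_city intact
    vocabulary=["flight", "to", "london", "loc_city"],
)

queries = tf.constant(["flight to london", "flight to nowhere"])
expanded = expand(queries)
ids = vectorizer(expanded)
print(expanded)
print(ids)
```

Here standardize=None is used because the default standardization strips punctuation, which would also remove the underscore and turn loc_city into a token that is not in the vocabulary. An auto-correct step could be chained in front of this layer in the same way, as long as it is expressed with TensorFlow string ops.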
