启动工作程序后数据流作业未完成/失败

问题描述

我创建了一个数据流管道,它从存储桶中读取文件,并对数据进行简单的转换(例如:修剪空格)。

当我执行数据流作业时,作业已启动,日志显示工作人员在一个区域中启动,但之后没有任何反应。作业永远不会完成或失败。我不得不手动停止作业。

Dataflow 作业已由具有 dataflow.worker 角色、dataflow.developer 角色和 dataflow.objectAdmin 角色的服务帐号执行。

请有人建议为什么数据流作业没有完成或者为什么工作启动后作业没有执行。

2021-02-09 11:01:29.753 GMTWorker configuration: n1-standard-1 in europe-west2-b.
Warning
2021-02-09 11:01:30.015 GMTThe network sdas-global-dev doesn't have rules that open TCP ports 12345-12346 for internal connection with other VMs. Only rules with a target tag 'dataflow' or empty target tags set apply. If you don't specify such a rule,any pipeline with more than one worker that shuffles data will hang. Causes: No firewall rules associated with your network.
Info
2021-02-09 11:01:31.067 GMTExecuting operation Read files/Read+ManageData/ParDo(ManageData)
Info
2021-02-09 11:01:31.115 GMTStarting 1 workers in europe-west2-b...
Warning
2021-02-09 11:07:33.341 GMTThe network sdas-global-dev doesn't have rules that open TCP ports 12345-12346 for internal connection with other VMs. Only rules with a target tag 'dataflow' or empty target tags set apply. If you don't specify such a rule,any pipeline with more than one worker that shuffles data will hang. Causes: No firewall rules associated with your network.

解决方法

我发现了问题。我在 VPC 位于不同区域的区域中运行该作业。因此,工人无法旋转起来。把地域和VPC一样,然后一切顺利。