Cadence 中的匹配服务问题

问题描述

两天前，我们开始提出节奏设置的一些问题。我们注意到的第一件事是 Open 工作流在完成后并没有从列表中消失。例如，此工作流在列表中显示为“打开”：

但是当你点击它时，你会看到它实际上已经完成了：

与此同时，我们注意到有几个工作流需要很长时间才能完成，其中几个会停留在“计划”状态，再也不会从那里走得更远。检查日志后，我们看到的唯一错误是：

{"level":"error","ts":"2021-03-06T19:12:04.865Z","msg":"Persistent store operation failure","service":"cadence-matching","component":"matching-engine","wf-task-list-name":"cadence-sys-history-scanner-tasklist-0","wf-task-list-type":1,"store-operation":"create-task","error":"InternalServiceError{Message: CreateTasks operation failed. Error : Request on table cadence.tasks with ttl of 630720000 seconds exceeds maximum supported expiration date of 2038-01-19T03:14:06+00:00. In order to avoid this use a lower TTL,change the expiration date overflow policy or upgrade to a version where this limitation is fixed. See CASSANDRA-14092 for more details.}","number":6300094,"next-number":6300094,"logging-call-at":"taskWriter.go:176","stacktrace":"github.com/uber/cadence/common/log/loggerimpl.(*loggerImpl).Error\n\t/cadence/common/log/loggerimpl/logger.go:134\ngithub.com/uber/cadence/service/matching.(*taskWriter).taskWriterLoop\n\t/cadence/service/matching/taskWriter.go:176"}

有人知道为什么会这样吗？

解决方法

第一个是因为默认情况下启用了可见性采样（以保护默认核心数据库）。您可以通过 configure cron 到 false 禁用它。

但是当您这样做时，最好将可见性和默认存储区分开到不同的数据库集群中，这样可见性不会降低默认（核心数据模型）数据库。

在https://github.com/uber/cadence/issues/3884中查看更多

第二个是 0.16.0 中修复的错误升级服务器应该可以解决。

见https://github.com/uber/cadence/pull/3627 和https://docs.datastax.com/en/dse-trblshoot/doc/troubleshooting/recoveringTtlYear2038Problem.html

cadence-workflow uber-cadence

Cadence 中的匹配服务问题

问题描述

解决方法

相关问答