越来越多的“ DB :: NetException:在从套接字读取时,对等方重置连接”错误产生了很多噪音

问题描述

我正在使用认设置运行Click House版本'20 .6.4'。在浏览日志时,我发现这些日志很多。

ServerErrorHandler: Code: 210,e.displayText() = DB::NetException: Connection reset by peer,while reading from socket

u003cWarning\u003e ConnectionPoolWithFailover: Connection Failed at try №1,reason: Code: 209,e.displayText() = DB::NetException: Timeout: connect timed out: 172.16.*.*:9000 (172.16.*.*:9000) (version 20.6.3.28 (official build))","msg_id":"SERVER-1","namespace":"clickhouse.server","priority":6,"timestamp":"2020-09-21T00:01:23.623067Z","user_id":"","user_name":""}

我使用认设置的Go-Clickhouse客户端(未更改任何超时) 几乎每分钟插入一次数据(大约60-70k行) 尽管似乎没有任何影响,但是得到了很多 这些是我与超时相关的设置:

 name                                             value       type
 
 connect_timeout                                  10          SettingSeconds             
 connect_timeout_with_failover_ms                 50          SettingMilliseconds        
 connect_timeout_with_failover_secure_ms          100         SettingMilliseconds        
 receive_timeout                                  300         SettingSeconds             
 send_timeout                                     300         SettingSeconds             
 tcp_keep_alive_timeout                           0           SettingSeconds             
 idle_connection_timeout                          3600        SettingUInt64              
 distributed_directory_monitor_sleep_time_ms      100         SettingMilliseconds        
 distributed_directory_monitor_max_sleep_time_ms  30000       SettingMilliseconds        
 insert_in_memory_parts_timeout                   600000      SettingMilliseconds        
 replication_alter_columns_timeout                60          SettingUInt64              
 insert_quorum_timeout                            600000      SettingMilliseconds        
 use_client_time_zone                             0           SettingBool                
 insert_distributed_timeout                       0           SettingUInt64              
 distributed_ddl_task_timeout                     180         SettingInt64               
 stream_poll_timeout_ms                           500         SettingMilliseconds        
 http_connection_timeout                          1           SettingSeconds             
 http_send_timeout                                1800        SettingSeconds             
 http_receive_timeout                             1800        SettingSeconds             
 query_profiler_real_time_period_ns               1000000000  SettingUInt64              
 query_profiler_cpu_time_period_ns                1000000000  SettingUInt64              
 max_execution_time                               0           SettingSeconds             
 timeout_overflow_mode                            throw       SettingOverflowMode        
 timeout_before_checking_execution_speed          10          SettingSeconds             
 temporary_live_view_timeout                      5           SettingSeconds             
 lock_acquire_timeout                             120         SettingSeconds             
 mark_cache_min_lifetime                          0           SettingUInt64              
 date_time_input_format                           basic       SettingDateTimeInputFormat 

Is there anything i can change to minimize these errrors??

解决方法

这是两个不同的问题。

尝试№1时连接失败 connect_timeout_with_failover_ms 50

50 ms

cat /etc/clickhouse-server/conf.d/user_substitutes.xml

 <?xml version="1.0"?>
<yandex>
    <profiles>
        <default>
            <connect_timeout_with_failover_ms>1000</connect_timeout_with_failover_ms>
        <default>
        </profiles>
        </yandex>
,

从套接字读取消息时,对等方重置连接

该邮件中的IP地址是什么?是CH服务器还是客户端? 不过,这是一个真正的错误。这仅表示客户端已消失,服务器未获得所有预期数据。