Doris 3.1: frequent brpc errors even though resource usage is low

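Doris 3.1 on 9 servers, with 3 FE nodes and 9 BE nodes. See the attachment for the error log. The BE log repeatedly reports that brpc communication with one particular server is failing and that the server cannot be connected to.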
W20250919 11:46:03.726321 3704128 load_stream_stub.cpp:359] LoadStreamStub load_id=2802af132031451e-a5ac49934858f58b, src_id=1750424923086, dst_id=1757417361464, stream_id=301937 is cancelled because of [CANCELLED]PStatus: (10.138.175.134)[CANCELLED]failed to send brpc when exchange, error=Host is down, error_text=[E110]Fail to read from Socket{id=124865 fd=15453 addr=10.138.175.142:8060:49466} (0x0x7f9d30dfcf80): Connection timed out [R1][E112]Not connected to 10.138.175.142:8060 yet, server_id=124865 [R2][E112]Not connected to 10.138.175.142:8060 yet, server_id=124865 [R3][E112]Not connected to 10.138.175.142:8060 yet, server_id=124865 [R4][E112]Not connected to 10.138.175.142:8060 yet, server_id=124865 [R5][E112]Not connected to 10.138.175.142:8060 yet, server_id=124865 [R6][E112]Not connected to 10.138.175.142:8060 yet, server_id=124865 [R7][E112]Not connected to 10.138.175.142:8060 yet, server_id=124865 [R8][E112]Not connected to 10.138.175.142:8060 yet, server_id=124865 [R9][E112]Not connected to 10.138.175.142:8060 yet, server_id=124865 [R10][E112]Not connected to 10.138.175.142:8060 yet, server_id=124865, client: 10.138.175.134, latency = 7439181
W20250919 11:46:10.005395 2515944 fragment_mgr.cpp:888] Query a28663d76db4bdd-aa88983b6f0cd458 does not exists, failed to cancel it
W20250919 11:46:24.988931 2515548 fragment_mgr.cpp:888] Query 2bf90acf090247d1-9ebe0ba2727d6d88 does not exists, failed to cancel it
W20250919 11:46:25.188158 2516560 status.h:427] meet error status: [INTERNAL_ERROR]StreamWrite failed, err=22
Attachment: 1个小时内资源使用情况.png (resource usage over the past hour)
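
To check whether the failures really concentrate on a single peer, the `[E112]Not connected to ...` fragments in the BE warning log can be tallied by target address. This is a minimal sketch, not a Doris tool: the function name `count_unreachable_peers` and the regular expression are my own assumptions, written against the log format shown above, and `sample` is a shortened excerpt of that log line.

```python
import re
from collections import Counter

# Assumed pattern: matches the brpc "[E112]Not connected to <ip>:<port> yet"
# fragments exactly as they appear in the pasted BE log.
NOT_CONNECTED = re.compile(r"\[E112\]Not connected to (\d+\.\d+\.\d+\.\d+:\d+) yet")


def count_unreachable_peers(log_text: str) -> Counter:
    """Tally how often each peer address shows up in E112 retry errors."""
    return Counter(NOT_CONNECTED.findall(log_text))


# Shortened excerpt of the log line from this post (only two retries kept).
sample = (
    "error_text=[E110]Fail to read from Socket{id=124865 fd=15453 "
    "addr=10.138.175.142:8060:49466}: Connection timed out "
    "[R1][E112]Not connected to 10.138.175.142:8060 yet, server_id=124865 "
    "[R2][E112]Not connected to 10.138.175.142:8060 yet, server_id=124865"
)

# Most frequently unreachable peers first.
print(count_unreachable_peers(sample).most_common())
```

Running the same function over the full warning log from every BE would show whether 10.138.175.142:8060 is the only address affected or whether other peers also appear.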
