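Hi all, I've run into a problem on Doris 2.1: all the BE nodes lose contact with the FE at the same time for a few seconds. I checked the network and found nothing abnormal. What else should I check or tune?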
FE log:
2025-07-01 05:30:01,739 WARN (heartbeat-mgr-pool-0|189) [HeartbeatMgr$BackendHeartbeatHandler.call():300] backend heartbeat got exception
org.apache.thrift.transport.TTransportException: Socket is closed by peer.
at org.apache.thrift.transport.TIOStreamTransport.read(TIOStreamTransport.java:184) ~[libthrift-0.16.0.jar:0.16.0]
at org.apache.thrift.transport.TTransport.readAll(TTransport.java:109) ~[libthrift-0.16.0.jar:0.16.0]
at org.apache.thrift.protocol.TBinaryProtocol.readAll(TBinaryProtocol.java:464) ~[libthrift-0.16.0.jar:0.16.0]
at org.apache.thrift.protocol.TBinaryProtocol.readI32(TBinaryProtocol.java:362) ~[libthrift-0.16.0.jar:0.16.0]
at org.apache.thrift.protocol.TBinaryProtocol.readMessageBegin(TBinaryProtocol.java:245) ~[libthrift-0.16.0.jar:0.16.0]
at org.apache.thrift.TServiceClient.receiveBase(TServiceClient.java:77) ~[libthrift-0.16.0.jar:0.16.0]
at org.apache.doris.thrift.HeartbeatService$Client.recvHeartbeat(HeartbeatService.java:61) ~[fe-common-1.2-SNAPSHOT.jar:1.2-SNAPSHOT]
at org.apache.doris.thrift.HeartbeatService$Client.heartbeat(HeartbeatService.java:48) ~[fe-common-1.2-SNAPSHOT.jar:1.2-SNAPSHOT]
at org.apache.doris.system.HeartbeatMgr$BackendHeartbeatHandler.call(HeartbeatMgr.java:243) ~[doris-fe.jar:1.2-SNAPSHOT]
at org.apache.doris.system.HeartbeatMgr$BackendHeartbeatHandler.call(HeartbeatMgr.java:217) ~[doris-fe.jar:1.2-SNAPSHOT]
at java.util.concurrent.FutureTask.run(FutureTask.java:266) ~[?:1.8.0_432]
at java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1149) ~[?:1.8.0_432]
at java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:624) ~[?:1.8.0_432]
at java.lang.Thread.run(Thread.java:750) ~[?:1.8.0_432]
BE log:
W20250701 05:30:06.080868 81700 stream_load.cpp:677] plan streaming load failed. errmsg=[ANALYSIS_ERROR]TStatus: errCode = 2, detailMessage = No available backendsid=2c404420d5bce939-fca0e7c113a21b89, job_id=-1, txn_id=70624606, label=flink_connector_20250701_053006_12a38462b21445e6aeb71c71943cf256, elapse(s)=0
W20250701 05:30:06.099490 81710 stream_load.cpp:677] plan streaming load failed. errmsg=[ANALYSIS_ERROR]TStatus: errCode = 2, detailMessage = No available backendsid=2540c38ca9a5b76a-26ffdb0b441bf490, job_id=-1, txn_id=70624608, label=flink_connector_20250701_053006_8fc887ec0e384fc48ea695e155286b6f, elapse(s)=0
W20250701 05:30:06.100279 81701 stream_load.cpp:677] plan streaming load failed. errmsg=[ANALYSIS_ERROR]TStatus: errCode = 2, detailMessage = No available backendsid=c24a2de532722eaa-b4051da83dffb0ae, job_id=-1, txn_id=70624609, label=flink_connector_20250701_053006_06e31bcfc6ab4daa9702e682d9fc15f2, elapse(s)=0
W20250701 05:30:06.121579 81692 stream_load.cpp:677] plan streaming load failed. errmsg=[ANALYSIS_ERROR]TStatus: errCode = 2, detailMessage = No available backendsid=cf4fb9cfdfd68e9f-3eed2b094b8d39b0, job_id=-1, txn_id=70624613, label=flink_connector_20250701_053006_7ea9db57cc054b009954752d7ff70a91, elapse(s)=0
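
To see how the FE heartbeat exception lines up in time with the BE-side "No available backends" errors, the timestamps in the two logs can be compared with a small script. This is only a minimal sketch, assuming the log formats shown above (log4j-style timestamps in the FE log, glog-style timestamps in the BE log) and assuming the FE and BE hosts have synchronized clocks. The function names are hypothetical helpers, not part of Doris.

```python
import re
from datetime import datetime

# FE log lines (log4j style), e.g. "2025-07-01 05:30:01,739 WARN (...) ..."
FE_RE = re.compile(r"^(\d{4}-\d{2}-\d{2} \d{2}:\d{2}:\d{2}),(\d{3})\s")
# BE log lines (glog style), e.g. "W20250701 05:30:06.080868 81700 file.cpp:677] ..."
BE_RE = re.compile(r"^[IWEF](\d{8} \d{2}:\d{2}:\d{2}\.\d{6})\s")


def parse_fe_time(line):
    """Return the timestamp of an FE log line, or None if the line has none."""
    m = FE_RE.match(line)
    if m is None:
        return None  # e.g. a stack-trace continuation line
    base = datetime.strptime(m.group(1), "%Y-%m-%d %H:%M:%S")
    return base.replace(microsecond=int(m.group(2)) * 1000)


def parse_be_time(line):
    """Return the timestamp of a BE log line, or None if the line has none."""
    m = BE_RE.match(line)
    if m is None:
        return None
    return datetime.strptime(m.group(1), "%Y%m%d %H:%M:%S.%f")


def first_match_time(lines, needle, parse):
    """Timestamp of the first line containing `needle`, or None."""
    for line in lines:
        if needle in line:
            t = parse(line)
            if t is not None:
                return t
    return None


def heartbeat_to_load_failure_gap(fe_lines, be_lines):
    """Seconds between the first FE heartbeat exception and the first
    BE 'No available backends' error; None if either is missing."""
    fe_t = first_match_time(fe_lines, "backend heartbeat got exception", parse_fe_time)
    be_t = first_match_time(be_lines, "No available backends", parse_be_time)
    if fe_t is None or be_t is None:
        return None
    return (be_t - fe_t).total_seconds()
```

Fed the log lines pasted above, `heartbeat_to_load_failure_gap` gives a gap of about 4.3 seconds between the heartbeat exception and the first failed stream load. This only measures timing between the two logs; it does not by itself show what caused the heartbeat to fail.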