Broker Load 导入报错 【io.grpc.StatusRuntimeException: DEADLINE_EXCEEDED】

Viewed 12
  • Doris版本:doris-2.1.5

报错如下:

         JobId: 49579717
         Label: label_20250812_0900_29896
         State: CANCELLED
      Progress: 24.14% (4475/18536)
          Type: BROKER
       EtlInfo: NULL
      TaskInfo: cluster:hdfs_cluster; timeout(s):14400; max_filter_ratio:0.0; priority:NORMAL
      ErrorMsg: type:LOAD_RUN_FAIL; msg:send fragments failed. io.grpc.StatusRuntimeException: DEADLINE_EXCEEDED: deadline exceeded after 34.999926650s. Name resolution delay 0.000000000 seconds. [closed=[], committed=[buffered_nanos=2325753, remote_addr=10.****.209/10.****.209:8060]], host: 10.****.209
    CreateTime: 2025-08-12 09:55:17
  EtlStartTime: 2025-08-12 09:58:37
 EtlFinishTime: 2025-08-12 09:58:37
 LoadStartTime: 2025-08-12 09:58:37
LoadFinishTime: 2025-08-12 10:01:33
           URL: NULL
    JobDetails: {"Unfinished backends":{"2ca43fdb72e4c92-962130f98851ce51":[10237,10043,10238,10240,10041,10241,10239,10242,10133,10364,10301,10302,10362,10300,10263,10244,10243]},"ScannedRows":2854091497,"TaskNumber":1,"LoadBytes":937517390345,"All backends":{"2ca43fdb72e4c92-962130f98851ce51":[10363,27244286,10231,10233,10232,10227,10229,10226,10228,10235,10234,10037,10236,10237,10129,10366,10137,10153,10144,10145,10147,10146,10148,10356,10152,10121,10122,10313,10150,10151,10124,10317,10337,10123,10318,10149,10126,10125,10128,10127,10120,10316,10140,10141,10142,10139,10138,10358,10357,10143,10117,10315,10116,10119,10118,10114,10115,10076,10314,10113,10136,10057,10308,10056,10309,10310,10311,10312,10359,10051,10052,10049,10303,10050,10054,10304,10305,10053,10306,10055,10307,10368,10367,10134,10360,10047,10361,10046,10048,10043,10238,10040,10039,10045,10044,10038,10240,10041,10241,10239,10042,10242,10135,10365,10130,10133,10364,10301,10302,10362,10300,10263,10244,10243]},"FileNumber":8769,"FileSize":1792700575556}

到对应的be节点上查看日志,并有报错,但不理解原因

I20250812 10:01:33.395941 465993 plan_fragment_executor.cpp:553] PlanFragmentExecutor::cancel 2ca43fdb72e4c92-962130f98851ce51|2ca43fdb72e4c92-962130f98851cf79 reason 3 error msg
W20250812 10:01:33.397295 465903 runtime_state.h:209] Task is cancelled, instance: 2ca43fdb72e4c92-962130f98851ce51|2ca43fdb72e4c92-962130f98851cf7a, st = [CANCELLED]
I20250812 10:01:33.394969 465879 plan_fragment_executor.cpp:553] PlanFragmentExecutor::cancel 2ca43fdb72e4c92-962130f98851ce51|2ca43fdb72e4c92-962130f98851cf77 reason 3 error msg
W20250812 10:01:33.398072 465879 runtime_state.h:209] Task is cancelled, instance: 2ca43fdb72e4c92-962130f98851ce51|2ca43fdb72e4c92-962130f98851cf77, st = [CANCELLED]
W20250812 10:01:33.398507 465879 status.h:412] meet error status: [ABORTED]

        0#  doris::PlanFragmentExecutor::cancel(doris::PPlanFragmentCancelReason const&, std::__cxx11::basic_string<char, std::char_traits<char>, std::allocator<char> > const&) at /home/zcp/repo_center/doris_release/doris/be/src/common/status.h:0
        1#  doris::FragmentMgr::cancel_instance(doris::TUniqueId const&, doris::PPlanFragmentCancelReason const&, std::__cxx11::basic_string<char, std::char_traits<char>, std::allocator<char> > const&) at /home/zcp/repo_center/doris_release/doris/be/src/runtime/fragment_mgr.cpp:0
        2#  std::_Function_handler<void (), doris::PInternalServiceImpl::cancel_plan_fragment(google::protobuf::RpcController*, doris::PCancelPlanFragmentRequest const*, doris::PCancelPlanFragmentResult*, google::protobuf::Closure*)::$_0>::_M_invoke(std::_Any_data const&) at /var/local/ldb-toolchain/bin/../lib/gcc/x86_64-linux-gnu/11/../../../../include/c++/11/bits/basic_string.h:187
        3#  doris::WorkThreadPool<false>::work_thread(int) at /var/local/ldb-toolchain/bin/../lib/gcc/x86_64-linux-gnu/11/../../../../include/c++/11/bits/atomic_base.h:646
        4#  execute_native_thread_routine at /data/gcc-11.1.0/build/x86_64-pc-linux-gnu/libstdc++-v3/include/bits/unique_ptr.h:85
        5#  ?
        6#  ?
W20250812 10:01:33.395648 465950 runtime_state.h:209] Task is cancelled, instance: 2ca43fdb72e4c92-962130f98851ce51|2ca43fdb72e4c92-962130f98851cf7b, st = [CANCELLED]
W20250812 10:01:33.397485 465993 runtime_state.h:209] Task is cancelled, instance: 2ca43fdb72e4c92-962130f98851ce51|2ca43fdb72e4c92-962130f98851cf79, st = [CANCELLED]
W20250812 10:01:33.399207 465950 status.h:412] meet error status: [ABORTED]

        0#  doris::PlanFragmentExecutor::cancel(doris::PPlanFragmentCancelReason const&, std::__cxx11::basic_string<char, std::char_traits<char>, std::allocator<char> > const&) at /home/zcp/repo_center/doris_release/doris/be/src/common/status.h:0
        1#  doris::FragmentMgr::cancel_instance(doris::TUniqueId const&, doris::PPlanFragmentCancelReason const&, std::__cxx11::basic_string<char, std::char_traits<char>, std::allocator<char> > const&) at /home/zcp/repo_center/doris_release/doris/be/src/runtime/fragment_mgr.cpp:0
        2#  std::_Function_handler<void (), doris::PInternalServiceImpl::cancel_plan_fragment(google::protobuf::RpcController*, doris::PCancelPlanFragmentRequest const*, doris::PCancelPlanFragmentResult*, google::protobuf::Closure*)::$_0>::_M_invoke(std::_Any_data const&) at /var/local/ldb-toolchain/bin/../lib/gcc/x86_64-linux-gnu/11/../../../../include/c++/11/bits/basic_string.h:187
        3#  doris::WorkThreadPool<false>::work_thread(int) at /var/local/ldb-toolchain/bin/../lib/gcc/x86_64-linux-gnu/11/../../../../include/c++/11/bits/atomic_base.h:646
        4#  execute_native_thread_routine at /data/gcc-11.1.0/build/x86_64-pc-linux-gnu/libstdc++-v3/include/bits/unique_ptr.h:85
        5#  ?
        6#  ?
W20250812 10:01:33.399487 465993 status.h:412] meet error status: [ABORTED]

        0#  doris::PlanFragmentExecutor::cancel(doris::PPlanFragmentCancelReason const&, std::__cxx11::basic_string<char, std::char_traits<char>, std::allocator<char> > const&) at /home/zcp/repo_center/doris_release/doris/be/src/common/status.h:0
        1#  doris::FragmentMgr::cancel_instance(doris::TUniqueId const&, doris::PPlanFragmentCancelReason const&, std::__cxx11::basic_string<char, std::char_traits<char>, std::allocator<char> > const&) at /home/zcp/repo_center/doris_release/doris/be/src/runtime/fragment_mgr.cpp:0
        2#  std::_Function_handler<void (), doris::PInternalServiceImpl::cancel_plan_fragment(google::protobuf::RpcController*, doris::PCancelPlanFragmentRequest const*, doris::PCancelPlanFragmentResult*, google::protobuf::Closure*)::$_0>::_M_invoke(std::_Any_data const&) at /var/local/ldb-toolchain/bin/../lib/gcc/x86_64-linux-gnu/11/../../../../include/c++/11/bits/basic_string.h:187
        3#  doris::WorkThreadPool<false>::work_thread(int) at /var/local/ldb-toolchain/bin/../lib/gcc/x86_64-linux-gnu/11/../../../../include/c++/11/bits/atomic_base.h:646
        4#  execute_native_thread_routine at /data/gcc-11.1.0/build/x86_64-pc-linux-gnu/libstdc++-v3/include/bits/unique_ptr.h:85
        5#  ?
        6#  ?
W20250812 10:01:33.400547 465903 status.h:412] meet error status: [ABORTED]

        0#  doris::PlanFragmentExecutor::cancel(doris::PPlanFragmentCancelReason const&, std::__cxx11::basic_string<char, std::char_traits<char>, std::allocator<char> > const&) at /home/zcp/repo_center/doris_release/doris/be/src/common/status.h:0
        1#  doris::FragmentMgr::cancel_instance(doris::TUniqueId const&, doris::PPlanFragmentCancelReason const&, std::__cxx11::basic_string<char, std::char_traits<char>, std::allocator<char> > const&) at /home/zcp/repo_center/doris_release/doris/be/src/runtime/fragment_mgr.cpp:0


W20250812 10:01:33.433724 464577 vtablet_writer.cpp:587] cancel node channel VNodeChannel[27745951-10358], load_id=2ca43fdb72e4c92-962130f98851ce51, txn_id=34931676, node=10.******.40:8060, error message: [CANCELLED]Cancelled
W20250812 10:01:33.434144 464577 vtablet_writer.cpp:587] cancel node channel VNodeChannel[27745951-10231], load_id=2ca43fdb72e4c92-962130f98851ce51, txn_id=34931676, node=10.******.167:8060, error message: [CANCELLED]Cancelled
W20250812 10:01:33.438495 466080 status.h:431] meet error status: [INTERNAL_ERROR]PStatus: (10.******.220)[INTERNAL_ERROR]fail to add batch in load channel. unknown load_id=02ca43fdb72e4c92-962130f98851ce51

        0#  doris::Status doris::Status::create<true>(doris::PStatus const&) at /var/local/ldb-toolchain/bin/../lib/gcc/x86_64-linux-gnu/11/../../../../include/c++/11/bits/basic_string.h:187
        1#  doris::vectorized::VNodeChannel::_add_block_success_callback(doris::PTabletWriterAddBlockResult const&, doris::vectorized::WriteBlockCallbackContext const&) at /home/zcp/repo_center/doris_release/doris/be/src/common/status.h:481
        2#  std::_Function_handler<void (doris::PTabletWriterAddBlockResult const&, doris::vectorized::WriteBlockCallbackContext const&), doris::vectorized::VNodeChannel::init(doris::RuntimeState*)::$_1>::_M_invoke(std::_Any_data const&, doris::PTabletWriterAddBlockResult const&, doris::vectorized::WriteBlockCallbackContext const&) at /var/local/ldb-toolchain/bin/../lib/gcc/x86_64-linux-gnu/11/../../../../include/c++/11/ext/atomicity.h:98
        3#  doris::vectorized::WriteBlockCallback<doris::PTabletWriterAddBlockResult>::call() at /var/local/ldb-toolchain/bin/../lib/gcc/x86_64-linux-gnu/11/../../../../include/c++/11/bits/std_function.h:0
        4#  doris::AutoReleaseClosure<doris::PTabletWriterAddBlockRequest, doris::vectorized::WriteBlockCallback<doris::PTabletWriterAddBlockResult> >::Run() at /var/local/ldb-toolchain/bin/../lib/gcc/x86_64-linux-gnu/11/../../../../include/c++/11/ext/atomicity.h:98
        5#  brpc::Controller::EndRPC(brpc::Controller::CompletionInfo const&)
        6#  brpc::policy::ProcessRpcResponse(brpc::InputMessageBase*)
        7#  brpc::ProcessInputMessage(void*)
        8#  bthread::TaskGroup::task_runner(long)
        9#  bthread_make_fcontext
W20250812 10:01:33.439035 466080 vtablet_writer.cpp:587] cancel node channel VNodeChannel[27745951-10317], load_id=2ca43fdb72e4c92-962130f98851ce51, txn_id=34931676, node=10.******.220:8060, error message: VNodeChannel[27745951-10317], load_id=2ca43fdb72e4c92-962130f98851ce51, txn_id=34931676, node=10.******.220:8060, add batch req success but status isn't ok, err: [INTERNAL_ERROR]PStatus: (10.******.220)[INTERNAL_ERROR]fail to add batch in load channel. unknown load_id=02ca43fdb72e4c92-962130f98851ce51

        0#  doris::Status doris::Status::create<true>(doris::PStatus const&) at /var/local/ldb-toolchain/bin/../lib/gcc/x86_64-linux-gnu/11/../../../../include/c++/11/bits/basic_string.h:187
        1#  doris::vectorized::VNodeChannel::_add_block_success_callback(doris::PTabletWriterAddBlockResult const&, doris::vectorized::WriteBlockCallbackContext const&) at /home/zcp/repo_center/doris_release/doris/be/src/common/status.h:481
        2#  std::_Function_handler<void (doris::PTabletWriterAddBlockResult const&, doris::vectorized::WriteBlockCallbackContext const&), doris::vectorized::VNodeChannel::init(doris::RuntimeState*)::$_1>::_M_invoke(std::_Any_data const&, doris::PTabletWriterAddBlockResult const&, doris::vectorized::WriteBlockCallbackContext const&) at /var/local/ldb-toolchain/bin/../lib/gcc/x86_64-linux-gnu/11/../../../../include/c++/11/ext/atomicity.h:98
        3#  doris::vectorized::WriteBlockCallback<doris::PTabletWriterAddBlockResult>::call() at /var/local/ldb-toolchain/bin/../lib/gcc/x86_64-linux-gnu/11/../../../../include/c++/11/bits/std_function.h:0
        4#  doris::AutoReleaseClosure<doris::PTabletWriterAddBlockRequest, doris::vectorized::WriteBlockCallback<doris::PTabletWriterAddBlockResult> >::Run() at /var/local/ldb-toolchain/bin/../lib/gcc/x86_64-linux-gnu/11/../../../../include/c++/11/ext/atomicity.h:98
        5#  brpc::Controller::EndRPC(brpc::Controller::CompletionInfo const&)
        6#  brpc::policy::ProcessRpcResponse(brpc::InputMessageBase*)
        7#  brpc::ProcessInputMessage(void*)
        8#  bthread::TaskGroup::task_runner(long)
        9#  bthread_make_fcontext
W20250812 10:01:33.439509 466080 status.h:431] meet error status: [INTERNAL_ERROR]PStatus: (10.******.220)[INTERNAL_ERROR]fail to add batch in load channel. unknown load_id=02ca43fdb72e4c92-962130f98851ce51

        0#  doris::Status doris::Status::create<true>(doris::PStatus const&) at /var/local/ldb-toolchain/bin/../lib/gcc/x86_64-linux-gnu/11/../../../../include/c++/11/bits/basic_string.h:187
        1#  void doris::AutoReleaseClosure<doris::PTabletWriterAddBlockRequest, doris::vectorized::WriteBlockCallback<doris::PTabletWriterAddBlockResult> >::_process_status<doris::PTabletWriterAddBlockResult>(doris::PTabletWriterAddBlockResult*) at /home/zcp/repo_center/doris_release/doris/be/src/common/status.h:481
        2#  doris::AutoReleaseClosure<doris::PTabletWriterAddBlockRequest, doris::vectorized::WriteBlockCallback<doris::PTabletWriterAddBlockResult> >::Run() at /home/zcp/repo_center/doris_release/doris/be/src/util/ref_count_closure.h:91
        3#  brpc::Controller::EndRPC(brpc::Controller::CompletionInfo const&)
        4#  brpc::policy::ProcessRpcResponse(brpc::InputMessageBase*)
        5#  brpc::ProcessInputMessage(void*)
        6#  bthread::TaskGroup::task_runner(long)
        7#  bthread_make_fcontext
W20250812 10:01:33.439718 466080 ref_count_closure.h:119] RPC meet error status: [INTERNAL_ERROR]PStatus: (10.******.220)[INTERNAL_ERROR]fail to add batch in load channel. unknown load_id=02ca43fdb72e4c92-962130f98851ce51
        0#  doris::Status doris::Status::create<true>(doris::PStatus const&) at /var/local/ldb-toolchain/bin/../lib/gcc/x86_64-linux-gnu/11/../../../../include/c++/11/bits/basic_string.h:187
        1#  void doris::AutoReleaseClosure<doris::PTabletWriterAddBlockRequest, doris::vectorized::WriteBlockCallback<doris::PTabletWriterAddBlockResult> >::_process_status<doris::PTabletWriterAddBlockResult>(doris::PTabletWriterAddBlockResult*) at /home/zcp/repo_center/doris_release/doris/be/src/common/status.h:481
        2#  doris::AutoReleaseClosure<doris::PTabletWriterAddBlockRequest, doris::vectorized::WriteBlockCallback<doris::PTabletWriterAddBlockResult> >::Run() at /home/zcp/repo_center/doris_release/doris/be/src/util/ref_count_closure.h:91
        3#  brpc::Controller::EndRPC(brpc::Controller::CompletionInfo const&)
        4#  brpc::policy::ProcessRpcResponse(brpc::InputMessageBase*)
        5#  brpc::ProcessInputMessage(void*)
        6#  bthread::TaskGroup::task_runner(long)
        7#  bthread_make_fcontext
W20250812 10:01:33.439718 466080 ref_count_closure.h:119] RPC meet error status: [INTERNAL_ERROR]PStatus: (10.******.220)[INTERNAL_ERROR]fail to add batch in load channel. unknown load_id=02ca43fdb72e4c92-962130f98851ce51

        0#  doris::Status doris::Status::create<true>(doris::PStatus const&) at /var/local/ldb-toolchain/bin/../lib/gcc/x86_64-linux-gnu/11/../../../../include/c++/11/bits/basic_string.h:187
        1#  void doris::AutoReleaseClosure<doris::PTabletWriterAddBlockRequest, doris::vectorized::WriteBlockCallback<doris::PTabletWriterAddBlockResult> >::_process_status<doris::PTabletWriterAddBlockResult>(doris::PTabletWriterAddBlockResult*) at /home/zcp/repo_center/doris_release/doris/be/src/common/status.h:481
        2#  doris::AutoReleaseClosure<doris::PTabletWriterAddBlockRequest, doris::vectorized::WriteBlockCallback<doris::PTabletWriterAddBlockResult> >::Run() at /home/zcp/repo_center/doris_release/doris/be/src/util/ref_count_closure.h:91
        3#  brpc::Controller::EndRPC(brpc::Controller::CompletionInfo const&)
        4#  brpc::policy::ProcessRpcResponse(brpc::InputMessageBase*)
        5#  brpc::ProcessInputMessage(void*)
        6#  bthread::TaskGroup::task_runner(long)
        7#  bthread_make_fcontext
W20250812 10:01:33.440117 464577 vtablet_writer.cpp:587] cancel node channel VNodeChannel[27745951-10228], load_id=2ca43fdb72e4c92-962130f98851ce51, txn_id=34931676, node=10.******.162:8060, error message: [CANCELLED]Cancelled
2 Answers

这个问题是 RPC 太多了导致,比看下有没有什么 add black list 的类似的报错。
有的话可以先禁用掉黑名单。
disable_backend_black_list

在fe上发现 ignore backend black list for backend: 10312, disabled: true 异常信息,backend:10312 对应任务失败的节点

2025-08-12 00:01:49,115 WARN (backend-rpc-callback-2|527630) [Coordinator$BackendExecState$1.onSuccess():3233] Failed to cancel query c214103e04f444d0-b117c93111bf4e77 instance initiated=true done=false backend: 10244,fragment instance id=c214103e04f444d0-b117c93111bf5065, reason: without status
2025-08-12 00:01:49,115 WARN (loading-load-task-scheduler_pool-2|515270) [SimpleScheduler.addToBlacklist():175] ignore backend black list for backend: 10312, disabled: true
2025-08-12 00:01:49,115 WARN (thrift-server-pool-411011|2599714) [Coordinator.updateFragmentExecStatus():2707] Instance c214103e04f444d0-b117c93111bf4e77 of query c214103e04f444d0-b117c93111bf4fc7 report failed status, error msg: Status [errorCode=CANCELLED, errorMsg=(10.******.201)[CANCELLED]Cancelled]
2025-08-12 00:01:49,116 WARN (backend-rpc-callback-31|527678) [Coordinator$BackendExecState$1.onSuccess():3233] Failed to cancel query c214103e04f444d0-b117c93111bf4e77 instance initiated=true done=false backend: 10244,fragment instance id=c214103e04f444d0-b117c93111bf5067, reason: without status
2025-08-12 00:01:49,115 WARN (loading-load-task-scheduler_pool-2|515270) [LoadTask.exec():96] LOAD_JOB=49564035, error_msg={Unexpected failed to execute load task}
org.apache.doris.rpc.RpcException: send fragments failed. io.grpc.StatusRuntimeException: DEADLINE_EXCEEDED: deadline exceeded after 34.999936785s. Name resolution delay 0.000000000 seconds. [closed=[], committed=[remote_addr=10.******.209/10.******.209:8060]], host: 10.******.209
        at org.apache.doris.qe.Coordinator.waitRpc(Coordinator.java:1139) ~[doris-fe.jar:1.2-SNAPSHOT]
        at org.apache.doris.qe.Coordinator.sendFragment(Coordinator.java:892) ~[doris-fe.jar:1.2-SNAPSHOT]
        at org.apache.doris.qe.Coordinator.execInternal(Coordinator.java:752) ~[doris-fe.jar:1.2-SNAPSHOT]
        at org.apache.doris.qe.Coordinator.exec(Coordinator.java:672) ~[doris-fe.jar:1.2-SNAPSHOT]
        at org.apache.doris.load.loadv2.LoadLoadingTask.actualExecute(LoadLoadingTask.java:196) ~[doris-fe.jar:1.2-SNAPSHOT]
        at org.apache.doris.load.loadv2.LoadLoadingTask.executeOnce(LoadLoadingTask.java:179) ~[doris-fe.jar:1.2-SNAPSHOT]
        at org.apache.doris.load.loadv2.LoadLoadingTask.executeTask(LoadLoadingTask.java:139) ~[doris-fe.jar:1.2-SNAPSHOT]
        at org.apache.doris.load.loadv2.LoadTask.exec(LoadTask.java:86) ~[doris-fe.jar:1.2-SNAPSHOT]
        at org.apache.doris.task.MasterTask.run(MasterTask.java:31) ~[doris-fe.jar:1.2-SNAPSHOT]
        at java.util.concurrent.Executors$RunnableAdapter.call(Executors.java:511) ~[?:1.8.0_91]
        at java.util.concurrent.FutureTask.run(FutureTask.java:266) ~[?:1.8.0_91]
        at java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1142) ~[?:1.8.0_91]
        at java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:617) ~[?:1.8.0_91]
        at java.lang.Thread.run(Thread.java:745) ~[?:1.8.0_91]
Caused by: java.util.concurrent.ExecutionException: io.grpc.StatusRuntimeException: DEADLINE_EXCEEDED: deadline exceeded after 34.999936785s. Name resolution delay 0.000000000 seconds. [closed=[], committed=[remote_addr=10.******.209/10.******.209:8060]]
        at com.google.common.util.concurrent.AbstractFuture.getDoneValue(AbstractFuture.java:592) ~[guava-32.1.2-jre.jar:?]
        at com.google.common.util.concurrent.AbstractFuture.get(AbstractFuture.java:443) ~[guava-32.1.2-jre.jar:?]
        at org.apache.doris.qe.Coordinator.waitRpc(Coordinator.java:1099) ~[doris-fe.jar:1.2-SNAPSHOT]
        ... 13 more
Caused by: io.grpc.StatusRuntimeException: DEADLINE_EXCEEDED: deadline exceeded after 34.999936785s. Name resolution delay 0.000000000 seconds. [closed=[], committed=[remote_addr=10.******.209/10.******.209:8060]]
        at io.grpc.Status.asRuntimeException(Status.java:537) ~[grpc-api-1.60.1.jar:1.60.1]
        at io.grpc.stub.ClientCalls$UnaryStreamToFuture.onClose(ClientCalls.java:538) ~[grpc-stub-1.60.1.jar:1.60.1]
        at io.grpc.internal.ClientCallImpl.closeObserver(ClientCallImpl.java:574) ~[grpc-core-1.60.1.jar:1.60.1]
        at io.grpc.internal.ClientCallImpl.access$300(ClientCallImpl.java:72) ~[grpc-core-1.60.1.jar:1.60.1]
        at io.grpc.internal.ClientCallImpl$ClientStreamListenerImpl$1StreamClosed.runInternal(ClientCallImpl.java:742) ~[grpc-core-1.60.1.jar:1.60.1]
        at io.grpc.internal.ClientCallImpl$ClientStreamListenerImpl$1StreamClosed.runInContext(ClientCallImpl.java:723) ~[grpc-core-1.60.1.jar:1.60.1]
        at io.grpc.internal.ContextRunnable.run(ContextRunnable.java:37) ~[grpc-core-1.60.1.jar:1.60.1]
        at io.grpc.internal.SerializingExecutor.run(SerializingExecutor.java:133) ~[grpc-core-1.60.1.jar:1.60.1]
        ... 3 more
2025-08-12 00:01:49,116 WARN (loading-load-task-scheduler_pool-2|515270) [LoadJob.unprotectedExecuteCancel():554] LOAD_JOB=49564035, transaction_id={34914016}, error_msg={Failed to execute load with error: send fragments failed. io.grpc.StatusRuntimeException: DEADLINE_EXCEEDED: deadline exceeded after 34.999936785s. Name resolution delay 0.000000000 seconds. [closed=[], committed=[remote_addr=10.******.209/10.******.209:8060]], host: 10.******.209}
2025-08-12 00:01:49,116 WARN (backend-rpc-callback-30|527677) [Coordinator$BackendExecState$1.onSuccess():3233] Failed to cancel query c214103e04f444d0-b117c93111bf4e77 instance initiated=true done=false backend: 10243,fragment instance id=c214103e04f444d0-b117c93111bf5068, reason: without status
2025-08-12 00:01:49,116 INFO (loading-load-task-scheduler_pool-2|515270) [TxnStateCallbackFactory.removeCallback():44] remove callback of txn state : 49564035. current callback size: 1
2025-08-12 00:01:49,116 WARN (backend-rpc-callback-28|527675) [Coordinator$BackendExecState$1.onSuccess():3233] Failed to cancel query c214103e04f444d0-b117c93111bf4e77 instance initiated=true done=false backend: 10243,fragment instance id=c214103e04f444d0-b117c93111bf5069, reason: without status
2025-08-12 00:01:49,116 WARN (backend-rpc-callback-18|527660) [Coordinator$BackendExecState$1.onSuccess():3233] Failed to cancel query c214103e04f444d0-b117c93111bf4e77 instance initiated=true done=false backend: 10243,fragment instance id=c214103e04f444d0-b117c93111bf506a, reason: without status
2025-08-12 00:01:49,116 WARN (backend-rpc-callback-27|527674) [Coordinator$BackendExecState$1.onSuccess():3233] Failed to cancel query c214103e04f444d0-b117c93111bf4e77 instance initiated=true done=false backend: 10243,fragment instance id=c214103e04f444d0-b117c93111bf506b, reason: without status
2025-08-12 00:01:49,118 INFO (loading-load-task-scheduler_pool-2|515270) [DatabaseTransactionMgr.abortTransaction():1621] abort transaction: TransactionState. transaction id: 34914016, label: label_20250811_2115_50233, db id: 322891, table id list: 27745950, callback id: 49564035, coordinator: FE: 10.229.163.136, transaction status: ABORTED, error replicas num: 0, replica ids: , prepare time: 1754927719415, commit time: -1, finish time: 1754928109116, reason: send fragments failed. io.grpc.StatusRuntimeException: DEADLINE_EXCEEDED: deadline exceeded after 34.999936785s. Name resolution delay 0.000000000 seconds. [closed=[], committed=[remote_addr=10.229.163.209/10.229.163.209:8060]], host: 10.229.163.209 successfully
2025-08-12 00:01:49,268 INFO (report-thread|240) [ReportHandler.storagePolicyReport():317] backend[10138] reports policies [], report resources: []
2025-08-12 00:01:49,268 INFO (report-thread|240) [ReportHandler.tabletReport():426] backend[10138] reports 15317 tablet(s). report version: 17419472858933
2025-08-12 00:01:49,268 INFO (thrift-server-pool-523|907) [ReportHandler.handleReport():206] receive report from be 10138. type: TABLET, current queue size: 1
2025-08-12 00:01:49,282 INFO (report-thread|240) [TabletInvertedIndex.tabletReport():376] finished to do tablet diff with backend[10138]. fe tablet num: 15317, backend tablet num: 15317. sync: 0. metaDel: 0. foundInMeta: 15317. migration: 0. backend partition num: 3850, backend need update: 0. found invalid transactions 0. found republish transactions 0. tabletToUpdate: 0. need recovery: 0. cost: 3 ms