BE crashes frequently, 2.1.6


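Version: 2.1.6
In two clusters, different BE nodes crash frequently. Looking up the query_id for each crash shows a different SQL statement each time, so the crash is probably not directly related to any particular SQL. These statements run more than a million times a day, and only a few executions bring a BE down.
I found a post on the forum describing the same problem: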
https://doris-forum.org.cn/questions/D16c2/ge-jie-dian-quan-bu-gua-le

*** is nereids: 0 ***
*** tablet id: 0 ***
*** Aborted at 1756101946 (unix time) try "date -d @1756101946" if you are using GNU date ***
*** Current BE git commitID: 653e315ba5 ***
*** SIGSEGV address not mapped to object (@0x2c8) received by PID 196378 (TID 198774 OR 0x7f40346f3700) from PID 712; stack trace: ***
 0# doris::signal::(anonymous namespace)::FailureSignalHandler(int, siginfo_t*, void*) at /home/zcp/repo_center/doris_release/doris/be/src/common/signal_handler.h:421
 1# os::Linux::chained_handler(int, siginfo*, void*) in /mnt/disk0/env/jdk1.8.0_231/jre/lib/amd64/server/libjvm.so
 2# JVM_handle_linux_signal in /mnt/disk0/env/jdk1.8.0_231/jre/lib/amd64/server/libjvm.so
 3# signalHandler(int, siginfo*, void*) in /mnt/disk0/env/jdk1.8.0_231/jre/lib/amd64/server/libjvm.so
 4# 0x00007F5368C4E280 in /lib64/libc.so.6
 5# doris::pipeline::PriorityTaskQueue::push(doris::pipeline::PipelineTask*) at /home/zcp/repo_center/doris_release/doris/be/src/pipeline/task_queue.cpp:115
 6# doris::pipeline::MultiCoreTaskQueue::push_back(doris::pipeline::PipelineTask*, int) at /home/zcp/repo_center/doris_release/doris/be/src/pipeline/task_queue.cpp:217
 7# doris::pipeline::MultiCoreTaskQueue::push_back(doris::pipeline::PipelineTask*) at /home/zcp/repo_center/doris_release/doris/be/src/pipeline/task_queue.cpp:209
 8# doris::pipeline::TaskScheduler::schedule_task(doris::pipeline::PipelineTask*) at /home/zcp/repo_center/doris_release/doris/be/src/pipeline/task_scheduler.cpp:224
 9# doris::pipeline::PipelineXFragmentContext::submit() at /home/zcp/repo_center/doris_release/doris/be/src/pipeline/pipeline_x/pipeline_x_fragment_context.cpp:1418
10# doris::FragmentMgr::exec_plan_fragment(doris::TPipelineFragmentParams const&, doris::QuerySource, std::function<void (doris::RuntimeState*, doris::Status*)> const&) in /d/p1/env/doris/be/lib/doris_be
11# doris::FragmentMgr::exec_plan_fragment(doris::TPipelineFragmentParams const&, doris::QuerySource) at /home/zcp/repo_center/doris_release/doris/be/src/runtime/fragment_mgr.cpp:685
12# doris::PInternalServiceImpl::_exec_plan_fragment_impl(std::__cxx11::basic_string<char, std::char_traits<char>, std::allocator<char> > const&, doris::PFragmentRequestVersion, bool, std::function<void (doris::RuntimeState*, doris::Status*)> const&) in /d/p1/env/doris/be/lib/doris_be
13# doris::PInternalServiceImpl::_exec_plan_fragment_in_pthread(google::protobuf::RpcController*, doris::PExecPlanFragmentRequest const*, doris::PExecPlanFragmentResult*, google::protobuf::Closure*) at /home/zcp/repo_center/doris_release/doris/be/src/service/internal_service.cpp:328
14# doris::WorkThreadPool<false>::work_thread(int) at /home/zcp/repo_center/doris_release/doris/be/src/util/work_thread_pool.hpp:159
15# execute_native_thread_routine at ../../../../../libstdc++-v3/src/c++11/thread.cc:84
16# start_thread in /lib64/libpthread.so.0
17# __clone in /lib64/libc.so.6

1 Answer