Doris v2.1.6: one BE node restarted abnormally


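Problem: one BE node restarted.
Time: 2026-01-03 16:21:35 (crash) to 16:35:39 (recovery)
Symptom: the BE process on this node hit a SIGSEGV (segmentation fault) and restarted
Impact: queries assigned to this node failed, and cluster stability was affected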
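
The crash log records the abort time as a Unix timestamp (`Aborted at 1767428495`). As a minimal sketch, the following Python converts it to local time and computes the downtime up to the reported recovery time. It assumes the node runs in CST (UTC+8), as the `CST 2026` lines in the restart log suggest; the helper name `abort_time` is hypothetical.

```python
from datetime import datetime, timedelta, timezone

# Assumption: the BE host's local time zone is CST (UTC+8).
CST = timezone(timedelta(hours=8), name="CST")


def abort_time(epoch_seconds: int) -> datetime:
    """Convert the 'Aborted at <unix time>' value from the BE log to local time."""
    return datetime.fromtimestamp(epoch_seconds, tz=CST)


# Value taken from the "*** Aborted at 1767428495 (unix time) ***" log line.
crash = abort_time(1767428495)

# Recovery time as reported in the post (second BE start in the log).
recovered = datetime(2026, 1, 3, 16, 35, 39, tzinfo=CST)

downtime = recovered - crash

print(crash.strftime("%Y-%m-%d %H:%M:%S %Z"))  # 2026-01-03 16:21:35 CST
print(downtime)                                 # 0:14:04
```

This is equivalent to the `date -d @1767428495` hint printed by the signal handler, and confirms that the crash and recovery times above are about 14 minutes apart.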

Detailed log (the version is 2.1.6; it was upgraded from 2.1.3 at the time, and the installation directory was not changed):

Call to AttachCurrentThread failed with error: -1
getJNIEnv: getGlobalJNIEnv failed
Call to AttachCurrentThread failed with error: -1
getJNIEnv: getGlobalJNIEnv failed
Call to AttachCurrentThread failed with error: -1
getJNIEnv: getGlobalJNIEnv failed
Call to AttachCurrentThread failed with error: -1
getJNIEnv: getGlobalJNIEnv failed
*** Query id: 0-0 ***
*** is nereids: 0 ***
*** tablet id: 0 ***
*** Aborted at 1767428495 (unix time) try "date -d @1767428495" if you are using GNU date ***
*** Current BE git commitID: 653e315ba5 ***
*** SIGSEGV address not mapped to object (@0x7e59af7e3000) received by PID 3629 (TID 708 OR 0x7e59af7e2700) from PID 18446744072358866944; stack trace: ***
 0# doris::signal::(anonymous namespace)::FailureSignalHandler(int, siginfo_t*, void*) at /home/zcp/repo_center/doris_release/doris/be/src/common/signal_handler.h:421
 1# os::Linux::chained_handler(int, siginfo*, void*) in /opt/programs/java/jre/lib/amd64/server/libjvm.so
 2# JVM_handle_linux_signal in /opt/programs/java/jre/lib/amd64/server/libjvm.so
 3# signalHandler(int, siginfo*, void*) in /opt/programs/java/jre/lib/amd64/server/libjvm.so
 4# 0x00007FD95B377400 in /lib64/libc.so.6
 5# toStringEveryLineImpl(std::__cxx11::basic_string<char, std::char_traits<char>, std::allocator<char> >, StackTraceRefTriple const&, std::function<void (std::basic_string_view<char, std::char_traits<char> >)>) at /home/zcp/repo_center/doris_release/doris/be/src/common/stack_trace.cpp:376
 6# toStringCached(std::array<void*, 45ul> const&, unsigned long, unsigned long, std::__cxx11::basic_string<char, std::char_traits<char>, std::allocator<char> > const&) at /home/zcp/repo_center/doris_release/doris/be/src/common/stack_trace.cpp:456
 7# StackTrace::toString(int, std::__cxx11::basic_string<char, std::char_traits<char>, std::allocator<char> > const&) const at /home/zcp/repo_center/doris_release/doris/be/src/common/stack_trace.cpp:472
 8# doris::get_stack_trace_by_libunwind(int, std::__cxx11::basic_string<char, std::char_traits<char>, std::allocator<char> > const&) at /home/zcp/repo_center/doris_release/doris/be/src/util/stack_util.cpp:88
 9# doris::get_stack_trace(int, std::__cxx11::basic_string<char, std::char_traits<char>, std::allocator<char> >) at /home/zcp/repo_center/doris_release/doris/be/src/util/stack_util.cpp:54
10# doris::Status doris::Status::Error<6, true, std::__cxx11::basic_string<char, std::char_traits<char>, std::allocator<char> >, char const*>(std::basic_string_view<char, std::char_traits<char> >, std::__cxx11::basic_string<char, std::char_traits<char>, std::allocator<char> >&&, char const*&&) at /home/zcp/repo_center/doris_release/doris/be/src/common/status.h:412
11# doris::FragmentMgr::coordinator_callback(doris::ReportStatusRequest const&) in /opt/programs/doris-2.1.3/apache-doris-2.1.3-bin-x64/be/lib/doris_be
12# std::_Function_handler<void (), doris::FragmentMgr::trigger_pipeline_context_report(doris::ReportStatusRequest, std::shared_ptr<doris::pipeline::PipelineFragmentContext>&&)::$_0>::_M_invoke(std::_Any_data const&) at /var/local/ldb-toolchain/bin/../lib/gcc/x86_64-linux-gnu/11/../../../../include/c++/11/bits/std_function.h:291
13# doris::ThreadPool::dispatch_thread() in /opt/programs/doris-2.1.3/apache-doris-2.1.3-bin-x64/be/lib/doris_be
14# doris::Thread::supervise_thread(void*) at /home/zcp/repo_center/doris_release/doris/be/src/util/thread.cpp:499
15# start_thread in /lib64/libpthread.so.0
16# __clone in /lib64/libc.so.6
 
StdoutLogger 2026-01-03 16:34:01,593 Start time: Sat Jan  3 16:34:01 CST 2026
INFO: java_cmd /opt/programs/java/bin/java
INFO: jdk_version 8
SLF4J: Class path contains multiple SLF4J bindings.
SLF4J: Found binding in [jar:file:/opt/programs/doris-2.1.3/apache-doris-2.1.3-bin-x64/be/lib/java_extensions/preload-extensions/preload-extensions-jar-with-dependencies.jar!/org/slf4j/impl/StaticLoggerBinder.class]
SLF4J: Found binding in [jar:file:/opt/programs/doris-2.1.3/apache-doris-2.1.3-bin-x64/be/lib/java_extensions/java-udf/java-udf-jar-with-dependencies.jar!/org/slf4j/impl/StaticLoggerBinder.class]
SLF4J: Found binding in [jar:file:/opt/programs/doris-2.1.3/apache-doris-2.1.3-bin-x64/be/lib/hadoop_hdfs/common/lib/slf4j-reload4j-1.7.36.jar!/org/slf4j/impl/StaticLoggerBinder.class]
SLF4J: See http://www.slf4j.org/codes.html#multiple_bindings for an explanation.
SLF4J: Actual binding is of type [org.slf4j.impl.Reload4jLoggerFactory]
StdoutLogger 2026-01-03 16:35:39,344 Start time: Sat Jan  3 16:35:39 CST 2026
INFO: java_cmd /opt/programs/java/bin/java
INFO: jdk_version 8
SLF4J: Class path contains multiple SLF4J bindings.
SLF4J: Found binding in [jar:file:/opt/programs/doris-2.1.3/apache-doris-2.1.3-bin-x64/be/lib/java_extensions/preload-extensions/preload-extensions-jar-with-dependencies.jar!/org/slf4j/impl/StaticLoggerBinder.class]
SLF4J: Found binding in [jar:file:/opt/programs/doris-2.1.3/apache-doris-2.1.3-bin-x64/be/lib/java_extensions/java-udf/java-udf-jar-with-dependencies.jar!/org/slf4j/impl/StaticLoggerBinder.class]
SLF4J: Found binding in [jar:file:/opt/programs/doris-2.1.3/apache-doris-2.1.3-bin-x64/be/lib/hadoop_hdfs/common/lib/slf4j-reload4j-1.7.36.jar!/org/slf4j/impl/StaticLoggerBinder.class]
SLF4J: See http://www.slf4j.org/codes.html#multiple_bindings for an explanation.
SLF4J: Actual binding is of type [org.slf4j.impl.Reload4jLoggerFactory]