In our cluster, one BE node went down after logging the error below, and was then restarted by the supervisor we have configured.
F20250506 16:32:42.203564 117278 pending_rowset_helper.cpp:43] Check failed: !_pending_rowset_set || (_rowset_id == other._rowset_id && _pending_rowset_set == other._pending_rowset_set) 02000000005a51995b4a5130c76683032293946c0b3ffebf 02000000005a51995b4a5130c76683032293946c0b3ffebf 0x7f48f1de7ba0 0
I checked dmesg and there was no OOM.
What caused this crash, and how should I go about tracking it down?
W20250506 16:32:36.882881 117225 task_worker_pool.cpp:1582] failed to publish version|signature=88045048|transaction_id=88045048|error_tablets_num=2|error=[E-914]could not find related rowset for tablet 195426523, txn id 88045048
0# doris::EnginePublishVersionTask::execute() at /home/zcp/repo_center/doris_release/doris/be/src/common/status.h:380
1# doris::PublishVersionWorkerPool::publish_version_callback(doris::TAgentTaskRequest const&) at /home/zcp/repo_center/doris_release/doris/be/src/common/status.h:380
2# std::_Function_handler<void (), doris::TaskWorkerPool::submit_task(doris::TAgentTaskRequest const&)::$_0::operator()<doris::TAgentTaskRequest const&>(doris::TAgentTaskRequest const&) const::{lambda()#1}>::_M_invoke(std::_Any_data const&) at /home/zcp/repo_center/doris_release/doris/be/src/agent/task_worker_pool.cpp:445
3# doris::ThreadPool::dispatch_thread() at /home/zcp/repo_center/doris_release/doris/be/src/util/threadpool.cpp:0
4# doris::Thread::supervise_thread(void*) at /var/local/ldb-toolchain/bin/../usr/include/pthread.h:562
5# start_thread
6# clone
W20250506 16:32:42.203296 118285 status.h:415] meet error status: [CORRUPTION]segment num mismatch in tablet 196712547, expected: 2, actual: 1, load_id: f27310b7b58f4116-9c7d060162aef708
0# doris::TabletStream::pre_close() at /home/zcp/repo_center/doris_release/doris/be/src/common/status.h:494
1# doris::IndexStream::close(std::vector<doris::PTabletID, std::allocator<doris::PTabletID> > const&, std::vector<long, std::allocator<long> >*, std::vector<std::pair<long, doris::Status>, std::allocator<std::pair<long, doris::Status> > >*) at /home/zcp/repo_center/doris_release/doris/be/src/runtime/load_stream.cpp:362
2# doris::LoadStream::close(long, std::vector<doris::PTabletID, std::allocator<doris::PTabletID> > const&, std::vector<long, std::allocator<long> >*, std::vector<std::pair<long, doris::Status>, std::allocator<std::pair<long, doris::Status> > >*) at /home/zcp/repo_center/doris_release/doris/be/src/runtime/load_stream.cpp:449
3# doris::LoadStream::_dispatch(unsigned long, doris::PStreamHeader const&, butil::IOBuf*) at /home/zcp/repo_center/doris_release/doris/be/src/common/status.h:352
4# doris::LoadStream::on_received_messages(unsigned long, butil::IOBuf* const*, unsigned long) at /home/zcp/repo_center/doris_release/doris/be/src/runtime/load_stream.cpp:586
5# brpc::Stream::Consume(void*, bthread::TaskIterator<butil::IOBuf*>&)
6# bthread::ExecutionQueueBase::_execute(bthread::TaskNode*, bool, int*)
7# bthread::ExecutionQueueBase::_execute_tasks(void*)
8# bthread::TaskGroup::task_runner(long)
9# bthread_make_fcontext
F20250506 16:32:42.203564 117278 pending_rowset_helper.cpp:43] Check failed: !_pending_rowset_set || (_rowset_id == other._rowset_id && _pending_rowset_set == other._pending_rowset_set) 02000000005a51995b4a5130c76683032293946c0b3ffebf 02000000005a51995b4a5130c76683032293946c0b3ffebf 0x7f48f1de7ba0 0
W20250506 16:33:02.577243 249274 timezone_utils.cpp:98] Meet illegal tzdata file: iso3166.tab. skipped
W20250506 16:33:02.736289 249274 timezone_utils.cpp:98] Meet illegal tzdata file: leapseconds. skipped
W20250506 16:33:02.737365 249274 timezone_utils.cpp:98] Meet illegal tzdata file: tzdata.zi. skipped
W20250506 16:33:02.737718 249274 timezone_utils.cpp:98] Meet illegal tzdata file: zone.tab. skipped
W20250506 16:33:02.738059 249274 timezone_utils.cpp:98] Meet illegal tzdata file: zone1970.tab. skipped
E20250506 16:33:04.024732 249274 variable.cpp:179] Already exposed `doris_cache_data_page_cache' whose value is `0'
E20250506 16:33:04.024762 249274 variable.cpp:179] Already exposed `doris_cache_data_page_cache_persecond' whose value is `0'
E20250506 16:33:04.024931 249274 variable.cpp:179] Already exposed `doris_cache_index_page_cache' whose value is `0'
E20250506 16:33:04.024942 249274 variable.cpp:179] Already exposed `doris_cache_index_page_cache_persecond' whose value is `0'
E20250506 16:33:04.025092 249274 variable.cpp:179] Already exposed `doris_cache_pkindex_page_cache' whose value is `0'
E20250506 16:33:04.025104 249274 variable.cpp:179] Already exposed `doris_cache_pkindex_page_cache_persecond' whose value is `0'
E20250506 16:33:04.025195 249274 variable.cpp:179] Already exposed `doris_cache_point_query_row_cache' whose value is `0'