doris-2.1.5 be启动报错,rocksdb sst文件丢失

Viewed 61

doris版本:2.1.5
报错信息:
I20250124 10:34:01.112049 36152 olap_meta.cpp:73] [Rocksdb] [db/db_impl_open.cc:1219] DB pointer 0x7ff44d2d1000
W20250124 10:34:01.746953 36153 status.h:412] meet error status: [E-3001]rocks db open failed, reason: Corruption: Can't access /000071.sst: IO error: while stat a file for size: /home/doris_data/meta/000071.sst: No such file or directory
Can't access /000069.sst: IO error: while stat a file for size: /home/doris_data/meta/000069.sst: No such file or directory
000071.sst与000069.sst 在目录/home/doris_data/meta/下的确不存在。
a6c911ce-c14c-489f-b153-99e19a508b77.png

  1. 问题一:rocksdb sst文件丢失的原因怎么排查?
  2. 问题二:删除meta信息后,doris 数据能自动恢复吗?
    强制删除/home/doris_data/meta/下的文件后重启be可以正常启动,但是需要通过设置命令ADMIN SET FRONTEND CONFIG ("recover_with_empty_tablet" = "true")后,数据才能正常访问,请问删除某个be meta后可以自动恢复吗?
3 Answers

如果是磁盘坏了,并且表是多副本的话,直接下掉这个磁盘即可。

dmesg -T 看看有文件系统或者IO 错误没。

dmesg -T命令对应的信息如下

[Mon Jan 20 11:29:47 2025] usb 1-5: new low-speed USB device number 120 using xhci_hcd
[Mon Jan 20 11:29:47 2025] usb 1-5: New USB device found, idVendor=17ef, idProduct=6050, bcdDevice= 0.09
[Mon Jan 20 11:29:47 2025] usb 1-5: New USB device strings: Mfr=0, Product=2, SerialNumber=0
[Mon Jan 20 11:29:47 2025] usb 1-5: Product: Lenovo Precision USB Mouse
[Mon Jan 20 11:29:47 2025] input: Lenovo Precision USB Mouse as /devices/pci0000:00/0000:00:14.0/usb1/1-5/1-5:1.0/input/input122
[Mon Jan 20 11:29:47 2025] hid-generic 0003:17EF:6050.0079: input,hidraw2: USB HID v1.11 Mouse [Lenovo Precision USB Mouse] on usb-0000:00:14.0-5/input0
[Mon Jan 20 11:30:22 2025] usb 1-5: USB disconnect, device number 120
[Mon Jan 20 11:30:23 2025] usb 1-6: USB disconnect, device number 119
[Mon Jan 20 11:43:03 2025] usb 1-5: new low-speed USB device number 121 using xhci_hcd
[Mon Jan 20 11:43:04 2025] usb 1-5: New USB device found, idVendor=17ef, idProduct=6050, bcdDevice= 0.09
[Mon Jan 20 11:43:04 2025] usb 1-5: New USB device strings: Mfr=0, Product=2, SerialNumber=0
[Mon Jan 20 11:43:04 2025] usb 1-5: Product: Lenovo Precision USB Mouse
[Mon Jan 20 11:43:04 2025] input: Lenovo Precision USB Mouse as /devices/pci0000:00/0000:00:14.0/usb1/1-5/1-5:1.0/input/input123
[Mon Jan 20 11:43:04 2025] hid-generic 0003:17EF:6050.007A: input,hidraw0: USB HID v1.11 Mouse [Lenovo Precision USB Mouse] on usb-0000:00:14.0-5/input0
[Mon Jan 20 11:43:11 2025] usb 1-6: new low-speed USB device number 122 using xhci_hcd
[Mon Jan 20 11:43:11 2025] usb 1-6: New USB device found, idVendor=413c, idProduct=2113, bcdDevice=33.08
[Mon Jan 20 11:43:11 2025] usb 1-6: New USB device strings: Mfr=0, Product=2, SerialNumber=0
[Mon Jan 20 11:43:11 2025] usb 1-6: Product: Dell KB216 Wired Keyboard
[Mon Jan 20 11:43:11 2025] input: Dell KB216 Wired Keyboard as /devices/pci0000:00/0000:00:14.0/usb1/1-6/1-6:1.0/input/input124
[Mon Jan 20 11:43:12 2025] hid-generic 0003:413C:2113.007B: input,hidraw1: USB HID v1.11 Keyboard [Dell KB216 Wired Keyboard] on usb-0000:00:14.0-6/input0
[Mon Jan 20 11:43:12 2025] input: Dell KB216 Wired Keyboard as /devices/pci0000:00/0000:00:14.0/usb1/1-6/1-6:1.1/input/input125
[Mon Jan 20 11:43:12 2025] hid-generic 0003:413C:2113.007C: input,hiddev0,hidraw2: USB HID v1.11 Device [Dell KB216 Wired Keyboard] on usb-0000:00:14.0-6/input1
[Mon Jan 20 12:03:37 2025] usb 1-5: USB disconnect, device number 121
[Mon Jan 20 12:03:43 2025] usb 1-6: USB disconnect, device number 122
[Sun Feb 2 08:50:57 2025] i40e 0000:1a:00.3 eno4: NIC Link is Down
[Wed Feb 5 12:35:27 2025] i40e 0000:1a:00.3 eno4: NIC Link is Up, 1000 Mbps Full Duplex, Flow Control: None
[Thu Feb 13 19:20:49 2025] xor: automatically using best checksumming function:
[Thu Feb 13 19:20:49 2025] avx : 7676.000 MB/sec