Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Kernel oops on zfs create, or later when copying files #13005

Closed
flokli opened this issue Jan 24, 2022 · 8 comments
Closed

Kernel oops on zfs create, or later when copying files #13005

flokli opened this issue Jan 24, 2022 · 8 comments
Labels
Status: Stale No recent activity for issue Type: Defect Incorrect behavior (e.g. crash, hang)

Comments

@flokli
Copy link

flokli commented Jan 24, 2022

System information

Type Version/Name
Distribution Name NixOS
Distribution Version 22.05pre-git
Kernel Version 5.10.92
Architecture amd64
OpenZFS Version zfs-2.1.2-1, zfs-kmod-2.1.2-1

Describe the problem you're observing

Did a NixOS installation, which copies a lot of files around, and failed to get a sqlite database lock.

I poked in dmesg, and was greeted by the following:

[80068.510353] BUG: unable to handle page fault for address: ffffffffffffffff
[80068.510387] #PF: supervisor read access in kernel mode
[80068.510404] #PF: error_code(0x0000) - not-present page
[80068.510423] PGD 7dda15067 P4D 7dda15067 PUD 7dda17067 PMD 0 
[80068.510443] Oops: 0000 [#1] SMP NOPTI
[80068.510456] CPU: 25 PID: 7121 Comm: z_wr_int_1 Tainted: P           O      5.10.92 #1-NixOS
[80068.510480] Hardware name: ASUSTeK COMPUTER INC. KRPA-U16 Series/KRPA-U16 Series, BIOS 4102 11/17/2021
[80068.510524] RIP: 0010:buf_hash_insert+0x90/0x170 [zfs]
[80068.510540] Code: dc 27 00 48 8d 34 d8 48 8b 0e 48 85 c9 0f 84 dd 00 00 00 48 8b 55 00 48 89 c8 31 db eb 0c 48 8b 40 20 83 c3 01 48 85 c0 74 2a <48> 39 10 75 ef 48 8b 7d 08 48 39 78 08 75 e5 48 8b 7d 10 48 39 78
[80068.510590] RSP: 0018:ffffaff90167fce0 EFLAGS: 00010246
[80068.510606] RAX: ffffffffffffffff RBX: 0000000000000000 RCX: ffffffffffffffff
[80068.510627] RDX: 0000000000000008 RSI: ffffaff93a562430 RDI: ffffffffc0ae9de0
[80068.510648] RBP: ffffa0d265864500 R08: 9ae16a3b2f90408f R09: 9ae16a3b2f90404f
[80068.510668] R10: ffffa12143ee6f00 R11: 0000000000000001 R12: ffffaff90167fd08
[80068.510692] R13: 0000000000007920 R14: ffffa0d252d54140 R15: ffffa0d2493a1ea0
[80068.510714] FS:  0000000000000000(0000) GS:ffffa1214e640000(0000) knlGS:0000000000000000
[80068.510737] CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
[80068.510754] CR2: ffffffffffffffff CR3: 00000007dda10000 CR4: 0000000000350ee0
[80068.510774] Call Trace:
[80068.510800]  arc_write_done+0x120/0x420 [zfs]
[80068.510832]  zio_done+0x3fc/0x1150 [zfs]
[80068.510862]  zio_execute+0x83/0x120 [zfs]
[80068.510877]  taskq_thread+0x2df/0x520 [spl]
[80068.510894]  ? wake_up_q+0xa0/0xa0
[80068.510920]  ? zio_gang_tree_free+0x60/0x60 [zfs]
[80068.511486]  ? taskq_thread_spawn+0x50/0x50 [spl]
[80068.512027]  kthread+0x11b/0x140
[80068.512554]  ? __kthread_bind_mask+0x60/0x60
[80068.513081]  ret_from_fork+0x22/0x30
[80068.513602] Modules linked in: ext4 crc16 mbcache jbd2 dm_crypt cbc encrypted_keys trusted tpm rng_core af_packet amd64_edac_mod edac_mce_amd edac_core amd_energy kvm_amd kvm irqbypass crc32_pclmul cfg80211 ghash_clmulni_intel rfkill aesni_intel ipmi_ssif evdev 8021q libaes crypto_simd joydev ast mousedev input_leds drm_vram_helper cryptd led_class hid_generic mac_hid glue_helper igb drm_ttm_helper sp5100_tco ptp ttm pps_core watchdog drm_kms_helper cdc_ether dca usbnet usbhid i2c_piix4 i2c_algo_bit rapl hid mii fb_sys_fops syscopyarea sysfillrect sysimgblt nvme acpi_ipmi nvme_core k10temp ip6_tables ipmi_si ipmi_devintf xt_conntrack ipmi_msghandler nf_conntrack nf_defrag_ipv6 pinctrl_amd hed tiny_power_button button nf_defrag_ipv4 acpi_cpufreq xt_tcpudp ip6t_rpfilter ipt_rpfilter xt_pkttype nft_compat nft_counter nf_tables libcrc32c crc32c_generic nfnetlink sch_fq_codel atkbd libps2 serio zfs(PO) zunicode(PO) zzstd(O) zlua(O) zavl(PO) icp(PO) zcommon(PO) znvpair(PO) spl(O) tun tap macvlan

Describe how to reproduce the problem

Unsure. Didn't want to immediately reboot, in case you want to further investigate this.


This smells similar to #4608, but contrary to there, the machine is running with ECC ram:

[ 19.574362] EDAC amd64: Node 0: DRAM ECC enabled.

@flokli flokli added the Type: Defect Incorrect behavior (e.g. crash, hang) label Jan 24, 2022
@flokli
Copy link
Author

flokli commented Jan 24, 2022

In case it's useful, in userspace I was seeing some stuck processes, which I killed and started again, and it showed me failing to get a lock (probably after the BUG has happened):

[pid  9471] newfstatat(AT_FDCWD, "/mnt/nix/store", {st_dev=makedev(0, 0x2d), st_ino=1792, st_mode=S_IFDIR|S_ISVTX|0775, st_nlink=3, st_uid=0, st_gid=30000, st_blksize=131072, st_blocks=1, st_size=3, st_atime=1643040150 /* 2022-01-24T16:02:30.270377401+0000 */, st_atime_nsec=270377401, st_mtime=1643040150 /* 2022-01-24T16:02:30.270377401+0000 */, st_mtime_nsec=270377401, st_ctime=1643040150 /* 2022-01-24T16:02:30.270377401+0000 */, st_ctime_nsec=270377401}, 0) = 0
[pid  9471] newfstatat(AT_FDCWD, "/mnt/nix/store", {st_dev=makedev(0, 0x2d), st_ino=1792, st_mode=S_IFDIR|S_ISVTX|0775, st_nlink=3, st_uid=0, st_gid=30000, st_blksize=131072, st_blocks=1, st_size=3, st_atime=1643040150 /* 2022-01-24T16:02:30.270377401+0000 */, st_atime_nsec=270377401, st_mtime=1643040150 /* 2022-01-24T16:02:30.270377401+0000 */, st_mtime_nsec=270377401, st_ctime=1643040150 /* 2022-01-24T16:02:30.270377401+0000 */, st_ctime_nsec=270377401}, AT_SYMLINK_NOFOLLOW) = 0
[pid  9471] newfstatat(AT_FDCWD, "/mnt/nix", {st_dev=makedev(0, 0x2d), st_ino=1537, st_mode=S_IFDIR|0755, st_nlink=4, st_uid=0, st_gid=0, st_blksize=131072, st_blocks=1, st_size=4, st_atime=1643040150 /* 2022-01-24T16:02:30.243375879+0000 */, st_atime_nsec=243375879, st_mtime=1643040150 /* 2022-01-24T16:02:30.270377401+0000 */, st_mtime_nsec=270377401, st_ctime=1643040150 /* 2022-01-24T16:02:30.270377401+0000 */, st_ctime_nsec=270377401}, AT_SYMLINK_NOFOLLOW) = 0
[pid  9471] newfstatat(AT_FDCWD, "/mnt", {st_dev=makedev(0, 0x2d), st_ino=34, st_mode=S_IFDIR|0755, st_nlink=8, st_uid=0, st_gid=0, st_blksize=512, st_blocks=1, st_size=8, st_atime=1643040145 /* 2022-01-24T16:02:25.880130124+0000 */, st_atime_nsec=880130124, st_mtime=1643040226 /* 2022-01-24T16:03:46.933695006+0000 */, st_mtime_nsec=933695006, st_ctime=1643040226 /* 2022-01-24T16:03:46.933695006+0000 */, st_ctime_nsec=933695006}, AT_SYMLINK_NOFOLLOW) = 0
[pid  9471] newfstatat(AT_FDCWD, "/mnt/nix/var/nix/db/reserved", {st_dev=makedev(0, 0x2d), st_ino=1921, st_mode=S_IFREG|0600, st_nlink=1, st_uid=0, st_gid=0, st_blksize=131072, st_blocks=1, st_size=8388608, st_atime=1643040150 /* 2022-01-24T16:02:30.270377401+0000 */, st_atime_nsec=270377401, st_mtime=1643040150 /* 2022-01-24T16:02:30.270377401+0000 */, st_mtime_nsec=270377401, st_ctime=1643040150 /* 2022-01-24T16:02:30.270377401+0000 */, st_ctime_nsec=270377401}, 0) = 0
[pid  9471] openat(AT_FDCWD, "/mnt/nix/var/nix/db/big-lock", O_RDWR|O_CREAT|O_CLOEXEC, 0600) = 4
[pid  9471] flock(4, LOCK_SH|LOCK_NB)   = 0
[pid  9471] newfstatat(AT_FDCWD, "/mnt/nix/var/nix/db/schema", 0x7ffc4f8405d0, AT_SYMLINK_NOFOLLOW) = -1 ENOENT (No such file or directory)
[pid  9471] access("/mnt/nix/var/nix/db", R_OK|W_OK) = 0
[pid  9471] newfstatat(AT_FDCWD, "/mnt/nix/var/nix/db/db.sqlite", {st_dev=makedev(0, 0x2d), st_ino=1923, st_mode=S_IFREG|0644, st_nlink=1, st_uid=0, st_gid=0, st_blksize=131072, st_blocks=1, st_size=0, st_atime=1643040150 /* 2022-01-24T16:02:30.270377401+0000 */, st_atime_nsec=270377401, st_mtime=1643040150 /* 2022-01-24T16:02:30.270377401+0000 */, st_mtime_nsec=270377401, st_ctime=1643040150 /* 2022-01-24T16:02:30.270377401+0000 */, st_ctime_nsec=270377401}, AT_SYMLINK_NOFOLLOW) = 0
[pid  9471] getpid()                    = 9471
[pid  9471] getpid()                    = 9471
[pid  9471] openat(AT_FDCWD, "/mnt/nix/var/nix/db/db.sqlite", O_RDWR|O_CREAT|O_NOFOLLOW|O_CLOEXEC, 0644) = 5
[pid  9471] newfstatat(5, "", {st_dev=makedev(0, 0x2d), st_ino=1923, st_mode=S_IFREG|0644, st_nlink=1, st_uid=0, st_gid=0, st_blksize=131072, st_blocks=1, st_size=0, st_atime=1643040150 /* 2022-01-24T16:02:30.270377401+0000 */, st_atime_nsec=270377401, st_mtime=1643040150 /* 2022-01-24T16:02:30.270377401+0000 */, st_mtime_nsec=270377401, st_ctime=1643040150 /* 2022-01-24T16:02:30.270377401+0000 */, st_ctime_nsec=270377401}, AT_EMPTY_PATH) = 0
[pid  9471] newfstatat(5, "", {st_dev=makedev(0, 0x2d), st_ino=1923, st_mode=S_IFREG|0644, st_nlink=1, st_uid=0, st_gid=0, st_blksize=131072, st_blocks=1, st_size=0, st_atime=1643040150 /* 2022-01-24T16:02:30.270377401+0000 */, st_atime_nsec=270377401, st_mtime=1643040150 /* 2022-01-24T16:02:30.270377401+0000 */, st_mtime_nsec=270377401, st_ctime=1643040150 /* 2022-01-24T16:02:30.270377401+0000 */, st_ctime_nsec=270377401}, AT_EMPTY_PATH) = 0
[pid  9471] newfstatat(AT_FDCWD, "/mnt/nix/var/nix/db/db.sqlite", {st_dev=makedev(0, 0x2d), st_ino=1923, st_mode=S_IFREG|0644, st_nlink=1, st_uid=0, st_gid=0, st_blksize=131072, st_blocks=1, st_size=0, st_atime=1643040150 /* 2022-01-24T16:02:30.270377401+0000 */, st_atime_nsec=270377401, st_mtime=1643040150 /* 2022-01-24T16:02:30.270377401+0000 */, st_mtime_nsec=270377401, st_ctime=1643040150 /* 2022-01-24T16:02:30.270377401+0000 */, st_ctime_nsec=270377401}, 0) = 0
[pid  9471] pread64(5, "", 100, 0)      = 0
[pid  9471] brk(0x557d53df6000)         = 0x557d53df6000
[pid  9471] fcntl(5, F_SETLK, {l_type=F_RDLCK, l_whence=SEEK_SET, l_start=1073741824, l_len=1}) = -1 EAGAIN (Resource temporarily unavailable)
[pid  9471] clock_nanosleep(CLOCK_REALTIME, 0, {tv_sec=0, tv_nsec=1000000}, NULL) = 0
[pid  9471] fcntl(5, F_SETLK, {l_type=F_RDLCK, l_whence=SEEK_SET, l_start=1073741824, l_len=1}) = -1 EAGAIN (Resource temporarily unavailable)
[pid  9471] clock_nanosleep(CLOCK_REALTIME, 0, {tv_sec=0, tv_nsec=2000000}, NULL) = 0
[pid  9471] fcntl(5, F_SETLK, {l_type=F_RDLCK, l_whence=SEEK_SET, l_start=1073741824, l_len=1}) = -1 EAGAIN (Resource temporarily unavailable)
[pid  9471] clock_nanosleep(CLOCK_REALTIME, 0, {tv_sec=0, tv_nsec=5000000}, NULL) = 0
[pid  9471] fcntl(5, F_SETLK, {l_type=F_RDLCK, l_whence=SEEK_SET, l_start=1073741824, l_len=1}) = -1 EAGAIN (Resource temporarily unavailable)
[pid  9471] clock_nanosleep(CLOCK_REALTIME, 0, {tv_sec=0, tv_nsec=10000000}, NULL) = 0
[pid  9471] fcntl(5, F_SETLK, {l_type=F_RDLCK, l_whence=SEEK_SET, l_start=1073741824, l_len=1}) = -1 EAGAIN (Resource temporarily unavailable)
[pid  9471] clock_nanosleep(CLOCK_REALTIME, 0, {tv_sec=0, tv_nsec=15000000}, NULL) = 0
[pid  9471] fcntl(5, F_SETLK, {l_type=F_RDLCK, l_whence=SEEK_SET, l_start=1073741824, l_len=1}) = -1 EAGAIN (Resource temporarily unavailable)
[pid  9471] clock_nanosleep(CLOCK_REALTIME, 0, {tv_sec=0, tv_nsec=20000000}, NULL) = 0
[pid  9471] fcntl(5, F_SETLK, {l_type=F_RDLCK, l_whence=SEEK_SET, l_start=1073741824, l_len=1}) = -1 EAGAIN (Resource temporarily unavailable)
[pid  9471] clock_nanosleep(CLOCK_REALTIME, 0, {tv_sec=0, tv_nsec=25000000}, NULL) = 0
[pid  9471] fcntl(5, F_SETLK, {l_type=F_RDLCK, l_whence=SEEK_SET, l_start=1073741824, l_len=1}) = -1 EAGAIN (Resource temporarily unavailable)
[pid  9471] clock_nanosleep(CLOCK_REALTIME, 0, {tv_sec=0, tv_nsec=25000000}, NULL) = 0
[pid  9471] fcntl(5, F_SETLK, {l_type=F_RDLCK, l_whence=SEEK_SET, l_start=1073741824, l_len=1}) = -1 EAGAIN (Resource temporarily unavailable)
[pid  9471] clock_nanosleep(CLOCK_REALTIME, 0, {tv_sec=0, tv_nsec=25000000}, NULL) = 0
[pid  9471] fcntl(5, F_SETLK, {l_type=F_RDLCK, l_whence=SEEK_SET, l_start=1073741824, l_len=1}) = -1 EAGAIN (Resource temporarily unavailable)
[pid  9471] clock_nanosleep(CLOCK_REALTIME, 0, {tv_sec=0, tv_nsec=50000000}, NULL) = 0
[pid  9471] fcntl(5, F_SETLK, {l_type=F_RDLCK, l_whence=SEEK_SET, l_start=1073741824, l_len=1}) = -1 EAGAIN (Resource temporarily unavailable)
[pid  9471] clock_nanosleep(CLOCK_REALTIME, 0, {tv_sec=0, tv_nsec=50000000}, NULL) = 0
[pid  9471] fcntl(5, F_SETLK, {l_type=F_RDLCK, l_whence=SEEK_SET, l_start=1073741824, l_len=1}) = -1 EAGAIN (Resource temporarily unavailable)
[pid  9471] clock_nanosleep(CLOCK_REALTIME, 0, {tv_sec=0, tv_nsec=100000000}, NULL) = 0

@flokli
Copy link
Author

flokli commented Jan 27, 2022

Some more:

[  105.727968] BUG: unable to handle page fault for address: ffffffffffffffff
[  105.728004] #PF: supervisor read access in kernel mode
[  105.728024] #PF: error_code(0x0000) - not-present page
[  105.728042] PGD 26fc15067 P4D 26fc15067 PUD 26fc17067 PMD 0 
[  105.728066] Oops: 0000 [#1] SMP NOPTI
[  105.728080] CPU: 44 PID: 2793 Comm: dp_sync_taskq Tainted: P           O      5.10.93 #1-NixOS
[  105.728109] Hardware name: ASUSTeK COMPUTER INC. KRPA-U16 Series/KRPA-U16 Series, BIOS 4102 11/17/2021
[  105.728157] RIP: 0010:buf_hash_find+0xa7/0x130 [zfs]
[  105.728176] Code: 25 c0 bb 01 00 48 89 85 e8 94 a6 c0 48 8b 05 48 c1 27 00 4a 8b 04 f0 48 85 c0 74 32 48 8b 13 eb 09 48 8b 40 20 48 85 c0 74 24 <48> 39 10 75 f2 48 8b 73 08 48 39 70 08 75 e8 4c 39 68 10 75 e2 4c
[  105.728232] RSP: 0018:ffffac480b823ac8 EFLAGS: 00010286
[  105.728250] RAX: ffffffffffffffff RBX: ffff9c3603a04440 RCX: 0000000000000061
[  105.728273] RDX: 0000000000000008 RSI: 0000000000000008 RDI: ffffffffc0a6a6f0
[  105.728296] RBP: 0000000000001230 R08: 9ae16a3b2f90408f R09: 9ae16a3b2f90404f
[  105.728319] R10: ffff9c3603b7c180 R11: 0000000000000001 R12: ffffffffc0a6a6f0
[  105.728343] R13: 000000000000026a R14: 000000000336a861 R15: 8cfae33e54958b7f
[  105.728366] FS:  0000000000000000(0000) GS:ffff9c84ceb00000(0000) knlGS:0000000000000000
[  105.728392] CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
[  105.728412] CR2: ffffffffffffffff CR3: 000000026fc10000 CR4: 0000000000350ee0
[  105.728435] Call Trace:
[  105.728466]  arc_freed+0x30/0xc0 [zfs]
[  105.728499]  ? zfs_blkptr_verify+0x410/0x440 [zfs]
[  105.728533]  zio_free_sync+0x4a/0xf0 [zfs]
[  105.728566]  zio_free+0xb3/0xf0 [zfs]
[  105.728600]  dsl_dataset_block_kill+0x2a6/0x4e0 [zfs]
[  105.728638]  free_blocks+0xd4/0x1c0 [zfs]
[  105.728670]  ? dnode_undirty_dbufs+0x1d0/0x1d0 [zfs]
[  105.728703]  dnode_sync_free_range+0x1e0/0x270 [zfs]
[  105.728722]  ? _cond_resched+0x16/0x40
[  105.728753]  ? dnode_undirty_dbufs+0x1d0/0x1d0 [zfs]
[  105.728795]  range_tree_walk+0xad/0x1e0 [zfs]
[  105.728828]  dnode_sync+0x2bb/0xa30 [zfs]
[  105.728845]  ? __wake_up_common+0x80/0x180
[  105.728860]  ? _cond_resched+0x16/0x40
[  105.728891]  sync_dnodes_task+0x71/0xa0 [zfs]
[  105.728910]  taskq_thread+0x2df/0x520 [spl]
[  105.728927]  ? wake_up_q+0xa0/0xa0
[  105.728942]  ? taskq_thread_spawn+0x50/0x50 [spl]
[  105.728959]  kthread+0x11b/0x140
[  105.728973]  ? __kthread_bind_mask+0x60/0x60
[  105.728989]  ret_from_fork+0x22/0x30
[  105.729004] Modules linked in: ext4 crc16 mbcache jbd2 dm_crypt cbc encrypted_keys trusted tpm rng_core af_packet cfg80211 rfkill 8021q ip6_tables amd64_edac_mod xt_conntrack edac_mce_amd nf_conntrack edac_core amd_energy nf_defrag_ipv6 nf_defrag_ipv4 kvm_amd joydev input_leds mousedev led_class xt_tcpudp ipmi_ssif kvm ast evdev irqbypass drm_vram_helper mac_hid ip6t_rpfilter hid_generic crc32_pclmul drm_ttm_helper ghash_clmulni_intel aesni_intel ipt_rpfilter igb ttm libaes crypto_simd drm_kms_helper cryptd sp5100_tco ptp xt_pkttype glue_helper usbhid pps_core cdc_ether fb_sys_fops syscopyarea nft_compat watchdog dca hid nvme sysfillrect usbnet rapl i2c_piix4 i2c_algo_bit mii sysimgblt k10temp nvme_core nft_counter hed acpi_ipmi ipmi_si ipmi_devintf ipmi_msghandler nf_tables pinctrl_amd tiny_power_button libcrc32c crc32c_generic acpi_cpufreq button nfnetlink sch_fq_codel atkbd libps2 serio zfs(PO) zunicode(PO) zzstd(O) zlua(O) zavl(PO) icp(PO) zcommon(PO) znvpair(PO) spl(O) tun tap macvlan
[  105.729055]  bridge stp llc drm agpgart backlight fuse i2c_core pstore configfs ip_tables x_tables autofs4 squashfs sd_mod t10_pi crc_t10dif crct10dif_generic ahci xhci_pci libahci xhci_pci_renesas xhci_hcd libata crct10dif_pclmul crct10dif_common usbcore crc32c_intel scsi_mod usb_common rtc_cmos dm_mod overlay loop
[  105.729408] CR2: ffffffffffffffff
[  105.730607] ---[ end trace 6757f5bf80c5331b ]---
[  105.730608] BUG: unable to handle page fault for address: ffffffffffffffff
[  105.730609] #PF: supervisor read access in kernel mode
[  105.731641] RIP: 0010:buf_hash_find+0xa7/0x130 [zfs]
[  105.732652] #PF: error_code(0x0000) - not-present page
[  105.733673] Code: 25 c0 bb 01 00 48 89 85 e8 94 a6 c0 48 8b 05 48 c1 27 00 4a 8b 04 f0 48 85 c0 74 32 48 8b 13 eb 09 48 8b 40 20 48 85 c0 74 24 <48> 39 10 75 f2 48 8b 73 08 48 39 70 08 75 e8 4c 39 68 10 75 e2 4c
[  105.734725] PGD 26fc15067 
[  105.735772] RSP: 0018:ffffac480b823ac8 EFLAGS: 00010286
[  105.736849] P4D 26fc15067 
[  105.737941] RAX: ffffffffffffffff RBX: ffff9c3603a04440 RCX: 0000000000000061
[  105.737942] RDX: 0000000000000008 RSI: 0000000000000008 RDI: ffffffffc0a6a6f0
[  105.739002] PUD 26fc17067 PMD 0 
[  105.740084] RBP: 0000000000001230 R08: 9ae16a3b2f90408f R09: 9ae16a3b2f90404f

[  105.742266] R10: ffff9c3603b7c180 R11: 0000000000000001 R12: ffffffffc0a6a6f0
[  105.743334] Oops: 0000 [#2] SMP NOPTI
[  105.744440] R13: 000000000000026a R14: 000000000336a861 R15: 8cfae33e54958b7f
[  105.745520] CPU: 13 PID: 2794 Comm: dp_sync_taskq Tainted: P      D    O      5.10.93 #1-NixOS
[  105.746654] FS:  0000000000000000(0000) GS:ffff9c84ceb00000(0000) knlGS:0000000000000000
[  105.747755] Hardware name: ASUSTeK COMPUTER INC. KRPA-U16 Series/KRPA-U16 Series, BIOS 4102 11/17/2021
[  105.748885] CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
[  105.748886] CR2: ffffffffffffffff CR3: 000000026fc10000 CR4: 0000000000350ee0
[  105.750070] RIP: 0010:buf_hash_find+0xa7/0x130 [zfs]
[  105.812598] Code: 25 c0 bb 01 00 48 89 85 e8 94 a6 c0 48 8b 05 48 c1 27 00 4a 8b 04 f0 48 85 c0 74 32 48 8b 13 eb 09 48 8b 40 20 48 85 c0 74 24 <48> 39 10 75 f2 48 8b 73 08 48 39 70 08 75 e8 4c 39 68 10 75 e2 4c
[  105.831581] RSP: 0018:ffffac480b82bac8 EFLAGS: 00010286
[  105.847585] RAX: ffffffffffffffff RBX: ffff9c36435dc840 RCX: 0000000000000645
[  105.863582] RDX: 0000000000000008 RSI: 0000000000000008 RDI: ffffffffc0a7c1b0
[  105.879597] RBP: 0000000000012cf0 R08: 9ae16a3b2f90408f R09: 9ae16a3b2f90404f
[  105.894461] R10: ffff9c3643734780 R11: 0000000000000001 R12: ffffffffc0a7c1b0
[  105.909583] R13: 000000000000026a R14: 0000000003356e45 R15: 8cfae33e54958b7f
[  105.925577] FS:  0000000000000000(0000) GS:ffff9c84ce340000(0000) knlGS:0000000000000000
[  105.941210] CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
[  105.958858] CR2: ffffffffffffffff CR3: 000000026fc10000 CR4: 0000000000350ee0
[  105.973541] Call Trace:
[  105.987597]  arc_freed+0x30/0xc0 [zfs]
[  106.003420]  ? zfs_blkptr_verify+0x410/0x440 [zfs]
[  106.017677]  zio_free_sync+0x4a/0xf0 [zfs]
[  106.032576]  zio_free+0xb3/0xf0 [zfs]
[  106.048588]  dsl_dataset_block_kill+0x2a6/0x4e0 [zfs]
[  106.063573]  free_blocks+0xd4/0x1c0 [zfs]
[  106.078584]  ? dnode_undirty_dbufs+0x1d0/0x1d0 [zfs]
[  106.094593]  dnode_sync_free_range+0x1e0/0x270 [zfs]
[  106.109572]  ? _cond_resched+0x16/0x40
[  106.127594]  ? dnode_undirty_dbufs+0x1d0/0x1d0 [zfs]
[  106.142726]  range_tree_walk+0xad/0x1e0 [zfs]
[  106.157592]  dnode_sync+0x2bb/0xa30 [zfs]
[  106.172583]  ? __wake_up_common+0x80/0x180
[  106.187221]  ? _cond_resched+0x16/0x40
[  106.201759]  sync_dnodes_task+0x71/0xa0 [zfs]
[  106.218578]  taskq_thread+0x2df/0x520 [spl]
[  106.233101]  ? wake_up_q+0xa0/0xa0
[  106.247574]  ? taskq_thread_spawn+0x50/0x50 [spl]
[  106.263573]  kthread+0x11b/0x140
[  106.278573]  ? __kthread_bind_mask+0x60/0x60
[  106.296575]  ret_from_fork+0x22/0x30
[  106.312573] Modules linked in: ext4 crc16 mbcache jbd2 dm_crypt cbc encrypted_keys trusted tpm rng_core af_packet cfg80211 rfkill 8021q ip6_tables amd64_edac_mod xt_conntrack edac_mce_amd nf_conntrack edac_core amd_energy nf_defrag_ipv6 nf_defrag_ipv4 kvm_amd joydev input_leds mousedev led_class xt_tcpudp ipmi_ssif kvm ast evdev irqbypass drm_vram_helper mac_hid ip6t_rpfilter hid_generic crc32_pclmul drm_ttm_helper ghash_clmulni_intel aesni_intel ipt_rpfilter igb ttm libaes crypto_simd drm_kms_helper cryptd sp5100_tco ptp xt_pkttype glue_helper usbhid pps_core cdc_ether fb_sys_fops syscopyarea nft_compat watchdog dca hid nvme sysfillrect usbnet rapl i2c_piix4 i2c_algo_bit mii sysimgblt k10temp nvme_core nft_counter hed acpi_ipmi ipmi_si ipmi_devintf ipmi_msghandler nf_tables pinctrl_amd tiny_power_button libcrc32c crc32c_generic acpi_cpufreq button nfnetlink sch_fq_codel atkbd libps2 serio zfs(PO) zunicode(PO) zzstd(O) zlua(O) zavl(PO) icp(PO) zcommon(PO) znvpair(PO) spl(O) tun tap macvlan
[  106.312978]  bridge stp llc drm agpgart backlight fuse i2c_core pstore configfs ip_tables x_tables autofs4 squashfs sd_mod t10_pi crc_t10dif crct10dif_generic ahci xhci_pci libahci xhci_pci_renesas xhci_hcd libata crct10dif_pclmul crct10dif_common usbcore crc32c_intel scsi_mod usb_common rtc_cmos dm_mod overlay loop
[  106.430503] CR2: ffffffffffffffff
[  106.445497] ---[ end trace 6757f5bf80c5331c ]---
[  106.460582] RIP: 0010:buf_hash_find+0xa7/0x130 [zfs]
[  106.476572] Code: 25 c0 bb 01 00 48 89 85 e8 94 a6 c0 48 8b 05 48 c1 27 00 4a 8b 04 f0 48 85 c0 74 32 48 8b 13 eb 09 48 8b 40 20 48 85 c0 74 24 <48> 39 10 75 f2 48 8b 73 08 48 39 70 08 75 e8 4c 39 68 10 75 e2 4c
[  106.493576] RSP: 0018:ffffac480b823ac8 EFLAGS: 00010286
[  106.509036] RAX: ffffffffffffffff RBX: ffff9c3603a04440 RCX: 0000000000000061
[  106.524574] RDX: 0000000000000008 RSI: 0000000000000008 RDI: ffffffffc0a6a6f0
[  106.539754] RBP: 0000000000001230 R08: 9ae16a3b2f90408f R09: 9ae16a3b2f90404f
[  106.555583] R10: ffff9c3603b7c180 R11: 0000000000000001 R12: ffffffffc0a6a6f0
[  106.571567] R13: 000000000000026a R14: 000000000336a861 R15: 8cfae33e54958b7f
[  106.586499] FS:  0000000000000000(0000) GS:ffff9c84ce340000(0000) knlGS:0000000000000000
[  106.601583] CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
[  106.615680] CR2: ffffffffffffffff CR3: 000000026fc10000 CR4: 0000000000350ee0

@flokli
Copy link
Author

flokli commented Jan 27, 2022

[   85.732263] BUG: unable to handle page fault for address: ffffffffffffffff
[   85.732330] #PF: supervisor read access in kernel mode
[   85.732369] #PF: error_code(0x0000) - not-present page
[   85.732406] PGD 3409a15067 P4D 3409a15067 PUD 3409a17067 PMD 0 
[   85.732451] Oops: 0000 [#1] SMP NOPTI
[   85.732479] CPU: 11 PID: 2774 Comm: dp_sync_taskq Tainted: P           O      5.10.93 #1-NixOS
[   85.732533] Hardware name: ASUSTeK COMPUTER INC. KRPA-U16 Series/KRPA-U16 Series, BIOS 4102 11/17/2021
[   85.732626] RIP: 0010:buf_hash_find+0xa7/0x130 [zfs]
[   85.732659] Code: 25 c0 bb 01 00 48 89 85 e8 f4 85 c0 48 8b 05 48 c1 27 00 4a 8b 04 f0 48 85 c0 74 32 48 8b 13 eb 09 48 8b 40 20 48 85 c0 74 24 <48> 39 10 75 f2 48 8b 73 08 48 39 70 08 75 e8 4c 39 68 10 75 e2 4c
[   85.732774] RSP: 0018:ffffb707473dfac8 EFLAGS: 00010286
[   85.732810] RAX: ffffffffffffffff RBX: ffff905db47a1240 RCX: 00000000000002cd
[   85.732859] RDX: 0000000000000010 RSI: 0000000000000010 RDI: ffffffffc0867b30
[   85.732907] RBP: 0000000000008670 R08: 9ae16a3b2f90408f R09: 9ae16a3b2f90404f
[   85.732935] R10: ffff905db51e7780 R11: 0000000000000001 R12: ffffffffc0867b30
[   85.732963] R13: 000000000000026a R14: 00000000037502cd R15: 9777ffc4d387a938
[   85.732993] FS:  0000000000000000(0000) GS:ffff90ac8e2c0000(0000) knlGS:0000000000000000
[   85.733025] CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
[   85.733046] CR2: ffffffffffffffff CR3: 0000003409a10000 CR4: 0000000000350ee0
[   85.733070] Call Trace:
[   85.733100]  arc_freed+0x30/0xc0 [zfs]
[   85.733135]  ? zfs_blkptr_verify+0x410/0x440 [zfs]
[   85.733170]  zio_free_sync+0x4a/0xf0 [zfs]
[   85.733205]  zio_free+0xb3/0xf0 [zfs]
[   85.733239]  dsl_dataset_block_kill+0x2a6/0x4e0 [zfs]
[   85.733277]  free_blocks+0xd4/0x1c0 [zfs]
[   85.733308]  ? dnode_undirty_dbufs+0x1d0/0x1d0 [zfs]
[   85.733343]  dnode_sync_free_range+0x1e0/0x270 [zfs]
[   85.733362]  ? _cond_resched+0x16/0x40
[   85.733391]  ? dnode_undirty_dbufs+0x1d0/0x1d0 [zfs]
[   85.733431]  range_tree_walk+0xad/0x1e0 [zfs]
[   85.733465]  dnode_sync+0x2bb/0xa30 [zfs]
[   85.733481]  ? _cond_resched+0x16/0x40
[   85.733513]  sync_dnodes_task+0x71/0xa0 [zfs]
[   85.733533]  taskq_thread+0x2df/0x520 [spl]
[   85.733551]  ? wake_up_q+0xa0/0xa0
[   85.733566]  ? taskq_thread_spawn+0x50/0x50 [spl]
[   85.733584]  kthread+0x11b/0x140
[   85.733597]  ? __kthread_bind_mask+0x60/0x60
[   85.733614]  ret_from_fork+0x22/0x30
[   85.733629] Modules linked in: ext4 crc16 mbcache jbd2 dm_crypt cbc encrypted_keys trusted tpm rng_core af_packet amd64_edac_mod edac_mce_amd edac_core amd_energy kvm_amd kvm irqbypass crc32_pclmul ghash_clmulni_intel igb ptp input_leds aesni_intel joydev ipmi_ssif cfg80211 led_class mousedev libaes ast crypto_simd drm_vram_helper pps_core rfkill cryptd hid_generic evdev mac_hid 8021q glue_helper sp5100_tco dca drm_ttm_helper ttm watchdog cdc_ether drm_kms_helper nvme usbhid usbnet i2c_piix4 i2c_algo_bit fb_sys_fops syscopyarea rapl sysfillrect hid mii sysimgblt k10temp nvme_core ip6_tables acpi_ipmi ipmi_si ipmi_devintf ipmi_msghandler xt_conntrack nf_conntrack nf_defrag_ipv6 hed nf_defrag_ipv4 pinctrl_amd tiny_power_button button acpi_cpufreq xt_tcpudp ip6t_rpfilter ipt_rpfilter xt_pkttype nft_compat nft_counter nf_tables libcrc32c crc32c_generic nfnetlink sch_fq_codel atkbd libps2 serio zfs(PO) zunicode(PO) zzstd(O) zlua(O) zavl(PO) icp(PO) zcommon(PO) znvpair(PO) spl(O) tun tap macvlan
[   85.733681]  bridge stp llc pstore fuse drm agpgart backlight i2c_core configfs ip_tables x_tables autofs4 squashfs sd_mod t10_pi crc_t10dif crct10dif_generic xhci_pci ahci xhci_pci_renesas libahci xhci_hcd libata crct10dif_pclmul usbcore crct10dif_common crc32c_intel scsi_mod usb_common rtc_cmos dm_mod overlay loop
[   85.734037] CR2: ffffffffffffffff
[   85.734051] ---[ end trace 2d0d878cbc5f7126 ]---
[   85.734054] BUG: unable to handle page fault for address: ffffffffffffffff
[   85.734057] #PF: supervisor read access in kernel mode
[   85.735249] RIP: 0010:buf_hash_find+0xa7/0x130 [zfs]
[   85.736573] #PF: error_code(0x0000) - not-present page
[   85.736575] PGD 3409a15067 
[   85.737867] Code: 25 c0 bb 01 00 48 89 85 e8 f4 85 c0 48 8b 05 48 c1 27 00 4a 8b 04 f0 48 85 c0 74 32 48 8b 13 eb 09 48 8b 40 20 48 85 c0 74 24 <48> 39 10 75 f2 48 8b 73 08 48 39 70 08 75 e8 4c 39 68 10 75 e2 4c
[   85.737869] RSP: 0018:ffffb707473dfac8 EFLAGS: 00010286
[   85.739124] P4D 3409a15067 PUD 3409a17067 

[   85.740376] RAX: ffffffffffffffff RBX: ffff905db47a1240 RCX: 00000000000002cd
[   85.741625] PMD 0 
[   85.742940] RDX: 0000000000000010 RSI: 0000000000000010 RDI: ffffffffc0867b30
[   85.742942] RBP: 0000000000008670 R08: 9ae16a3b2f90408f R09: 9ae16a3b2f90404f
[   85.744262] Oops: 0000 [#2] SMP NOPTI
[   85.745566] R10: ffff905db51e7780 R11: 0000000000000001 R12: ffffffffc0867b30
[   85.745568] R13: 000000000000026a R14: 00000000037502cd R15: 9777ffc4d387a938
[   85.746875] CPU: 48 PID: 2786 Comm: dp_sync_taskq Tainted: P      D    O      5.10.93 #1-NixOS
[   85.746878] Hardware name: ASUSTeK COMPUTER INC. KRPA-U16 Series/KRPA-U16 Series, BIOS 4102 11/17/2021
[   85.748198] FS:  0000000000000000(0000) GS:ffff90ac8e2c0000(0000) knlGS:0000000000000000
[   85.748200] CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
[   85.749525] RIP: 0010:buf_hash_find+0xa7/0x130 [zfs]
[   85.749528] Code: 25 c0 bb 01 00 48 89 85 e8 f4 85 c0 48 8b 05 48 c1 27 00 4a 8b 04 f0 48 85 c0 74 32 48 8b 13 eb 09 48 8b 40 20 48 85 c0 74 24 <48> 39 10 75 f2 48 8b 73 08 48 39 70 08 75 e8 4c 39 68 10 75 e2 4c
[   85.750857] CR2: ffffffffffffffff CR3: 0000003409a10000 CR4: 0000000000350ee0
[   85.764232] RSP: 0018:ffffb70747477ac8 EFLAGS: 00010286
[   85.765452] RAX: ffffffffffffffff RBX: ffff905db441de40 RCX: 000000000000061b
[   85.766666] RDX: 0000000000000008 RSI: 0000000000000008 RDI: ffffffffc08719d0
[   85.767868] RBP: 0000000000012510 R08: 9ae16a3b2f90408f R09: 9ae16a3b2f90404f
[   85.769049] R10: ffff905db64dab80 R11: 0000000000000001 R12: ffffffffc08719d0
[   85.770200] R13: 000000000000026a R14: 000000000375961b R15: 9777ffc4d387a938
[   85.771451] FS:  0000000000000000(0000) GS:ffff90ac8ec00000(0000) knlGS:0000000000000000
[   85.772730] CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
[   85.773924] CR2: ffffffffffffffff CR3: 0000003409a10000 CR4: 0000000000350ee0
[   85.775109] Call Trace:
[   85.776305]  arc_freed+0x30/0xc0 [zfs]
[   85.777534]  ? zfs_blkptr_verify+0x410/0x440 [zfs]
[   85.778740]  zio_free_sync+0x4a/0xf0 [zfs]
[   85.779927]  zio_free+0xb3/0xf0 [zfs]
[   85.781114]  dsl_dataset_block_kill+0x2a6/0x4e0 [zfs]
[   85.782327]  free_blocks+0xd4/0x1c0 [zfs]
[   85.783558]  ? dnode_undirty_dbufs+0x1d0/0x1d0 [zfs]
[   85.784766]  dnode_sync_free_range+0x1e0/0x270 [zfs]
[   85.785943]  ? _cond_resched+0x16/0x40
[   85.787135]  ? dnode_undirty_dbufs+0x1d0/0x1d0 [zfs]
[   85.788324]  range_tree_walk+0xad/0x1e0 [zfs]
[   85.789490]  dnode_sync+0x2bb/0xa30 [zfs]
[   85.790646]  ? __switch_to_asm+0x42/0x70
[   85.791791]  ? __switch_to+0x114/0x3e0
[   85.792922]  ? _cond_resched+0x16/0x40
[   85.794075]  sync_dnodes_task+0x71/0xa0 [zfs]
[   85.795207]  taskq_thread+0x2df/0x520 [spl]
[   85.796317]  ? wake_up_q+0xa0/0xa0
[   85.797432]  ? taskq_thread_spawn+0x50/0x50 [spl]
[   85.798545]  kthread+0x11b/0x140
[   85.799787]  ? __kthread_bind_mask+0x60/0x60
[   85.801062]  ret_from_fork+0x22/0x30
[   85.802229] Modules linked in: ext4 crc16 mbcache jbd2 dm_crypt cbc encrypted_keys trusted tpm rng_core af_packet amd64_edac_mod edac_mce_amd edac_core amd_energy kvm_amd kvm irqbypass crc32_pclmul ghash_clmulni_intel igb ptp input_leds aesni_intel joydev ipmi_ssif cfg80211 led_class mousedev libaes ast crypto_simd drm_vram_helper pps_core rfkill cryptd hid_generic evdev mac_hid 8021q glue_helper sp5100_tco dca drm_ttm_helper ttm watchdog cdc_ether drm_kms_helper nvme usbhid usbnet i2c_piix4 i2c_algo_bit fb_sys_fops syscopyarea rapl sysfillrect hid mii sysimgblt k10temp nvme_core ip6_tables acpi_ipmi ipmi_si ipmi_devintf ipmi_msghandler xt_conntrack nf_conntrack nf_defrag_ipv6 hed nf_defrag_ipv4 pinctrl_amd tiny_power_button button acpi_cpufreq xt_tcpudp ip6t_rpfilter ipt_rpfilter xt_pkttype nft_compat nft_counter nf_tables libcrc32c crc32c_generic nfnetlink sch_fq_codel atkbd libps2 serio zfs(PO) zunicode(PO) zzstd(O) zlua(O) zavl(PO) icp(PO) zcommon(PO) znvpair(PO) spl(O) tun tap macvlan
[   85.802280]  bridge stp llc pstore fuse drm agpgart backlight i2c_core configfs ip_tables x_tables autofs4 squashfs sd_mod t10_pi crc_t10dif crct10dif_generic xhci_pci ahci xhci_pci_renesas libahci xhci_hcd libata crct10dif_pclmul usbcore crct10dif_common crc32c_intel scsi_mod usb_common rtc_cmos dm_mod overlay loop
[   85.811300] CR2: ffffffffffffffff
[   85.812569] ---[ end trace 2d0d878cbc5f7127 ]---
[   85.812573] BUG: unable to handle page fault for address: ffffffffffffffff
[   85.813775] RIP: 0010:buf_hash_find+0xa7/0x130 [zfs]
[   85.813779] Code: 25 c0 bb 01 00 48 89 85 e8 f4 85 c0 48 8b 05 48 c1 27 00 4a 8b 04 f0 48 85 c0 74 32 48 8b 13 eb 09 48 8b 40 20 48 85 c0 74 24 <48> 39 10 75 f2 48 8b 73 08 48 39 70 08 75 e8 4c 39 68 10 75 e2 4c
[   85.815427] #PF: supervisor read access in kernel mode
[   85.815429] #PF: error_code(0x0000) - not-present page
[   85.816861] RSP: 0018:ffffb707473dfac8 EFLAGS: 00010286
[   85.818440] PGD 3409a15067 P4D 3409a15067 
[   85.819916] RAX: ffffffffffffffff RBX: ffff905db47a1240 RCX: 00000000000002cd
[   85.819918] RDX: 0000000000000010 RSI: 0000000000000010 RDI: ffffffffc0867b30
[   85.821499] PUD 3409a17067 PMD 0 
[   85.822969] RBP: 0000000000008670 R08: 9ae16a3b2f90408f R09: 9ae16a3b2f90404f
[   85.822971] R10: ffff905db51e7780 R11: 0000000000000001 R12: ffffffffc0867b30

[   85.824571] Oops: 0000 [#3] SMP NOPTI
[   85.826019] R13: 000000000000026a R14: 00000000037502cd R15: 9777ffc4d387a938
[   85.826021] FS:  0000000000000000(0000) GS:ffff90ac8ec00000(0000) knlGS:0000000000000000
[   85.827614] CPU: 45 PID: 2792 Comm: dp_sync_taskq Tainted: P      D    O      5.10.93 #1-NixOS
[   85.829179] CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
[   85.829182] CR2: ffffffffffffffff CR3: 0000003409a10000 CR4: 0000000000350ee0
[   85.830757] Hardware name: ASUSTeK COMPUTER INC. KRPA-U16 Series/KRPA-U16 Series, BIOS 4102 11/17/2021
[   85.830782] RIP: 0010:buf_hash_find+0xa7/0x130 [zfs]
[   85.843619] Code: 25 c0 bb 01 00 48 89 85 e8 f4 85 c0 48 8b 05 48 c1 27 00 4a 8b 04 f0 48 85 c0 74 32 48 8b 13 eb 09 48 8b 40 20 48 85 c0 74 24 <48> 39 10 75 f2 48 8b 73 08 48 39 70 08 75 e8 4c 39 68 10 75 e2 4c
[   85.844885] RSP: 0018:ffffb707474b7ac8 EFLAGS: 00010286
[   85.846151] RAX: ffffffffffffffff RBX: ffff905db3943e40 RCX: 00000000000002e3
[   85.847373] RDX: 0000000000000008 RSI: 0000000000000008 RDI: ffffffffc0867f50
[   85.848568] RBP: 0000000000008a90 R08: 9ae16a3b2f90408f R09: 9ae16a3b2f90404f
[   85.849753] R10: ffff905db3c90300 R11: 0000000000000001 R12: ffffffffc0867f50
[   85.850928] R13: 000000000000026a R14: 000000000374a2e3 R15: 9777ffc4d387a938
[   85.852099] FS:  0000000000000000(0000) GS:ffff90ac8eb40000(0000) knlGS:0000000000000000
[   85.853277] CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
[   85.854449] CR2: ffffffffffffffff CR3: 0000003409a10000 CR4: 0000000000350ee0
[   85.855621] Call Trace:
[   85.856884]  arc_freed+0x30/0xc0 [zfs]
[   85.858171]  ? zfs_blkptr_verify+0x410/0x440 [zfs]
[   85.859366]  zio_free_sync+0x4a/0xf0 [zfs]
[   85.860575]  zio_free+0xb3/0xf0 [zfs]
[   85.861777]  dsl_dataset_block_kill+0x2a6/0x4e0 [zfs]
[   85.862982]  free_blocks+0xd4/0x1c0 [zfs]
[   85.864177]  ? dnode_undirty_dbufs+0x1d0/0x1d0 [zfs]
[   85.865355]  dnode_sync_free_range+0x1e0/0x270 [zfs]
[   85.866532]  ? _cond_resched+0x16/0x40
[   85.867725]  ? dnode_undirty_dbufs+0x1d0/0x1d0 [zfs]
[   85.868970]  range_tree_walk+0xad/0x1e0 [zfs]
[   85.870171]  dnode_sync+0x2bb/0xa30 [zfs]
[   85.871356]  ? __switch_to_asm+0x42/0x70
[   85.872541]  ? __switch_to+0x114/0x3e0
[   85.873712]  ? _cond_resched+0x16/0x40
[   85.874876]  sync_dnodes_task+0x71/0xa0 [zfs]
[   85.876023]  taskq_thread+0x2df/0x520 [spl]
[   85.877174]  ? wake_up_q+0xa0/0xa0
[   85.878313]  ? taskq_thread_spawn+0x50/0x50 [spl]
[   85.879460]  kthread+0x11b/0x140
[   85.880601]  ? __kthread_bind_mask+0x60/0x60
[   85.881775]  ret_from_fork+0x22/0x30
[   85.882893] Modules linked in: ext4 crc16 mbcache jbd2 dm_crypt cbc encrypted_keys trusted tpm rng_core af_packet amd64_edac_mod edac_mce_amd edac_core amd_energy kvm_amd kvm irqbypass crc32_pclmul ghash_clmulni_intel igb ptp input_leds aesni_intel joydev ipmi_ssif cfg80211 led_class mousedev libaes ast crypto_simd drm_vram_helper pps_core rfkill cryptd hid_generic evdev mac_hid 8021q glue_helper sp5100_tco dca drm_ttm_helper ttm watchdog cdc_ether drm_kms_helper nvme usbhid usbnet i2c_piix4 i2c_algo_bit fb_sys_fops syscopyarea rapl sysfillrect hid mii sysimgblt k10temp nvme_core ip6_tables acpi_ipmi ipmi_si ipmi_devintf ipmi_msghandler xt_conntrack nf_conntrack nf_defrag_ipv6 hed nf_defrag_ipv4 pinctrl_amd tiny_power_button button acpi_cpufreq xt_tcpudp ip6t_rpfilter ipt_rpfilter xt_pkttype nft_compat nft_counter nf_tables libcrc32c crc32c_generic nfnetlink sch_fq_codel atkbd libps2 serio zfs(PO) zunicode(PO) zzstd(O) zlua(O) zavl(PO) icp(PO) zcommon(PO) znvpair(PO) spl(O) tun tap macvlan
[   85.882939]  bridge stp llc pstore fuse drm agpgart backlight i2c_core configfs ip_tables x_tables autofs4 squashfs sd_mod t10_pi crc_t10dif crct10dif_generic xhci_pci ahci xhci_pci_renesas libahci xhci_hcd libata crct10dif_pclmul usbcore crct10dif_common crc32c_intel scsi_mod usb_common rtc_cmos dm_mod overlay loop
[   85.892385] CR2: ffffffffffffffff
[   85.893642] ---[ end trace 2d0d878cbc5f7128 ]---
[   85.893644] BUG: unable to handle page fault for address: ffffffffffffffff
[   85.893647] #PF: supervisor read access in kernel mode
[   85.894875] RIP: 0010:buf_hash_find+0xa7/0x130 [zfs]
[   85.896269] #PF: error_code(0x0000) - not-present page
[   85.896271] PGD 3409a15067 
[   85.897740] Code: 25 c0 bb 01 00 48 89 85 e8 f4 85 c0 48 8b 05 48 c1 27 00 4a 8b 04 f0 48 85 c0 74 32 48 8b 13 eb 09 48 8b 40 20 48 85 c0 74 24 <48> 39 10 75 f2 48 8b 73 08 48 39 70 08 75 e8 4c 39 68 10 75 e2 4c
[   85.897742] RSP: 0018:ffffb707473dfac8 EFLAGS: 00010286
[   85.899119] P4D 3409a15067 PUD 3409a17067 

[   85.900550] RAX: ffffffffffffffff RBX: ffff905db47a1240 RCX: 00000000000002cd
[   85.901905] PMD 0 
[   85.903335] RDX: 0000000000000010 RSI: 0000000000000010 RDI: ffffffffc0867b30
[   85.903337] RBP: 0000000000008670 R08: 9ae16a3b2f90408f R09: 9ae16a3b2f90404f
[   85.904693] Oops: 0000 [#4] SMP NOPTI
[   85.906093] R10: ffff905db51e7780 R11: 0000000000000001 R12: ffffffffc0867b30
[   85.906095] R13: 000000000000026a R14: 00000000037502cd R15: 9777ffc4d387a938
[   85.907440] CPU: 5 PID: 2789 Comm: dp_sync_taskq Tainted: P      D    O      5.10.93 #1-NixOS
[   85.908840] FS:  0000000000000000(0000) GS:ffff90ac8eb40000(0000) knlGS:0000000000000000
[   85.908842] CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
[   85.910187] Hardware name: ASUSTeK COMPUTER INC. KRPA-U16 Series/KRPA-U16 Series, BIOS 4102 11/17/2021
[   85.910207] RIP: 0010:buf_hash_find+0xa7/0x130 [zfs]
[   85.911590] CR2: ffffffffffffffff CR3: 0000003409a10000 CR4: 0000000000350ee0
[   85.923982] Code: 25 c0 bb 01 00 48 89 85 e8 f4 85 c0 48 8b 05 48 c1 27 00 4a 8b 04 f0 48 85 c0 74 32 48 8b 13 eb 09 48 8b 40 20 48 85 c0 74 24 <48> 39 10 75 f2 48 8b 73 08 48 39 70 08 75 e8 4c 39 68 10 75 e2 4c
[   85.925275] RSP: 0018:ffffb70747497ac8 EFLAGS: 00010286
[   85.926495] RAX: ffffffffffffffff RBX: ffff905db47a1a40 RCX: 000000000000068f
[   85.927735] RDX: 0000000000000008 RSI: 0000000000000008 RDI: ffffffffc0872f90
[   85.928931] RBP: 0000000000013ad0 R08: 9ae16a3b2f90408f R09: 9ae16a3b2f90404f
[   85.930117] R10: ffff905db6320c00 R11: 0000000000000001 R12: ffffffffc0872f90
[   85.931304] R13: 000000000000026a R14: 000000000375368f R15: 9777ffc4d387a938
[   85.932479] FS:  0000000000000000(0000) GS:ffff90ac8e140000(0000) knlGS:0000000000000000
[   85.933659] CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
[   85.934820] CR2: ffffffffffffffff CR3: 0000003409a10000 CR4: 0000000000350ee0
[   85.935984] Call Trace:
[   85.937150]  arc_freed+0x30/0xc0 [zfs]
[   85.938313]  ? zfs_blkptr_verify+0x410/0x440 [zfs]
[   85.939474]  zio_free_sync+0x4a/0xf0 [zfs]
[   85.940662]  zio_free+0xb3/0xf0 [zfs]
[   85.941936]  dsl_dataset_block_kill+0x2a6/0x4e0 [zfs]
[   85.943176]  free_blocks+0xd4/0x1c0 [zfs]
[   85.944348]  ? dnode_undirty_dbufs+0x1d0/0x1d0 [zfs]
[   85.945520]  dnode_sync_free_range+0x1e0/0x270 [zfs]
[   85.946726]  ? _cond_resched+0x16/0x40
[   85.947921]  ? dnode_undirty_dbufs+0x1d0/0x1d0 [zfs]
[   85.949119]  range_tree_walk+0xad/0x1e0 [zfs]
[   85.950307]  dnode_sync+0x2bb/0xa30 [zfs]
[   85.951488]  ? __switch_to_asm+0x42/0x70
[   85.952657]  ? __switch_to+0x114/0x3e0
[   85.953854]  ? _cond_resched+0x16/0x40
[   85.955038]  sync_dnodes_task+0x71/0xa0 [zfs]
[   85.956215]  taskq_thread+0x2df/0x520 [spl]
[   85.957395]  ? wake_up_q+0xa0/0xa0
[   85.958550]  ? taskq_thread_spawn+0x50/0x50 [spl]
[   85.959706]  kthread+0x11b/0x140
[   85.960848]  ? __kthread_bind_mask+0x60/0x60
[   85.961992]  ret_from_fork+0x22/0x30
[   85.963131] Modules linked in: ext4 crc16 mbcache jbd2 dm_crypt cbc encrypted_keys trusted tpm rng_core af_packet amd64_edac_mod edac_mce_amd edac_core amd_energy kvm_amd kvm irqbypass crc32_pclmul ghash_clmulni_intel igb ptp input_leds aesni_intel joydev ipmi_ssif cfg80211 led_class mousedev libaes ast crypto_simd drm_vram_helper pps_core rfkill cryptd hid_generic evdev mac_hid 8021q glue_helper sp5100_tco dca drm_ttm_helper ttm watchdog cdc_ether drm_kms_helper nvme usbhid usbnet i2c_piix4 i2c_algo_bit fb_sys_fops syscopyarea rapl sysfillrect hid mii sysimgblt k10temp nvme_core ip6_tables acpi_ipmi ipmi_si ipmi_devintf ipmi_msghandler xt_conntrack nf_conntrack nf_defrag_ipv6 hed nf_defrag_ipv4 pinctrl_amd tiny_power_button button acpi_cpufreq xt_tcpudp ip6t_rpfilter ipt_rpfilter xt_pkttype nft_compat nft_counter nf_tables libcrc32c crc32c_generic nfnetlink sch_fq_codel atkbd libps2 serio zfs(PO) zunicode(PO) zzstd(O) zlua(O) zavl(PO) icp(PO) zcommon(PO) znvpair(PO) spl(O) tun tap macvlan
[   85.963176]  bridge stp llc pstore fuse drm agpgart backlight i2c_core configfs ip_tables x_tables autofs4 squashfs sd_mod t10_pi crc_t10dif crct10dif_generic xhci_pci ahci xhci_pci_renesas libahci xhci_hcd libata crct10dif_pclmul usbcore crct10dif_common crc32c_intel scsi_mod usb_common rtc_cmos dm_mod overlay loop
[   85.972139] CR2: ffffffffffffffff
[   85.973346] ---[ end trace 2d0d878cbc5f7129 ]---
[   85.973348] BUG: unable to handle page fault for address: ffffffffffffffff
[   85.973351] #PF: supervisor read access in kernel mode
[   85.974524] RIP: 0010:buf_hash_find+0xa7/0x130 [zfs]
[   85.974527] Code: 25 c0 bb 01 00 48 89 85 e8 f4 85 c0 48 8b 05 48 c1 27 00 4a 8b 04 f0 48 85 c0 74 32 48 8b 13 eb 09 48 8b 40 20 48 85 c0 74 24 <48> 39 10 75 f2 48 8b 73 08 48 39 70 08 75 e8 4c 39 68 10 75 e2 4c
[   85.976040] #PF: error_code(0x0000) - not-present page
[   85.976042] PGD 3409a15067 
[   85.977420] RSP: 0018:ffffb707473dfac8 EFLAGS: 00010286
[   85.978848] P4D 3409a15067 PUD 3409a17067 
[   85.980275] RAX: ffffffffffffffff RBX: ffff905db47a1240 RCX: 00000000000002cd
[   85.980277] RDX: 0000000000000010 RSI: 0000000000000010 RDI: ffffffffc0867b30
[   85.981765] PMD 0 
[   85.983217] RBP: 0000000000008670 R08: 9ae16a3b2f90408f R09: 9ae16a3b2f90404f
[   85.983219] R10: ffff905db51e7780 R11: 0000000000000001 R12: ffffffffc0867b30
[   85.984716] Oops: 0000 [#5] SMP NOPTI
[   85.984719] CPU: 14 PID: 2776 Comm: dp_sync_taskq Tainted: P      D    O      5.10.93 #1-NixOS
[   85.986099] R13: 000000000000026a R14: 00000000037502cd R15: 9777ffc4d387a938
[   85.986101] FS:  0000000000000000(0000) GS:ffff90ac8e140000(0000) knlGS:0000000000000000
[   85.987532] Hardware name: ASUSTeK COMPUTER INC. KRPA-U16 Series/KRPA-U16 Series, BIOS 4102 11/17/2021
[   85.987549] RIP: 0010:buf_hash_find+0xa7/0x130 [zfs]
[   85.988893] CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
[   85.988895] CR2: ffffffffffffffff CR3: 0000003409a10000 CR4: 0000000000350ee0
[   85.990326] Code: 25 c0 bb 01 00 48 89 85 e8 f4 85 c0 48 8b 05 48 c1 27 00 4a 8b 04 f0 48 85 c0 74 32 48 8b 13 eb 09 48 8b 40 20 48 85 c0 74 24 <48> 39 10 75 f2 48 8b 73 08 48 39 70 08 75 e8 4c 39 68 10 75 e2 4c
[   85.990328] RSP: 0018:ffffb7074740fac8 EFLAGS: 00010286
[   86.005559] RAX: ffffffffffffffff RBX: ffff905db53afa40 RCX: 00000000000005be
[   86.006809] RDX: 0000000000000008 RSI: 0000000000000008 RDI: ffffffffc0870860
[   86.008037] RBP: 00000000000113a0 R08: 9ae16a3b2f90408f R09: 9ae16a3b2f90404f
[   86.009243] R10: ffff905db53d1200 R11: 0000000000000001 R12: ffffffffc0870860
[   86.010467] R13: 000000000000026a R14: 00000000037675be R15: 9777ffc4d387a938
[   86.011666] FS:  0000000000000000(0000) GS:ffff90ac8e380000(0000) knlGS:0000000000000000
[   86.012912] CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
[   86.014093] CR2: ffffffffffffffff CR3: 000000014d598000 CR4: 0000000000350ee0
[   86.015304] Call Trace:
[   86.016506]  arc_freed+0x30/0xc0 [zfs]
[   86.017690]  ? zfs_blkptr_verify+0x410/0x440 [zfs]
[   86.018872]  zio_free_sync+0x4a/0xf0 [zfs]
[   86.020038]  zio_free+0xb3/0xf0 [zfs]
[   86.021202]  dsl_dataset_block_kill+0x2a6/0x4e0 [zfs]
[   86.022356]  free_blocks+0xd4/0x1c0 [zfs]
[   86.023510]  ? dnode_undirty_dbufs+0x1d0/0x1d0 [zfs]
[   86.024664]  dnode_sync_free_range+0x1e0/0x270 [zfs]
[   86.025885]  ? _cond_resched+0x16/0x40
[   86.027150]  ? dnode_undirty_dbufs+0x1d0/0x1d0 [zfs]
[   86.028361]  range_tree_walk+0xad/0x1e0 [zfs]
[   86.029549]  dnode_sync+0x2bb/0xa30 [zfs]
[   86.030765]  ? __switch_to_asm+0x42/0x70
[   86.031952]  ? __switch_to+0x114/0x3e0
[   86.033130]  ? _cond_resched+0x16/0x40
[   86.034321]  sync_dnodes_task+0x71/0xa0 [zfs]
[   86.035490]  taskq_thread+0x2df/0x520 [spl]
[   86.036661]  ? wake_up_q+0xa0/0xa0
[   86.037871]  ? taskq_thread_spawn+0x50/0x50 [spl]
[   86.039060]  kthread+0x11b/0x140
[   86.040239]  ? __kthread_bind_mask+0x60/0x60
[   86.041426]  ret_from_fork+0x22/0x30
[   86.042596] Modules linked in: ext4 crc16 mbcache jbd2 dm_crypt cbc encrypted_keys trusted tpm rng_core af_packet amd64_edac_mod edac_mce_amd edac_core amd_energy kvm_amd kvm irqbypass crc32_pclmul ghash_clmulni_intel igb ptp input_leds aesni_intel joydev ipmi_ssif cfg80211 led_class mousedev libaes ast crypto_simd drm_vram_helper pps_core rfkill cryptd hid_generic evdev mac_hid 8021q glue_helper sp5100_tco dca drm_ttm_helper ttm watchdog cdc_ether drm_kms_helper nvme usbhid usbnet i2c_piix4 i2c_algo_bit fb_sys_fops syscopyarea rapl sysfillrect hid mii sysimgblt k10temp nvme_core ip6_tables acpi_ipmi ipmi_si ipmi_devintf ipmi_msghandler xt_conntrack nf_conntrack nf_defrag_ipv6 hed nf_defrag_ipv4 pinctrl_amd tiny_power_button button acpi_cpufreq xt_tcpudp ip6t_rpfilter ipt_rpfilter xt_pkttype nft_compat nft_counter nf_tables libcrc32c crc32c_generic nfnetlink sch_fq_codel atkbd libps2 serio zfs(PO) zunicode(PO) zzstd(O) zlua(O) zavl(PO) icp(PO) zcommon(PO) znvpair(PO) spl(O) tun tap macvlan
[   86.042624]  bridge stp llc pstore fuse drm agpgart backlight i2c_core configfs ip_tables x_tables autofs4 squashfs sd_mod t10_pi crc_t10dif crct10dif_generic xhci_pci ahci xhci_pci_renesas libahci xhci_hcd libata crct10dif_pclmul usbcore crct10dif_common crc32c_intel scsi_mod usb_common rtc_cmos dm_mod overlay loop
[   86.051633] CR2: ffffffffffffffff
[   86.052983] ---[ end trace 2d0d878cbc5f712a ]---
[   86.054293] RIP: 0010:buf_hash_find+0xa7/0x130 [zfs]
[   86.055475] Code: 25 c0 bb 01 00 48 89 85 e8 f4 85 c0 48 8b 05 48 c1 27 00 4a 8b 04 f0 48 85 c0 74 32 48 8b 13 eb 09 48 8b 40 20 48 85 c0 74 24 <48> 39 10 75 f2 48 8b 73 08 48 39 70 08 75 e8 4c 39 68 10 75 e2 4c
[   86.056748] RSP: 0018:ffffb707473dfac8 EFLAGS: 00010286
[   86.057983] RAX: ffffffffffffffff RBX: ffff905db47a1240 RCX: 00000000000002cd
[   86.059241] RDX: 0000000000000010 RSI: 0000000000000010 RDI: ffffffffc0867b30
[   86.060490] RBP: 0000000000008670 R08: 9ae16a3b2f90408f R09: 9ae16a3b2f90404f
[   86.061734] R10: ffff905db51e7780 R11: 0000000000000001 R12: ffffffffc0867b30
[   86.062944] R13: 000000000000026a R14: 00000000037502cd R15: 9777ffc4d387a938
[   86.064131] FS:  0000000000000000(0000) GS:ffff90ac8e380000(0000) knlGS:0000000000000000
[   86.065331] CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
[   86.066534] CR2: ffffffffffffffff CR3: 000000014d598000 CR4: 0000000000350ee0

@flokli
Copy link
Author

flokli commented Jan 27, 2022

I initially suspected this was the dm-crypt/luks layer in between (had a zpool create -f zroot mirror /dev/mapper/root{1,2}), but even with using the raw partitions I was now able to reproduce another oops, now already when running zfs create, before even mounting and accessing files:

zpool create -f zroot mirror /dev/nvme{0,1}n1p3
zfs create -o acltype=posixacl -o xattr=sa -o encryption=aes-256-gcm -o keyformat=passphrase -o compression=lz4 -o mountpoint=none zroot/root

Produced the following one:

[  202.629848] BUG: unable to handle page fault for address: ffffffffffffffff
[  202.629883] #PF: supervisor read access in kernel mode
[  202.629900] #PF: error_code(0x0000) - not-present page
[  202.629916] PGD 177f815067 P4D 177f815067 PUD 177f817067 PMD 0 
[  202.629936] Oops: 0000 [#1] SMP NOPTI
[  202.629948] CPU: 52 PID: 3460 Comm: z_wr_int_0 Tainted: P           O      5.10.93 #1-NixOS
[  202.629973] Hardware name: ASUSTeK COMPUTER INC. KRPA-U16 Series/KRPA-U16 Series, BIOS 4102 11/17/2021
[  202.630016] RIP: 0010:buf_hash_insert+0x90/0x170 [zfs]
[  202.630032] Code: dc 27 00 48 8d 34 d8 48 8b 0e 48 85 c9 0f 84 dd 00 00 00 48 8b 55 00 48 89 c8 31 db eb 0c 48 8b 40 20 83 c3 01 48 85 c0 74 2a <48> 39 10 75 ef 48 8b 7d 08 48 39 78 08 75 e5 48 8b 7d 10 48 39 78
[  202.630080] RSP: 0018:ffffac0a8665fce0 EFLAGS: 00010246
[  202.630096] RAX: ffffffffffffffff RBX: 0000000000000000 RCX: ffffffffffffffff
[  202.630116] RDX: 0000000000000008 RSI: ffffac0abb3088c0 RDI: ffffffffc0b2c940
[  202.630136] RBP: ffff971a733f0000 R08: 9ae16a3b2f90408f R09: 9ae16a3b2f90404f
[  202.630156] R10: ffff9769436b2e40 R11: 0000000000000001 R12: ffffac0a8665fd08
[  202.630176] R13: 0000000000015480 R14: ffff971a6eece000 R15: ffff976943757ba0
[  202.630197] FS:  0000000000000000(0000) GS:ffff97694ed00000(0000) knlGS:0000000000000000
[  202.630219] CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
[  202.630237] CR2: ffffffffffffffff CR3: 000000177f810000 CR4: 0000000000350ee0
[  202.630258] Call Trace:
[  202.630283]  arc_write_done+0x120/0x420 [zfs]
[  202.630314]  zio_done+0x3fc/0x1150 [zfs]
[  202.630343]  zio_execute+0x83/0x120 [zfs]
[  202.630359]  taskq_thread+0x2df/0x520 [spl]
[  202.630375]  ? wake_up_q+0xa0/0xa0
[  202.630402]  ? zio_gang_tree_free+0x60/0x60 [zfs]
[  202.630417]  ? taskq_thread_spawn+0x50/0x50 [spl]
[  202.630433]  kthread+0x11b/0x140
[  202.630443]  ? __kthread_bind_mask+0x60/0x60
[  202.630458]  ret_from_fork+0x22/0x30
[  202.630471] Modules linked in: af_packet amd64_edac_mod edac_mce_amd edac_core amd_energy kvm_amd ast kvm drm_vram_helper drm_ttm_helper irqbypass joydev ttm crc32_pclmul ghash_clmulni_intel mousedev input_leds led_class aesni_intel ipmi_ssif igb drm_kms_helper libaes evdev cfg80211 fb_sys_fops syscopyarea hid_generic mac_hid crypto_simd usbhid ptp rfkill sysfillrect sp5100_tco pps_core cryptd glue_helper dca cdc_ether watchdog usbnet nvme 8021q rapl i2c_piix4 i2c_algo_bit nvme_core mii k10temp hid sysimgblt ip6_tables acpi_ipmi hed ipmi_si ipmi_devintf ipmi_msghandler xt_conntrack nf_conntrack nf_defrag_ipv6 pinctrl_amd nf_defrag_ipv4 tiny_power_button button acpi_cpufreq xt_tcpudp ip6t_rpfilter ipt_rpfilter xt_pkttype nft_compat nft_counter nf_tables libcrc32c crc32c_generic nfnetlink sch_fq_codel atkbd libps2 serio zfs(PO) zunicode(PO) zzstd(O) zlua(O) zavl(PO) icp(PO) zcommon(PO) znvpair(PO) spl(O) tun tap macvlan bridge stp llc drm pstore fuse agpgart backlight i2c_core configfs
[  202.630518]  ip_tables x_tables autofs4 squashfs sd_mod t10_pi crc_t10dif crct10dif_generic xhci_pci ahci xhci_pci_renesas libahci xhci_hcd libata crct10dif_pclmul crct10dif_common usbcore crc32c_intel scsi_mod usb_common rtc_cmos dm_mod overlay loop
[  202.630800] CR2: ffffffffffffffff
[  202.630811] ---[ end trace 57efbb0d81536130 ]---
[  202.630839] RIP: 0010:buf_hash_insert+0x90/0x170 [zfs]
[  202.630854] Code: dc 27 00 48 8d 34 d8 48 8b 0e 48 85 c9 0f 84 dd 00 00 00 48 8b 55 00 48 89 c8 31 db eb 0c 48 8b 40 20 83 c3 01 48 85 c0 74 2a <48> 39 10 75 ef 48 8b 7d 08 48 39 78 08 75 e5 48 8b 7d 10 48 39 78
[  202.630901] RSP: 0018:ffffac0a8665fce0 EFLAGS: 00010246
[  202.630916] RAX: ffffffffffffffff RBX: 0000000000000000 RCX: ffffffffffffffff
[  202.630937] RDX: 0000000000000008 RSI: ffffac0abb3088c0 RDI: ffffffffc0b2c940
[  202.630957] RBP: ffff971a733f0000 R08: 9ae16a3b2f90408f R09: 9ae16a3b2f90404f
[  202.630976] R10: ffff9769436b2e40 R11: 0000000000000001 R12: ffffac0a8665fd08
[  202.630997] R13: 0000000000015480 R14: ffff971a6eece000 R15: ffff976943757ba0
[  202.631017] FS:  0000000000000000(0000) GS:ffff97694ed00000(0000) knlGS:0000000000000000
[  202.631039] CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
[  202.632176] CR2: ffffffffffffffff CR3: 000000177f810000 CR4: 0000000000350ee0

I also left a memtest running on this machine over night to rule some hardware issues, which passed (and this machine has ECC RAM)

@behlendorf, do you have any idea what's going on here?

@flokli flokli changed the title Kernel BUG during NixOS installation Kernel oops on zfs create, or later when copying files Jan 27, 2022
@flokli
Copy link
Author

flokli commented Jan 27, 2022

On the same box, I created the zpool with an older ZFS version/OS:

zfs-2.0.7-1
zfs-kmod-2.0.7-1
Linux rescue 5.13.13 #1 SMP Thu Oct 28 09:11:58 UTC 2021 x86_64 GNU/Linux
Debian GNU/Linux 11 (bullseye)

I was able to proceed the installation, and after that, am now able to use the pool without problems from the more recent system (zfs-2.1.2-1, NixOS, 5.16.2). No kernel OOPses so far.

zpool status tells me some supported and requested features are not enabled on the pool, so I guess some of these new features are what's causing the issues?

@rincebrain
Copy link
Contributor

I would be surprised if it were the new features, from those stack traces, though you could always bisect between the point where 2.0-rc1 forked from master and 2.1.2.

Someone recently came around on IRC with similarly wild stacktraces and pointer dereferences, though theirs happened earlier, and it turned out the underlying issue was they had made some changes to their kernel config and recompiled the kernel, but not the modules, and it went away once they rebuilt the modules against the changed kernel. So I would wonder if a clean rebuild of 2.1.2 against your kernel might make the issues vanish? (I don't think it's incredibly likely unless you know you recently updated it, but the example was too similarly surprising to not mention.)

I'll try to think more about this one later, those are just two remarks that came quickly.

@flokli
Copy link
Author

flokli commented Feb 1, 2022 via email

@stale
Copy link

stale bot commented Feb 2, 2023

This issue has been automatically marked as "stale" because it has not had any activity for a while. It will be closed in 90 days if no further activity occurs. Thank you for your contributions.

@stale stale bot added the Status: Stale No recent activity for issue label Feb 2, 2023
@stale stale bot closed this as completed May 9, 2023
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
Status: Stale No recent activity for issue Type: Defect Incorrect behavior (e.g. crash, hang)
Projects
None yet
Development

No branches or pull requests

2 participants