NFS over RDMA causes kernel crash on aarch64 CentOS7
Hi, I recently set up NFS over RDMA on a Cavium ThunderX2 ARM server running CentOS 7. The NFSv4 server and client started without problems. However, any file operation on a client mounted with the option proto=rdma crashes the server kernel with a single-line error:
kernel:Internal error: Oops: 96000006 [#1]
Is this a known issue, or does anyone know how to debug it?
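As a first debugging step, the number in the oops line can be decoded. On arm64 that value is the exception syndrome register (ESR) contents at the time of the fault. The sketch below splits it into its main fields. The function name `decode_esr` and the small lookup tables are made up for illustration and cover only a few common encodings, so treat it as a reading aid rather than a complete decoder.

```python
# Decode the arm64 ESR value printed in "Internal error: Oops: 96000006".
# Only a handful of encodings are named here; anything else is reported as "other".

EXCEPTION_CLASSES = {
    0x20: "instruction abort from a lower exception level",
    0x21: "instruction abort, same exception level",
    0x24: "data abort from a lower exception level",
    0x25: "data abort, same exception level",
}

# Upper four bits of the fault status code; the low two bits give the table level.
FAULT_TYPES = {
    0b0000: "address size fault",
    0b0001: "translation fault",
    0b0010: "access flag fault",
    0b0011: "permission fault",
}


def decode_esr(esr: int) -> dict:
    """Split an ESR value into exception class, fault type, level and access."""
    ec = (esr >> 26) & 0x3F          # bits [31:26]: exception class
    iss = esr & 0x1FFFFFF            # bits [24:0]: instruction specific syndrome
    fsc = iss & 0x3F                 # bits [5:0]: fault status code
    fault = FAULT_TYPES.get(fsc >> 2, "other")
    return {
        "ec": ec,
        "exception_class": EXCEPTION_CLASSES.get(ec, "other"),
        "fault": fault,
        # The level is only meaningful for the fault types listed above.
        "level": (fsc & 0b11) if fault != "other" else None,
        # Bit 6 (WnR) distinguishes writes from reads; it applies to data aborts.
        "access": "write" if (iss >> 6) & 1 else "read",
    }


if __name__ == "__main__":
    for key, value in decode_esr(0x96000006).items():
        print(f"{key}: {value}")
```

For 96000006 this reports a data abort taken in kernel mode, caused by a read that hit a level 2 translation fault, which is what a read through an invalid pointer typically looks like. It does not say which code did the read; for that, the full oops text with the call trace from `dmesg` or the serial console is needed.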
Quote:
Originally Posted by Chris Lin
Hi, I recently set up NFS over RDMA on a Cavium ThunderX2 ARM server running CentOS 7. The NFSv4 server and client started without problems. However, any file operation on a client mounted with the option proto=rdma crashes the server kernel with a single-line error:
kernel:Internal error: Oops: 96000006 [#1]
Is this a known issue, or does anyone know how to debug it?
Many thanks,
Chris
Yes, it looks like a known issue. The problem is in the kernel code, so unless you're a kernel developer, I'm not sure how you would "debug" it. Note that this is a kernel oops, which isn't quite the same thing as a kernel panic: on a panic the system stops altogether.
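Whether an oops takes the whole machine down depends on the `kernel.panic_on_oops` sysctl: when it is non-zero, the kernel turns an oops into a panic. A minimal sketch for checking that setting on the server follows. The helper name `oops_escalates_to_panic` is made up for illustration, and the path is passed as a parameter so the function can be tried against a test file.

```python
from pathlib import Path


def oops_escalates_to_panic(path="/proc/sys/kernel/panic_on_oops"):
    """Return True if an oops is escalated to a panic, False if not,
    or None if the setting cannot be read (e.g. not running on Linux)."""
    try:
        value = Path(path).read_text().strip()
    except OSError:
        return None
    return value != "0"


if __name__ == "__main__":
    state = oops_escalates_to_panic()
    if state is None:
        print("could not read panic_on_oops")
    elif state:
        print("an oops will be escalated to a panic")
    else:
        print("the kernel will try to keep running after an oops")
```

If the setting is on, the "crash" described in the question would be a panic triggered by the oops rather than a separate failure.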