Handling L2 Watchdog Resets On The FAS 25XX Platforms
https://kb.netapp.com/onprem/ontap/hardware/Handling_L2_Watchdog_Resets_on_the_FAS_25XX_pl…
Updated: Wed, 03 May 2023 08:05:23 GMT
Applies to
• FAS 25XX systems
Issue
• Node reboots unexpectedly
• Node does not reboot after an unexpected shutdown
'NetApp provides no representations or warranties regarding the accuracy or reliability or serviceability of any information or recommendations
provided in this publication or with respect to any results that may be obtained by the use of the information or observance of any recommendations
provided herein. The information in this document is distributed AS IS and the use of this information or the implementation of any recommendations or
techniques herein is a customer's responsibility and depends on the customer's ability to evaluate and integrate them into the customer's operational
environment. This document and the information contained herein may be used solely in connection with the NetApp products discussed in this
document.'
• If the node reboots, the following error can be seen in the EMS log files:
Event.critical]: hwassist l2_watchdog_reset (29)
Record 803: Sun Mar 06 15:09:23.000822 2021 [SP.critical]: Filer Reboot
• If the node is unable to reboot, system sensors from the SP may show sensors as unavailable (na) or faulted (Fault)
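As a rough illustration of the sensor check described above, the sketch below flags sensors whose status reads `na` or `Fault`. The pipe-delimited layout, the position of the status column (`status_index`), and the sensor names are all assumptions made for this example and are not taken from actual SP output; adjust them to match what the SP really prints.

```python
# Status values named in this article as signs of trouble.
SUSPECT_STATUSES = {"na", "Fault"}

def suspect_sensors(lines, status_index=3):
    """Return (sensor_name, status) pairs whose status is na or Fault.

    Assumes one sensor per line, fields separated by '|', with the status
    in column `status_index` (hypothetical layout, adjust as needed).
    """
    flagged = []
    for line in lines:
        fields = [field.strip() for field in line.split("|")]
        if len(fields) <= status_index:
            continue  # header, separator, or unrecognized line
        status = fields[status_index]
        if status in SUSPECT_STATUSES:
            flagged.append((fields[0], status))
    return flagged

# Hypothetical sample rows, invented purely for illustration.
sample = [
    "Hypothetical_Temp | 32.000 | degrees C | ok",
    "Hypothetical_Volt | na | Volts | na",
    "Hypothetical_Fan | 0x0 | discrete | Fault",
]
flagged = suspect_sensors(sample)
```

With the sample rows above, only the second and third sensors are flagged; the first reports `ok` and is skipped.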
© 2023 NetApp.No part of this document covered by copyright may be reproduced in any form or by any means—graphic, electronic, or mechanical,
including photocopying, recording, taping, or storage in an electronic retrieval system—without prior written permission of the copyright owner. For more
information, see Legal Notices.
Cause
• Node reboots unexpectedly due to a watchdog reset
• A watchdog is an independent timer that monitors the progress of the main controller running Data ONTAP.
Its function is to restart the system automatically if it encounters an unrecoverable
system error.
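The watchdog idea can be sketched in a few lines. This is a toy model only, not the FAS 25XX implementation: the class name, the 5-second timeout, and the callback standing in for a hardware reset are all hypothetical. The monitored system must "kick" the timer regularly; if it stops making progress, the independent timer expires and triggers the reset.

```python
class Watchdog:
    """Toy watchdog: expires if it is not kicked within timeout_s seconds."""

    def __init__(self, timeout_s, on_expire):
        self.timeout_s = timeout_s
        self.on_expire = on_expire  # callback standing in for a hardware reset
        self.last_kick = 0.0
        self.expired = False

    def kick(self, now):
        """Called by the monitored system to show it is still making progress."""
        self.last_kick = now

    def poll(self, now):
        """Called by the independent timer; fires the reset once on expiry."""
        if not self.expired and now - self.last_kick > self.timeout_s:
            self.expired = True
            self.on_expire(now)
        return self.expired


resets = []  # records the times at which a reset was triggered
wd = Watchdog(timeout_s=5.0, on_expire=resets.append)

wd.kick(now=1.0)
healthy = wd.poll(now=4.0)  # 3 s since the last kick: within the timeout
hung = wd.poll(now=9.5)     # 8.5 s since the last kick: the watchdog expires
```

Time is passed in explicitly rather than read from a clock so the behavior is easy to follow: the first poll finds the system healthy, and the second records a reset because no kick arrived in time.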
Solution
1. Collect the following SP Logs
system log
events all
sp status -d
system sensors
2. Review the logs for abnormalities around the timestamp of the L2 watchdog reset event
Replacement Criteria
• Node unresponsive and unable to collect BMC logs: no logs can be collected from an unresponsive BMC. Attempt a PCM reseat; the PCM may need to be replaced.
3. Any further scenarios may require the review of additional logs to determine what course of action to take, and may require a support case.
Contact NetApp Technical Support or log into the NetApp Support Site to create a case, and reference this article for further assistance.
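For the log review in step 2, a small script can narrow saved SP event output to the entries around the reset. The sketch below assumes every record follows the shape of the `Record 803` line shown in this article; the 10-minute window, the function names, and the second sample line are hypothetical, and real logs may contain other line formats that this pattern simply skips.

```python
import re
from datetime import datetime, timedelta

# "Record <n>: <weekday> <Mon DD HH:MM:SS.ffffff YYYY> [<source.severity>]: <message>"
# The weekday is matched but not parsed; the date alone fixes the timestamp.
RECORD_RE = re.compile(
    r"^Record (\d+): \w{3} (\w{3} \d{2} \d{2}:\d{2}:\d{2}\.\d{6} \d{4}) "
    r"\[([^\]]+)\]: (.*)$"
)

def parse_record(line):
    """Return (number, timestamp, tag, message), or None if the line does not match."""
    match = RECORD_RE.match(line.strip())
    if match is None:
        return None
    number, stamp, tag, message = match.groups()
    when = datetime.strptime(stamp, "%b %d %H:%M:%S.%f %Y")
    return int(number), when, tag, message

def records_near(lines, reset_time, window=timedelta(minutes=10)):
    """Keep only records whose timestamp is within `window` of reset_time."""
    nearby = []
    for line in lines:
        record = parse_record(line)
        if record is not None and abs(record[1] - reset_time) <= window:
            nearby.append(record)
    return nearby

sample = [
    "Record 803: Sun Mar 06 15:09:23.000822 2021 [SP.critical]: Filer Reboot",
    # Hypothetical entry from the next day, added only to show filtering.
    "Record 900: Mon Mar 07 09:00:00.000000 2021 [SP.notice]: Hypothetical later entry",
]
reset_time = datetime(2021, 3, 6, 15, 9, 0)
nearby = records_near(sample, reset_time)
```

Here only record 803 falls inside the window; the later hypothetical entry is filtered out.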
Additional Information
• Handling watchdog resets (WDR)