Como apagar un Cluster Data ONTAP de NetApp

Estimados lectores,

Vamos a ver como apagar un Cluster Data ONTAP de NetApp de forma ordenada para evitar situaciones de «Dirty Shutdown» en las que pueda ser necesario realizar recovery. De hecho he tenido un caso reciente donde ha sido necesario llevar a cabo un procedimiento para recuperar un nodo en estado degradado de manera que es importante que el sistema de almacenamiento se cierre correctamente.

El siguiente procedimiento está basado en un clúster formado por un par HA cuyo Cluster-Interconnect es Direct Attached (solución switchless).

1) Acceso al clúster mediante SSH a su IP de gestión usando, por ejemplo, Putty.

2) Desactivar HA a nivel de cluster:

ClusterDavid::> cluster ha modify -configured false
Notice: HA is disabled.

3) Desactivar Storage Failover en cada nodo:

ClusterDavid::> storage failover modify -node ClusterDavid-01 -enabled false
ClusterDavid::> storage failover modify -node ClusterDavid-02 -enabled false

4) Acceder a cada nodo mediante la Service Processor, también podemos usar Putty.

5) Lanzar el siguiente comando en el primer nodo:

ClusterDavid::> halt local -inhibit-takeover true
 (system node halt)

Warning: This operation will put node "ClusterDavid-01" out of service. Node
 "ClusterDavid-01" is holding epsilon. If no node is assigned epsilon the
 cluster will be at an increased risk of a cluster-wide data serving
 outage when only half the cluster is in service. To avoid this
 condition move epsilon to a node that will remain in service to
 maintain data serving operations.
 Use (privilege:advanced) "cluster show" to verify the node that is
 currently holding epsilon.
 To transfer epsilon to another healthy node, use the following
 commands (privilege:advanced):
 cluster modify -epsilon false -node <out-of-service-node>
 cluster modify -epsilon true -node <healthy-node>

Do you want to continue? {y|n}: y
Warning: Are you sure you want to halt node "ClusterDavid-01"? {y|n}: y

Tras el comando y la aceptación de las preguntas el nodo debería quedarse en el LOADER:

Boot Loader version 4.3
Copyright (C) 2000-2003 Broadcom Corporation.
Portions Copyright (C) 2002-2014 NetApp, Inc. All Rights Reserved.

CPU Type: Intel(R) Xeon(R) CPU C3528 @ 1.73GHz
LOADER-A>

6) Lanzar el siguiente comando sobre el nodo que queda activo:

ClusterDavid::> halt local -inhibit-takeover true -skip-lif-migration-before-shutdown
 (system node halt)

Warning: Are you sure you want to halt node "ClusterDavid-02"? {y|n}: y

Para que el nodo pueda apagarse correctamente debemos especificar que «no intente migrar los LIFs». De no hacerlo recibiremos un mensaje de error como este:

ClusterDavid::> halt local -inhibit-takeover true
 (system node halt)

Warning: Are you sure you want to halt node "ClusterDavid-02"? {y|n}: y
Error: Could not migrate LIFs away from node: invalid operation
Reissue the command with "-skip-lif-migration-before-shutdown" to skip
 the migration and continue with takeover.

Con los parámetros correctos el nodo también debería quedarse en el LOADER:

Boot Loader version 4.3
Copyright (C) 2000-2003 Broadcom Corporation.
Portions Copyright (C) 2002-2014 NetApp, Inc. All Rights Reserved.

CPU Type: Intel(R) Xeon(R) CPU C3528 @ 1.73GHz
LOADER-B>

7) Apagar las fuentes de alimentación de chasis y bandejas de discos.

Para la puesta en marcha deberemos tener en cuenta el orden de encendido de las fuentes siendo primeras las correspondientes a las bandejas de discos, esperando a la completa inicialización de los mismos, y finalmente las del chasis. Una vez iniciados los nodos el clúster debería arrancar de forma correcta y solamente nos quedaría volver a activar el HA:

::> cluster ha modify -configured true
::> storage failover modify -node ClusterDavid-01 -enabled true
::> storage failover modify -node ClusterDavid-02 -enabled true

^{This Post by David Solé Pérez is licensed under a Creative Commons Attribution-NonCommercial-ShareAlike 4.0 International License}

4.8/5 - (6 votos)

4.709 Visitas del Post

¿Te ha gustado este artículo? ¡Suscríbete!

Autor: David Solé Pérez

Padre de Paula e Ivet, entusiasta de las Tecnologías de la Información y de la Comunicación. Lee todas las entradas de David Solé Pérez

Un comentario en “Como apagar un Cluster Data ONTAP de NetApp”

hola David, interesante,

tengo un proyecto apagar la solucion de netapp que tenemos para moverlo de lugar. y andaba buscando las mejores practicas.

mi ambiente es:

3 controladoras fas2620, fas2650 y 2750
san switch
cisco MDS

algo mas que tenga que tomar en cuenta, gracias anticipadas.

Responder

Como apagar un Cluster Data ONTAP de NetApp

¿Te ha gustado este artículo? ¡Suscríbete!

Artículos Relacionados

Autor: David Solé Pérez

Un comentario en “Como apagar un Cluster Data ONTAP de NetApp”

Deja una respuesta Cancelar la respuesta